基于改进YOLOv8的煤矿井下无人运输车辆目标检测研究

Research on target detection of unmanned vehicles in coal mines based on improved YOLOv8

  • 摘要: 针对煤矿井下无人运输车作业场景中存在的低光照、多粉尘及目标形态多变等因素导致传统检测模型易出现漏检、错检和实时性不足的问题,提出了一种高效精准的煤矿井下无人运输车辆目标检测模型——YOLOv8-CBD。首先,结合Dual思想融合CSPNet与HetConv,提出了一种新的CSPConv模块,用于替代原YOLOv8网络中的C2f模块,在保证模型检测精度的同时,降低了模型的计算复杂度;其次,引入加权双向特征金字塔网络(BiFPN)来改进YOLOv8的颈部网络,根据不同的特征重要性分配不同的权重,有效地融合了不同尺度的目标特征,保证目标物特征信息的高效利用;同时,将传统的检测头替换为Dynamic Head,使检测头在复杂背景中的表现得到显著提升,强化模型对小目标的检测能力;最后,使用Focaler-IoU代替CIoU,重构IoU损失的线性区间映射,实现模型损失值在不同难度回归样本间的动态调节,增强模型在复杂煤矿井下环境中的定位和检测能力。利用煤矿井下无人运输车辆目标检测数据集对所提出的YOLOv8-CBD模型进行测试,实验结果表明:YOLOv8-CBD模型的mAP@50达到91.5%、mAP@50:95达到65.9%、Recall达到80.9%、F1-score达到0.861、FPS达到106,相较于基线模型YOLOv8分别提升了3.3%、4.7%、2.9%、3.1%和16%;其参数量仅有2.63M,计算复杂度为6.4GFLOPs,相较基线模型YOLOv8n分别下降了12%和22%。与Faster-Rcnn、SSD、YOLOv5n、YOLOv6n、YOLOv7-tiny、YOLOv8s、RT-DETR、Deformable-DETR检测模型相比,YOLOv8-CBD在精度、召回率、参数量以及计算复杂度等各项指标上均优于其他模型。YOLOv8-CBD在提高检测精度的前提下,使得模型更加轻量化,拥有更少的参数量与计算量,更加适合部署在煤矿井下作业的各种无人运输车上,有效解决了因光照不足、烟雾以及粉尘影响引起的无人运输车目标检测困难的问题。

     

    Abstract: In underground coal mine unmanned vehicle operations, traditional object detection models often suffer from issues such as missed detection, false detection, and insufficient real-time performance due to factors like low light, high dust, and varying object shapes. To address these challenges, a highly efficient and accurate object detection model for coal mine underground unmanned vehicles, named YOLOv8-CBD, is proposed in this paper. First, the Dual approach is applied to fuse CSPNet and HetConv, and a new CSPConv module is introduced to replace the C2f module in the original YOLOv8 network, reducing computational complexity while maintaining detection accuracy. Second, a weighted bidirectional feature pyramid network (BiFPN) is introduced to improve the neck network of YOLOv8. Different weights are assigned based on the importance of various features, effectively fusing target features across multiple scales and ensuring efficient utilization of target feature information. Additionally, the traditional detection head is replaced with a Dynamic Head, which significantly enhances the model's performance in complex backgrounds and strengthens its ability to detect small targets. Finally, CIoU is replaced with Focaler-IoU, reconstructing the linear interval mapping of the IoU loss and enabling dynamic adjustment of the model's loss value across regression samples of varying difficulty. This enhances the model's localization and detection capability in complex underground environments. The proposed YOLOv8-CBD model was tested on the coal mine underground unmanned vehicle object detection dataset. The experimental results show that YOLOv8-CBD achieves a mAP@50 of 91.5%, mAP@50:95 of 65.9%, and a Recall of 80.9%, F1-score of 0.861, FPS of 106, which represents improvements of 3.3%, 4.7%, and 2.9%, 3.1% and 16%, respectively, compared to the baseline model YOLOv8. Its parameter count is only 2.63M, and its computational complexity is 6.4 GFLOPs, which are 12% and 22% lower than the baseline model YOLOv8n, respectively. Compared with Faster-RCNN, SSD, YOLOv5n, YOLOv6n, YOLOv7-tiny, and YOLOv8s detection models, as well as RT-DETR and Deformable-DETR, YOLOv8-CBD outperforms all in terms of accuracy, recall, parameter size, and computational complexity. YOLOv8-CBD not only improves detection accuracy but also reduces model size and computational load, making it more suitable for deployment on various unmanned vehicles operating in underground coal mine environments. This model effectively addresses the challenges in object detection caused by insufficient lighting, smoke, and dust in such environments.

     

/

返回文章
返回