高级检索

改进YOLOv7和DeepSORT的井下人员检测与跟踪算法

Underground personnel detection and tracking using improved YOLOv7 and DeepSORT

  • 摘要: 煤炭行业正在经历以“安全、高效、智能、绿色”为核心的智能矿山开采理念的变革,而计算机视觉作为一种高效、智能、低成本的新兴技术,已经成为当下智能矿山建设中的重要亮点。针对井下监控视频容易受到人工光源、粉尘和喷雾等干扰因素的影响,导致现有基于计算机视觉的井下人员检测方法存在实时性差、漏检和误检率高以及跟踪精度差的问题,提出了一种改进YOLOv7和DeepSORT的井下人员检测与跟踪算法。首先,为了提取到更为关键的井下人员图像特征,提高模型在煤矿井下复杂场景中的适应能力,在YOLOv7的Neck模块中融入SimAM注意力机制,并采用改进后的YOLOv7模型检测井下人员目标;然后,为了在降低模型参数量和网络复杂度的同时,进一步提高人员目标的跟踪精度,在DeepSORT的特征提取网络中引入ShuffleNetV2轻量化模块,并采用改进的DeepSORT模型对井下人员目标进行编码跟踪;最后,在已构建的井下人员视频图像数据集与公开数据集上对所述算法进行试验验证。结果表明:改进YOLOv7模型的平均检测精度相比YOLOv7模型提高了3.9%;改进DeepSORT模型的人员目标跟踪准确率达到了74.9%,跟踪精确度达到了82.3%,速度达到了24 FPS。相较于YOLOv7-DeepSORT算法,本文所述算法的网络参数量减少了36%,显著提高了井下多人员目标检测与跟踪的实时性能,可望部署于井下智能边缘计算监测平台。

     

    Abstract: The coal industry is undergoing a transformation in the concept of intelligent mining with "safety, efficiency, intelligence, and green" as its core. Computer vision, as an emerging technology with high efficiency, intelligence, and low cost, has become an important highlight in the current construction of intelligent mines. The underground monitoring video is susceptible to interference factors such as artificial light source, dust and spray, which leads to the poor real-time performance, high missed & false detection rates and poor tracking accuracy of existing underground personnel monitoring methods using computer vision. Based on this, an improved YOLOv7 and DeepSORT underground personnel detection and tracking algorithm is proposed. First, in order to be able to extract more critical underground personnel image features and improve the model's adaptability in the complex scene of coal mine underground, the SimAM attention mechanism is incorporated into the Neck module of YOLOv7, and the improved YOLOv7 model is used to detect underground personnel targets. After that, in order to be able to further improve the tracking accuracy of personnel targets while reducing the number of model parameters and network complexity, the ShuffleNetV2 lightweight module is introduced into the feature extraction network of DeepSORT, and the improved DeepSORT model is used to encode and track downhole personnel targets. Finally, the experimental validation of the proposed algorithm is carried out on the established downhole personnel video image dataset and the public dataset, and the results show that: the average detection accuracy of the improved YOLOv7 model is improved by 3.9% compared with that of the YOLOv7 model; the multi-target tracking accuracy of the improved DeepSORT algorithm reaches 74.9%, the tracking accuracy reaches 82.3%, and the speed reaches 24 FPS. Compared with YOLOv7-DeepSORT algorithm, the number of network parameters of the improved algorithm is reduced by 36%, significantly improving the real-time performance of underground multi personnel target detection and tracking, and is expected to be deployed in the underground intelligent edge computing monitoring platform.

     

/

返回文章
返回