Advance Search
CHEN Wei,JIANG Zhicheng,TIAN Zijian,et al. Unsafe action detection algorithm of underground personnel in coal mine based on YOLOv8[J]. Coal Science and Technology,2024,52(S2):267−283. DOI: 10.12438/cst.2023-1772
Citation: CHEN Wei,JIANG Zhicheng,TIAN Zijian,et al. Unsafe action detection algorithm of underground personnel in coal mine based on YOLOv8[J]. Coal Science and Technology,2024,52(S2):267−283. DOI: 10.12438/cst.2023-1772

Unsafe action detection algorithm of underground personnel in coal mine based on YOLOv8

More Information
  • Received Date: November 22, 2023
  • Available Online: February 20, 2025
  • There are problems such as interference information, low illumination and mechanical equipment occlusion in the complex environment of underground coal mine, which makes the speed and accuracy of the existing object detection algorithm have a series of challenges when carrying out the task of personnel unsafe action detection. An improved YOLOv8l method, called MAC-YOLO, is proposed to solve the problems of complex computation, large number of parameters, long inference time and difficult feature extraction in existing object detection models. By replacing the convolution in the original baseline model with the receptive field attention convolution (RFAConv), the MAC-YOLO model allows the model to dynamically adjust the receptive field weight according to the complexity and importance of the input data, and solve the parameter sharing problem in the standard convolution operation, so that the network can more effectively capture and utilize the information in the image. At the same time, an efficient multi scale attention (EMA) module is introduced into the baseline model, which can integrate context information of different scales, and learn effective channel descriptions without channel dimensionality reduction during convolution operation, so that the model can produce better pixel-level attention to high-level feature maps. It also can capture the inter-dimensional interaction and establish the dependency between dimensions, so that the huge local receptor field of neurons can efficiently obtain clearer multi scale features, reduce the influence of interference factors in the image, further improve the focusing ability of the model on object features, and help the model to efficiently carry out convolutional operations to extract abnormal actions of personnel in the coal mine. Improve the detection accuracy of the model. In addition, the boundary box regression loss function (LMPDIoU) is introduced to directly minimize the distance between the upper left point and the lower right point of the predicted box and the real box, which solves the problem that the model cannot be optimized effectively when the original loss function has the same aspect ratio of the predicted box and the real box (the value is different), accelerates the convergence speed of the model and improves the positioning accuracy. In order to reduce the computational complexity of the model, the complexity of the network structure, and enhance the flexibility of the network, it uses the slim-neck design paradigm to transform the neck of the baseline model, enhances the ability to handle network characteristics through the GSbottleneck module, and improves the learning ability of the model through GSConv module stacking. The VoV-GSCSP module improves the feature utilization efficiency and network performance. The experimental results show that in the scenario-specific coal miner action dataset (MACD), compared with the baseline model YOLOv8l, the mAP@0.5 and mAP@0.5:0.95 of MAC-YOLO increased by 1.9% and 3.6%, respectively, and the FPS value is 81ms. This shows that the MAC-YOLO model meets the needs of real-time and lightweight models while maintaining good detection accuracy, demonstrating high flexibility, accuracy and efficiency. In addition, the effectiveness of each improved module to improve the performance of the model is proved by the ablation experiments.

  • [1]
    ZENG N Y,WU P S,WANG Z D,et al. A small-sized object detection oriented multi-scale feature fusion approach with application to defect detection[J]. IEEE Transactions on Instrumentation and Measurement,2022,71:3507014.
    [2]
    GOU S P,WANG X L,MAO S S,et al. Weakly-supervised semantic feature refinement network for MMW concealed object detection[J]. IEEE Transactions on Circuits and Systems for Video Technology,2023,33(3):1363−1373. doi: 10.1109/TCSVT.2022.3210931
    [3]
    LENG J X,MO M,ZHOU Y H,et al. Pareto refocusing for drone-view object detection[J]. IEEE Transactions on Circuits and Systems for Video Technology,2023,33(3):1320−1334. doi: 10.1109/TCSVT.2022.3210207
    [4]
    YE T,ZHANG J,LI Y W,et al. CT-net:An efficient network for low-altitude object detection based on convolution and transformer[J]. IEEE Transactions on Instrumentation and Measurement,2022,71:2507412.
    [5]
    REN S Q,HE K M,GIRSHICK R,et al. Faster R-CNN:Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137−1149. doi: 10.1109/TPAMI.2016.2577031
    [6]
    HE K M,GKIOXARI G,DOLLÁR P,et al. Mask R-CNN[C]//2017 IEEE International Conference on Computer Vision (ICCV). Piscataway,NJ:IEEE,2017:2980−2988.
    [7]
    郭永存,童佳乐,王爽. 井下无人驾驶电机车行驶场景中多目标检测研究[J]. 工矿自动化,2022,48(6):56−63.

    GUO Yongcun,TONG Jiale,WANG Shuang. Research on multi-object detection in driving scene of underground unmanned electric locomotive[J]. Journal of Mine Automation,2022,48(6):56−63.
    [8]
    CHEN X Y,LI H L,WU Q B,et al. Bal-R2CNN:High quality recurrent object detection with balance optimization[J]. IEEE Transactions on Multimedia,2021,24:1558−1569.
    [9]
    REDMON J,DIVVALA S,GIRSHICK R,et al. You only look once:Unified,real-time object detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway,NJ:IEEE,2016:779−788.
    [10]
    LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single shot MultiBox detector[EB/OL]. (2015-12-08) [2023−10−02]. https: //arxiv.org/abs/1512.02325v5.

    LIU W,ANGUELOV D,ERHAN D,et al. SSD:Single shot MultiBox detector[EB/OL]. (2015-12-08) [2023−10−02]. https://arxiv.org/abs/1512.02325v5.
    [11]
    LIN T Y,GOYAL P,GIRSHICK R,et al. Focal loss for dense object detection[C]//2017 IEEE International Conference on Computer Vision (ICCV). Piscataway,NJ:IEEE,2017:2999−3007.
    [12]
    REDMON J,FARHADI A. YOLO9000:Better,faster,stronger[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway,NJ:IEEE,2017:6517−6525.
    [13]
    REDMON J, FARHADI A. YOLOv3: An incremental improvement[EB/OL]. (2018−04−08) [2023−10−02]. https: //arxiv.org/abs/1804.02767v1.

    REDMON J,FARHADI A. YOLOv3:An incremental improvement[EB/OL]. (2018−04−08) [2023−10−02]. https://arxiv.org/abs/1804.02767v1.
    [14]
    BOCHKOVSKIY A, WANG C Y, LIAO H M. YOLOv4: Optimal speed and accuracy of object detection[EB/OL]. (2020−04−23) [2023−10−02]. https: //arxiv.org/abs/2004.10934v1.

    BOCHKOVSKIY A,WANG C Y,LIAO H M. YOLOv4:Optimal speed and accuracy of object detection[EB/OL]. (2020−04−23) [2023−10−02]. https://arxiv.org/abs/2004.10934v1.
    [15]
    WANG C Y,BOCHKOVSKIY A,LIAO H M. YOLOv7:Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway,NJ:IEEE,2023:7464-7475.
    [16]
    CHEN W,LI Y,TIAN Z J,et al. 2D and 3D object detection algorithms from images:A Survey[J]. Array,2023,19:100305. doi: 10.1016/j.array.2023.100305
    [17]
    任国强,韩洪勇,李成江,等. 基于FastYOLOv3算法的煤矿胶带运输异物检测[J]. 工矿自动化,2021,47(12):128−133.

    REN Guoqiang,HAN Hongyong,LI Chengjiang,et al. Foreign object detection in coal mine belt transportation based on FastYOLOv3 algorithm[J]. Industry and Mine Automation,2021,47(12):128−133.
    [18]
    郝明月,闵冰冰,张新建,等. 基于改进YOLOv5s的矿工排队检测方法[J]. 工矿自动化,2023,49(11):160−166.

    HAO Mingyue,MIN Bingbing,ZHANG Xinjian,et al. A miner queue detection method based on improved YOLOv5s[J]. Journal of Mine Automation,2023,49(11):160−166.
    [19]
    GE Z,LIU S T,LI Z M,et al. OTA:Optimal transport assignment for object detection[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE,2021:303−312.
    [20]
    FENG C J,ZHONG Y J,GAO Y,et al. TOOD:Task-aligned one-stage object detection[C]//2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway,NJ:IEEE,2021:3490−3499.
    [21]
    LYU C Q, ZHANG W W, HUANG H A, et al. RTMDet: An empirical study of designing real-time object detectors[EB/OL]. (2022−12−14) [2023−10−02]. https: //arxiv.org/abs/2212.07784v2.

    LYU C Q,ZHANG W W,HUANG H A,et al. RTMDet:An empirical study of designing real-time object detectors[EB/OL]. (2022−12−14) [2023−10−02]. https://arxiv.org/abs/2212.07784v2.
    [22]
    陈伟,任鹏,田子建,等. 基于注意力机制的无监督矿井人员跟踪[J]. 煤炭学报,2021,46(S1):601−608.

    CHEN Wei,REN Peng,TIAN Zijian,et al. Unsupervised mine personnel tracking based on attention mechanism[J]. Journal of China Coal Society,2021,46(S1):601−608.
    [23]
    ZHANG X, LIU C, YANG D G, et al. RFAConv: Innovating spatial attention and standard convolutional operation[EB/OL]. (2023−04−06) [2023−10−02]. https: //arxiv.org/abs/2304.03198v6.

    ZHANG X,LIU C,YANG D G,et al. RFAConv:Innovating spatial attention and standard convolutional operation[EB/OL]. (2023−04−06) [2023−10−02]. https://arxiv.org/abs/2304.03198v6.
    [24]
    OUYANG D L,HE S,ZHANG G Z,et al. Efficient multi-scale attention module with cross-spatial learning[C]//ICASSP 2023 - 2023 IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP). Piscataway,NJ:IEEE,2023:1−5.
    [25]
    LI H, LI J, WEI H, et al. Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles[EB/OL]. (2022−06−06) [2023−10−02]. http: //export.arxiv.org/abs/2206.02424.

    LI H,LI J,WEI H,et al. Slim-neck by GSConv:A better design paradigm of detector architectures for autonomous vehicles[EB/OL]. (2022−06−06) [2023−10−02]. http://export.arxiv.org/abs/2206.02424.
    [26]
    SILIANG M, YONG X. MPDIoU: A Loss for Efficient and Accurate Bounding Box Regression[EB/OL]. (2022−07−14) [2023−10−02]. https: //arxiv.org/abs/2307.07662.

    SILIANG M,YONG X. MPDIoU:A Loss for Efficient and Accurate Bounding Box Regression[EB/OL]. (2022−07−14) [2023−10−02]. https://arxiv.org/abs/2307.07662.
    [27]
    YANG W J,ZHANG X H,MA B,et al. An open dataset for intelligent recognition and classification of abnormal condition in longwall mining[J]. Scientific Data,2023,10(1):416. doi: 10.1038/s41597-023-02322-9
  • Related Articles

    [1]YAO Hui, YIN Huichao, YIN Shangxian, HOU Enke, BI Meng, LIAN Huiqing, XIA Xiangxue, LIANG Manyu. Developing of the evaluation of water inrush risk from coal seam floor[J]. COAL SCIENCE AND TECHNOLOGY, 2024, 52(S1): 183-191. DOI: 10.12438/cst.2023-0346
    [2]LYU Yuguang, QIAO Wei, HU Falun, LIU Mengnan, LYU Bo. Study on evaluation technology of coal seam roof water hazard risk with protection coefficient[J]. COAL SCIENCE AND TECHNOLOGY, 2024, 52(3): 180-188. DOI: 10.12438/cst.2023-0992
    [3]ZENG Yifan, WU Qiang, ZHAO Suqi, MIAO Yaowu, ZHANG Ye, MEI Aoshuang, MENG Shihao, LIU Xiaoxiu. Characteristics, causes, and prevention measures of coal mine water hazard accidents in China[J]. COAL SCIENCE AND TECHNOLOGY, 2023, 51(7): 1-14. DOI: 10.13199/j.cnki.cst.2023-0500
    [4]GUO Xiaoming, WANG Hao, ZHOU Linsheng. Evaluation of spatial water enrichment of ultra-thick bedrockaquifer in coal seam roof[J]. COAL SCIENCE AND TECHNOLOGY, 2021, 49(9): 167-175.
    [5]Hu Weiyue Zhou Jjianjun, . Discussion on some confused key concepts used in mine water disater control and protection[J]. COAL SCIENCE AND TECHNOLOGY, 2017, (8).
    [6]Zhai Minghua Liu Rentai Sha Fei Bai Jiwen, . Mechanism and prevention and control key technology of hysteretic water inrush from fault of coal mining face in deep underground mine[J]. COAL SCIENCE AND TECHNOLOGY, 2017, (8).
    [7]LI Hong-jie CHEN Qing-tong MU Yi, . Research on Prevention and Mechanism of Roof Water Inrush Under Low Permeability Aquifer for Thick Coal Seam[J]. COAL SCIENCE AND TECHNOLOGY, 2014, (10).
    [8]Study on Prevention Method and Analysis of Water Inrush Sources in Construction Mine[J]. COAL SCIENCE AND TECHNOLOGY, 2013, (8).
    [9]Analysis on Suitability of Water Prevention and Control Measures with Water Pumping and Pressure Releasing in Underground Mine[J]. COAL SCIENCE AND TECHNOLOGY, 2012, (11).
    [10]Mine Water Disaster Prevention and Control Method Under Complicated Geological Conditions in Huayingshan Mine[J]. COAL SCIENCE AND TECHNOLOGY, 2011, (3).

Catalog

    Article views (65) PDF downloads (32) Cited by()
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return