高级检索

井下煤炭运输多环节复杂背景下高精度煤矸识别方法

High-precision coal gangue recognition method in multi-stage coal transportation with complex backgrounds underground

  • 摘要: 煤矸智能分选是发展煤炭智能绿色开采的关键技术之一。准确识别煤矸石是智能分选的先决条件,如何克服井下复杂环境的影响,已成为影响机器视觉识别效果的现实难题。针对井下煤炭运输系统,构建具有单一、二混合和三混合背景的11类图像数据集,包含水渍、煤粉、碎煤与碎矸石及运输设备部件等背景干扰因素。提出了一种融合多种注意力与残差连接的高精度煤矸智能识别方法,高分辨率阶段使用残差卷积块快速生成高质量标记(token),后续阶段使用级联的Channel Spatial Swin Transformer Block (CSSTB)进行深层表征学习。为提升模型对背景噪声的抗干扰能力,网络集成全局、通道、空间多种注意力机制,增强特征表达的鲁棒性。CSSTB中利用基于负斜率特性强化稀疏激活的LeakyReLU线性注意力机制建模全局信息,通过卷积注意力机制模块(CBAM)优化模型注意力分布,提升模型泛化能力。此外,考虑到煤、矸石和输送设备零部件的尺度差异,跨阶段使用残差连接以增强多尺度特征的通信和信息流通。结果表明:所提模型在单一、二混合和三混合背景上的平均准确度达到95.06%、97.77%、95.65%,相较于基线网络Swin Transformer-Tiny分别提高7.01%、4.83%、1.03%。可视化试验表明,对比模型在水渍、暗光和反光等复杂背景干扰下难以准确区分煤和矸石,而所提出模型能够精准聚焦于煤和矸石的关键特征区域,抗干扰能力强。研究结果为井下原煤运输中的煤矸高效分选提供了理论参考。

     

    Abstract: Intelligent coal gangue sorting is one of the key technologies for advancing intelligent and green coal mining. Accurate identification of coal gangue is a prerequisite for intelligent sorting, and overcoming the challenges posed by complex underground environments has become a critical issue affecting machine vision recognition. For underground coal transportation systems, we constructed an 11-class image dataset with single, dual-mixed, and triple-mixed backgrounds, including interference factors such as water stains, coal dust, broken coal, broken gangue, and components of transportation equipment. We propose a high-precision coal gangue recognition method that integrates multiple attention mechanisms and residual connections. In the high-resolution stage, residual convolution blocks are used to quickly generate high-quality tokens, and in the subsequent stages, deep feature representation learning is performed using a cascaded Channel Spatial Swin Transformer Block (CSSTB). To improve the model's robustness against background noise, the network incorporates global, channel, and spatial attention mechanisms, enhancing feature expression. The CSSTB leverages a LeakyReLU-based linear attention mechanism to model global information, strengthening sparse activation through its negative slope characteristics, while the Convolutional Block Attention Module (CBAM) is utilized to optimize attention distribution and improve model generalization. Additionally, considering the scale differences between coal, gangue, and equipment components, residual connections are applied across stages to enhance communication and information flow between multi-scale features. The results show that the proposed model achieves average accuracies of 95.06%, 97.77%, and 95.65% on single, dual-mixed, and triple-mixed backgrounds, respectively, representing improvements of 7.01%, 4.83%, and 1.03% compared to the baseline Swin Transformer-Tiny network. Visualization experiments demonstrate that, unlike the baseline model, which struggles to accurately distinguish between coal and gangue under complex background interferences such as water stains, low light, and reflections, the proposed model can precisely focus on key feature regions of coal and gangue, exhibiting strong anti-interference capabilities. The findings provide a theoretical reference for efficient coal-gangue sorting in underground coal transportation.

     

/

返回文章
返回