高级检索

基于RPD-UNet的露天矿山爆堆图像分割方法

Image segmentation method for blast pile in open-pit mine based on RPD-UNet model

  • 摘要: 在露天矿山爆堆图像分割方法研究中,针对爆堆图像背景复杂、目标尺度差异大以及边缘模糊等特点导致的传统分割方法精度低、泛化能力差的问题,提出了一种基于RPD-UNet的露天矿山爆堆图像分割模型,以实现爆堆图像的高精度分割。RPD-UNet模型以编码器−解码器结构的U-Net为基本框架,引入预训练的ResNet34作为骨干特征提取网络,其深层的残差结构,可缓解块度图像背景复杂、纹理多样导致的深层特征提取困难与梯度消散问题;针对复杂光照、阴影遮挡及前景背景相似造成的关键特征模糊与噪声干扰等问题,提出了并行化的空间维度和通道维度的注意力机制(Parallel-CBAM),并行化处理可同时处理通道和空间信息,增强模型对关键区域的聚焦,使模型能更精准地锁定爆堆图像细节特征;针对模型计算复杂度高、模型计算效率低的问题,采用深度可分离卷积(Depthwise Separable Convolution, DSC)替代标准卷积操作,在不损失甚至提升模型能力的前提下,解决了模型计算效率低下的问题。为验证提出的RPD-UNet模型的有效性,将爆堆识别结果与主流分割模型的结果进行对比,结果表明:在分割精度方面,RPD-UNet的mIoU、mPA、Precision及F1分数分别为89.90%、94.54、94.52%及94.51%;在推理效率方面,单张图像处理时间为91 ms,均优于常见主流分割模型。RPD-UNet模型提升了分割精度和泛化能力,同时保证了模型的计算效率,为露天矿山的智能识别与爆破质量的评估提供了参考。

     

    Abstract: In the research of image segmentation methods for open-pit mine blast piles, aiming at the problems of low accuracy and poor generalization ability of traditional segmentation methods caused by the characteristics of blasting pile image, such as complex background, large target scale difference and fuzzy edge, an open-pit mine blast piles image segmentation methods based on RPD-UNet is proposed to realize high-precision segmentation of blast piles. The RPD-UNet model takes u-net of encoder decoder structure as the basic framework, and introduces the pre trained resnet34 as the backbone feature extraction network. Its deep residual structure can alleviate the difficulties of deep feature extraction and gradient dissipation caused by the complex background and texture diversity of block image. Aiming at the problems of key feature ambiguity and noise interference caused by complex lighting, shadow occlusion and similar foreground and background, a parallel CBAM module operating on both spatial and channel dimensions is proposed. The parallel processing can simultaneously process the channel and spatial information, enhance the focus of the model on the key areas, and make the model more accurately lock the blast pile and detail features. Aiming at the problems of high computational complexity and low computational efficiency of the model, the deep separable convolution (DSC) is used to replace the standard convolution operation, which solves the problem of low computational efficiency of the model without losing or even improving the ability of the model. In order to verify the effectiveness of the proposed RPD-UNet model, the results of explosive pile segmentation were compared with the results of mainstream segmentation models. The comparison results showed that the mIoU and mPA and Precision and F1-score of RPD-UNET were 89.90%、94.54%、94.52% and 94.51%, respectively; In terms of reasoning efficiency, the processing time of a single image is 91 ms, which is better than the mainstream segmentation model. The RPD-UNet model not only improves the segmentation accuracy and generalization ability, but also ensures the computational efficiency of the model. The model provides a reference for the intelligent identification of blast pile in open-pit mines and the assessment of blasting quality.

     

/

返回文章
返回