高级检索

基于自适应低照度图像增强的煤矸实时分割方法

Real-time segmentation method of coal gangue based on adaptive low illumination image enhancement

  • 摘要: 针对煤矿井下人造光源亮度低、煤矸分布密集等不利环境因素,导致现有煤矸检测算法存在特征提取困难和定位精度低等问题,提出一种基于自适应低照度图像增强的煤矸实时分割方法,其主要由图像增强模块CG-IENet(Coal Gangue Image Enhancement Network)和目标分割模块CG-TSNet(Coal Gangue Target Segmentation Network)两个部分组成。图像增强模块基于CycleGAN网络模型框架,通过注意力机制对生成器网络中的编码器、特征提取网络和解码器进行优化设计,同时采用PatchGAN作为判别器网络,以实现对低照度煤矸图像的自适应画质增强,从而更好地保留煤矸表面纹理信息。目标分割模块基于DeepLabV3+网络模型框架,在编码器中引入轻量化主干网络MobileNetV2和颈部网络SPS-ASPP模块,在解码器中结合特征融合模块、注意力机制和动态上采样,在提升模型分割精度的同时有效减少模型复杂程度,使其具有更快的处理速度和更低的能耗,便于边缘式计算设备的部署。试验结果表明:CG-IENet在峰值信噪比、结构相似度、信息保真度准则、信息熵和灰度均值指标上与其他低照度图像增强模型相比分别平均提升了47.86%、61.20%、14.30%、31.77%和23.68%,平均色度误差最低为1.18,同时采用三维灰度分布图和灰度直方图对各类模型进行直观对比分析,最终表明CG-IENet增强效果最佳,其能够更好提高图像亮度,避免色彩失真同时保留图像细节信息;CG-TSNet在保证模型内存为4.567 MB的基础上,与HRNet、UNet和PSPNet等其他4种语义分割模型相比,平均交并比、类别平均像素准确率、像素准确率和F1分数指标上均为最高,分别为91.76%、95.84%、96.85%和95.04%,综合检测性能最佳。该模型能够适应煤矿井下复杂工况,从而实现煤矸精准高效分选。

     

    Abstract: Aiming at the unfavorable environmental factors such as low brightness of artificial light source and dense distribution of coal gangue in coal mine underground, which lead to the problems of feature extraction difficulty and low localization accuracy of existing coal gangue detection algorithms, a real-time segmentation method of coal gangue based on adaptive low illumination image enhancement is proposed, which consists of CG-IENet (Coal Gangue Image Enhancement Network) and CG-TSNet (Coal Gangue Target Segmentation Network). The CG-IENet is based on the CycleGAN network model framework, which optimizes the design of the encoder, feature extraction network and decoder in the generator network through the attention mechanism, and adopts PatchGAN as the discriminator network to achieve adaptive picture quality enhancement of low illumination coal gangue images, so as to better retain the surface texture information of the coal gangue. The CG-TSNet is based on the DeepLabV3 + network model framework, which introduces the lightweight backbone network MobileNetV2 and the neck network SPS-ASPP module in the encoder, and combines the feature fusion module, the attention mechanism, and the dynamic sampling in the decoder, which improves the model segmentation accuracy and reduces the model complexity effectively at the same time, it enables faster processing speeds and lower energy consumption for easy deployment of edge-based computing devices. The experimental results show that CG-IENet improves 47.86%, 61.20%, 14.30%, 31.77% and 23.68% on average in the peak signal-to-noise ratio, structural similarity index, information fidelity criterion, entropy and gray mean, respectively, compared with the other low illumination image enhancement models, and the mean chromaticity error is as low as 1.18. The three-dimensional gray scale distribution graph and gray scale histogram are used to visually compare and analyze the various models, which finally shows that CG-IENet has the best enhancement effect, which can better improve the brightness of the image to avoid color distortion while retaining the detailed information of the image. CG-TSNet has the highest mean intersection over union, mean pixel accuracy, pixel accuracy, and F1-Score (F1) metrics, 91.76%, 95.84%, 96.85%, and 95.04%, respectively, and the best overall detection performance when compared to the other four semantic segmentation models, such as HRNet, UNet, and PSPNet, on the basis of the guaranteed model memory of 4.567 MB. The model can be adapted to the complex working conditions in underground coal mines, thus realizing accurate and efficient sorting of coal gangue.

     

/

返回文章
返回