高级检索

基于注意力机制和空洞卷积的CycleGAN煤矿井下低照度图像增强算法

Low illumination image enhancement algorithm of CycleGAN coal mine based on attention mechanism and Dilated convolution

  • 摘要: 井下环境复杂,充斥着大量粉尘和水汽且人造光源光照不均,导致井下监控设备采集到的图像往往呈现出照度低、细节特征丢失等问题,严重影响了矿业安全监控设备的实时性,不利于后续计算机视觉任务,同时井下数据采集困难,难以制作配对的井下低照度图像数据集用于模型训练。针对上述问题,提出了一种基于CycleGAN的低照度图像增强算法。针对矿井下采集配对数据集困难,使用CycleGAN网络进行无监督学习;为改善生成器网络的细节特征提取能力,利用无参注意力机制(simAM)和双通道注意力机制(CBAM)构建图像增强网络,提高复杂背景下模型的抗干扰能力,使模型恢复图像细节特征效果更好;引入由残差空洞卷积构建亮度增强模块,在提升图像亮度的同时增大生成器网络的感受野,有利于细节的恢复,提高图像视觉质量;以Patch-GAN作为网络的判别器,将输入映射成一个矩阵,更加全面的关注到图像不同区域的细节特征,提高判别器对图像细节的分辨能力。实验结果表明,相较于算法CycleGAN,本文方法在峰值信噪比(PSNR)、结构相似度(SSIM)、信息熵和视觉信息保真度(VIF)上平均提高了11.31%、8.07%、2.58%、6.18%。

     

    Abstract: The complex underground environment, filled with a large amount of dust and water vapor, and uneven illumination of artificial light source, leads to problems such as low illumination and loss of detail features in images collected by underground monitoring equipment, which seriously affects the real-time performance of mining safety monitoring equipment, is not good for subsequent computer vision tasks, and it is difficult to collect underground data. It is difficult to make paired low-light image data sets for model training. To solve these problems, a low illumination image enhancement algorithm based on CycleGAN is proposed. In view of the difficulty of collecting paired data set under mine, CycleGAN network is used for unsupervised learning. In order to improve the detail feature extraction ability of the generator network, the image enhancement network was constructed by using the Parameter-Free Attention Mechanism (simAM) and the Dual-Channel Attention Mechanism (CBAM) to improve the anti-interference ability of the model in complex background, so that the model could recover the image detail features better, which improved the anti-interference ability of the model under complex background and made the model recover the detail features better. A luminance enhancement module based on residual cavity convolution is introduced to increase the luminance of the image while enlarging the receptive field of the generator network, which is conducive to detail recovery and visual quality improvement. Patch-GAN is apply for the discriminator of the network, and the input is mapped into a matrix to pay more comprehensive attention to the details of different regions of the image, and improve the discriminator's resolution of image details. Experimental results show that compared with the CycleGAN algorithm, the proposed method improves the peak signal-to-noise ratio (PSNR), structural similarity (SSIM), information entropy and visual information fidelity (VIF) by 11.31%, 8.07%, 2.58% and 6.18% on average.

     

/

返回文章
返回