论文中文题名: | 基于边缘生成的图像修复方法研究 |
姓名: | |
学号: | 21207223050 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 085400 |
学科名称: | 工学 - 电子信息 |
学生类型: | 硕士 |
学位级别: | 工程硕士 |
学位年度: | 2024 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 数字图像处理 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2024-06-11 |
论文答辩日期: | 2024-05-29 |
论文外文题名: | Research on Image Inpainting Algorithm based on Edge Generation |
论文中文关键词: | |
论文外文关键词: | Image inpainting ; Generative adversarial network ; Residual network ; Attention mechanism |
论文中文摘要: |
图像修复是图像处理领域中的重要研究方向之一,其目标是对信息传递过程受噪音影响或受污染的缺损图像进行合理填充,生成符合人主观视觉需求的修复结果。对修复结果的生成图像结构、纹理质量、清晰度有着极高要求。传统的图像修复算法基于图像的低级像素信息计算相似度来完成对破损图像的修复,不具有提取深层语义的信息,在面对大规模破损时修复质量不佳;现阶段基于深度学习的图像修复技术能够充分提取图像的深层语义信息,适用范围更广,但是在面对大面积破损时,修复结果仍然会出现伪影、结构模糊、修复边缘结构断层等问题。此外,实际应用中图像的缺损区域往往呈现随机不规则的形状,当前的图像修复算法研究在面对随机缺损时修复质量不佳。因此,本文为了解决当前图像修复技术出现的问题,设计了一个基于边缘生成的两阶段图像修复算法,本文的主要工作内容如下: (1)基于堆叠生成网络设计了一个两阶段图像修复网络,通过分析人工修复技术的原理,将修复过程分为边缘预测和纹理填补两个阶段。第一阶段网络为边缘生成网络,使用 Canny 边缘提取算子对破损图像进行边缘提取,将提取到的破损边缘图作为输入,训练网络进行边缘预测,对破损图像进行结构修复。第二阶段网络为修复填补网络,使用第一阶段网络生成的边缘图作为结构引导和破损图像进行合并作为输入,最终输出修复结果。两阶段网络生成器设计为编码器、解码器和中间层结构,对图像进行特征提取融合,判别器使用谱归一化马尔可夫判别器提升网络训练稳定性。 (2)设计多尺度特征融合模块代替传统的残差网络作为两阶段网络中间层,使用符合混合扩张卷积理论扩张率的不同扩张卷积提升模块感受野,同时保留对邻近信息的提取,使用通道注意力机制对不同卷积结果进行权重分配,训练后的第二阶段修复填补网络在 Places365-Standard 场景数据集上相较于对比算法,峰值信噪比提升了 0.58 dB,结构相似度提升了 0.86%。 (3)使用门控卷积对代替编码器解码器中的普通卷积,为多尺度特征融合模块添加门控机制,实现网络训练时对有效和无效像素特殊的处理,提高网络对有效部分信息的利用,在部分像素缺失和大面积损坏情况下,能够更好地对局部特征进行精确描述。在修复填补网络中引入上下文注意力机制模块对特征图进行前后板块划分,将背景板块重塑为卷积核,通过和前景板块的卷积计算注意力分数,使网络在色彩填补过程中可以从遥远空间位置获取纹理特征信息。改进后的网络在 Places365-Standard 场景数据集上相较于对比算法,峰值信噪比提升了 1.44 dB,结构相似度提升了 2.26%。在人脸数据集 CelebA上进行人脸修复实验,相较于对比算法峰值信噪比提升了 2.12 dB,结构相似度提升了2.58%。 |
论文外文摘要: |
Image inpainting is one of the important research directions in the field of image processing, and its goal is to reasonably fill in the defective images affected or polluted by noise in the information transmission process, and generate restoration results that meet the subjective visual needs of people. There are extremely high requirements for the image structure, texture quality and clarity of the restoration results. At present, the image restoration technology based on deep learning can fully extract the deep semantic information of the image and has a wider range of applications, but in the face of large-scale damage, the restoration results still have problems such as artifacts, blurred structures, and repaired edge structure faults. In addition, the defective areas of images in practical applications often show random irregular shapes, and the current image restoration algorithm research has poor repair quality in the face of random defects. Therefore, in order to solve the problems of current image inpainting technology, a two-stage image inpainting algorithm based on edge generation is designed, and the main contects of this thesis is as follows: (1) A two-stage image inpainting network was designed based on the stacked generation network StackGAN, and the restoration process was divided into two stages: edge prediction and texture filling by analyzing the principle of artificial restoration technology. In the first stage, the network is an edge generation network, in which the Canny edge extraction operator is used to extract the edge of the damaged image, and the extracted edge map is used as the input to train the network for edge prediction and structural repair of the damaged image. The second stage network is the repair and filling network, and the edge map generated by the first-stage network is used as the structure guide and the damaged image is merged as the input, and the repair result is finally output. The two-stage network generator is designed as an encoder, decoder and middle-layer structure, which performs feature extraction and fusion on the image, and the discriminator uses the spectral normalized Markov discriminator to improve the stability of network training. (2) The multi-scale feature fusion module is designed to replace the traditional residual network as the middle layer of the two-stage network, and the receptive field of the module is improved by using different expansion convolutional expansion rates that conform to the expansion rate of the hybrid expansion convolution theory, while retaining the extraction of adjacent information, and the channel attention mechanism is used to distribute the weights of different convolution results, and the PSNR and SSIM of the second-stage repair filling network after training are increased by 0.58 dB and the SSIM is increased by 0.86% compared with the comparison algorithm on the Places365-Standard scene dataset. (3) Gated convolution pairs are used to replace the ordinary convolutions in the encoder decoder, and a gating mechanism is added to the multi-scale feature fusion module, so as to realize the special processing of valid and invalid pixels during network training, improve the utilization of effective part information by the network, and better describe the local features accurately in the case of missing pixels and large-scale damage. In the repair filling network, the context attention mechanism module is introduced to divide the front and back plates of the feature map, the background plate is reshaped into a convolutional kernel, and the attention score is calculated by convolution with the foreground plate, so that the network can obtain texture feature information from distant spatial locations in the process of color filling. Compared with the comparison algorithm, the PSNR and SSIM of the improved network are increased by 1.44 dB and the SSIM is increased by 2.26% compared with the comparison algorithm on the Places365-Standard scene dataset. Compared with the comparison algorithm, the PSNR increased by 2.12 dB and SSIM is increased by 2.58%. |
参考文献: |
[1]彭进业,余喆,屈书毅,等.基于深度学习的图像修复方法研究综述[J].西北大学学报(自然科学版),2023,53(06):943-963. [2]李月龙,高云,闫家良等.基于深度神经网络的图像缺损修复方法综述[J].计算机学报,2021,44(11):2295-2316. [3]罗海银,郑钰辉.图像修复方法研究综述[J].计算机科学与探索,2022,16(10):2193-2218. [10]李奇,李龙,王卫,等.基于改进Criminisi算法的破损纺织品文物图像修复[J].激光与光电子学进展,2023,60(16):173-182. [35]杨红菊,李丽琴,王鼎.联合语义分割与边缘重建的深度学习图像修复[J].中国图象图形学报,2022,27(12):3553-3565. [36]章登勇,文凰,李峰,等.基于双分支网络的图像修复取证方法[J].网络与信息安全学报,2022,8(06):110-122. [37]孙宇,李晗.基于结构-纹理分解的数字图像分层修复算法[J].计算机仿真,2023,40(06):220-224. [45]武文杰,王红蕾.基于生成对抗网络的遮挡人脸图像修复的改进与实现[J].计算机应用与软件,2021,38(01):217-221+249. |
中图分类号: | TP391 |
开放日期: | 2024-06-11 |