论文中文题名: | 基于生成对抗网络的图像超分辨率重建算法研究 |
姓名: | |
学号: | 19207040010 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 081002 |
学科名称: | 工学 - 信息与通信工程 - 信号与信息处理 |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2022 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 计算机视觉 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2022-06-20 |
论文答辩日期: | 2022-06-06 |
论文外文题名: | Research on Image Super-Resolution reconstruction Algorithm based on Generative Adversarial Network |
论文中文关键词: | |
论文外文关键词: | Super-resolution reconstruction ; Generative adversarial network ; Residual network ; Dense network |
论文中文摘要: |
图像是当今信息传递和呈现的主要载体,高分辨率图像因其具有清晰的画质和丰富的细节,为计算机视觉领域的发展提供了坚实的基础。然而,由于成像设备、外界环境等因素的干扰,实际获取到的图像较为模糊,此时可通过超分辨率重建技术来提高图像的清晰度。近年来,基于生成对抗网络的图像超分辨率重建方法恢复的图像真实自然,更符合人眼的视觉感知。但重建纹理色彩丰富的图像会出现细节模糊、纹理杂乱等现象,并且生成对抗网络训练困难,存在模型不稳定、易崩溃等情况。 针对生成对抗网络的图像超分辨率重建算法(SRGAN)恢复的图像细节不够清晰、纹理杂乱无章的问题,提出了一种基于密集残差网络的超分辨率重建算法(SRGAN-R)。首先,在网络结构中删除批量归一化层降低网络的计算复杂度,同时保证图像的色彩、亮度信息不变。然后,借鉴密集网络分层信息和残差网络快速融合底层特征与高层特征的优势,在网络中加入密集残差块结构,提升网络的表达能力。最后,在四个基准数据集和一个自采医学数据集上进行测试,结果表明,所提算法与SRGAN算法相比,加快了模型的重建速度,重建图像的平均PSNR值和SSIM值分别提升了0.488dB和0.0246,重建图像的细节更清晰、纹理更逼真。该算法虽然提高了重建图像的质量,但在实际应用场景中对算法的稳定性也有要求,因此本文进一步对算法进行优化。 针对生成对抗网络难以收敛、训练不稳定的问题,在SRGAN-R的基础上,提出了一种优化损失函数的超分辨率重建算法(SRWGAN-R)。首先,在判别网络中借鉴WGAN来优化算法,稳定网络的训练,并用优化的残差块来提取深层特征。然后,在损失函数中使用优化的感知损失、内容损失、对抗损失以及纹理损失共同监督模型训练。最后,在四个基准数据集和一个自采医学数据集上进行测试,结果表明,所提算法与SRGAN算法相比,模型收敛速度更快,更稳定。同时,重建图像的平均PSNR值和SSIM值相比SRGAN算法提高了1.088dB和0.0329,相比SRGAN-R算法提高了0.58dB和0.0187,重建图像纹理更丰富,亮度色彩信息更符合原始图像,较好地提升了重建图像的质量。 |
论文外文摘要: |
Image is the main carrier of information transmission and presentation. High-resolution image provides a solid foundation for the development of computer vision because of their clear image quality and rich details. However, due to the interference of imaging equipment, external environment, and other factors, the obtained image is relatively blurred. At this time, the definition of the image can be improved by super-resolution reconstruction technology. In recent years, the image super-resolution reconstruction method based on the generated countermeasure network restores the real and natural image, which is more in line with the visual perception of human eyes. However, when reconstructing images with rich texture and color, there will be some phenomena such as blurred details and disordered texture, and it is difficult to generate confrontation network training, and the model is unstable and easy to collapse. In order to solve the problems of unclear image details and disordered texture restored by image super-resolution reconstruction algorithm (SRGAN) generating countermeasure network, a super-resolution reconstruction algorithm based on dense residual network (SRGAN-R) is proposed. Firstly, the batch normalization layer is deleted in the network structure to reduce the computational complexity of the network and ensure that the color and brightness information of the image remains unchanged. The advantages of hierarchical network and dense networks can be used for reference, and then the characteristics of hierarchical networks and dense networks can be quickly expressed. Finally, it is tested on four benchmark data sets and one self-collected medical data set. The results show that compared with the SRGAN algorithm, the proposed algorithm accelerates the reconstruction speed of the model, improves the average PSNR value and SSIM value of the reconstructed image by 0.488dB and 0.0246 respectively, and the details of the reconstructed image are clearer and the texture is more realistic. Although the algorithm improves the quality of the reconstructed image, it also requires the stability of the algorithm in the actual application scene. Therefore, this paper further optimizes the algorithm. To address the problems of difficult convergence and unstable training of generated countermeasure network, a super-resolution reconstruction algorithm based on optimized loss function (SRWGAN-R) is proposed based on SRGAN-R. Firstly, the WGAN is used to optimize the algorithm in the discrimination network to stabilize the training of the network, and the optimized residual block is used to extract the deep features. Then, in the loss function, the optimized perception loss, content loss, confrontation loss, and texture loss are used to supervise the model training. Finally, four benchmark data sets and one self-collected medical data set are tested. The results show that the proposed algorithm converges faster and is more stable than the SRGAN algorithm. At the same time, the average PSNR value and SSIM value of the reconstructed image are improved by 1.088db and 0.0329 compared with the SRGAN algorithm, and 0.58dB and 0.0187 compared with the SRGAN-R algorithm. The texture of the reconstructed image is richer, the brightness and color information is more consistent with the original image, and the quality of the reconstructed image is better improved. |
参考文献: |
[1]冯博迪, 杨海涛, 李高源, 王晋宇, 张长弓. 神经网络在SAR图像目标识别中的研究综述[J]. 兵器装备工程学报, 2021, 42(10): 15-22. [4]刘朋伟, 高媛, 秦品乐, 殷喆, 王丽芳. 基于多感受野的生成对抗网络医学MRI影像超分辨率重建[J]. 计算机应用, 2022, 42(03): 938-945. [6]王殿伟, 郝元杰, 刘颖, 谢永军, 宋海军. 基于逐级反投影网络的车牌图像超分辨率重建[J]. 激光与光电子学进展, 2020, 57(16): 64-72. [7]房垚鑫, 郭宝峰, 马超. 基于改进点扩散函数的遥感图像超分辨率重建[J]. 激光技术, 2019, 43(05): 123-128. [9]郭岑, 尹增山, 高爽. 空间感知残差网络的遥感图像超分辨率重建[J]. 测绘科学, 2020, 45(05): 49-55. [10]吴磊, 吕国强, 赵晨, 盛杰超, 冯奇斌. 基于多尺度残差网络的CT图像超分辨率重建[J]. 液晶与显示, 2019, 34(10): 1006-1012. [12]刘颖, 朱丽, 林庆帆, 李莹华, 王富平, 卢津. 图像超分辨率技术的回顾与展望[J].计算机科学与探索, 2020, 14(02): 181-199. [14]夏皓, 吕宏峰, 罗军, 蔡念. 图像超分辨率深度学习研究及应用进展[J]. 计算机工程与应用, 2021, 57(24): 51-60. [32]毕玉萍. 基于改进的生成式对抗网络的图像超分辨率重建算法研究[D]. 青岛大学, 2020. [33]郑喆. 基于生成对抗网络的图像超分辨率重建算法研究[D]. 浙江师范大学, 2021. [34]李佳星, 赵勇先, 王京华. 基于深度学习的单幅图像超分辨率重建算法综述[J]. 自动化学报, 2021, 47(10): 2341-2363. [38]郭海涛, 李宸尧, 崔君, 余东行, 卢俊, 高慧. 多尺度并联卷积神经网络遥感影像油罐目标检测算法[J]. 测绘科学技术学报, 2021, 38(02): 148-154. [40]王鑫, 王翠荣, 王聪, 苑迎. 双通道多感知卷积神经网络图像超分辨率重建[J]. 东北大学学报(自然科学版), 2020, 41(11): 1564-1569. [41]方玉明, 眭相杰, 鄢杰斌, 刘学林, 黄丽萍. 无参考图像质量评价研究进展[J]. 中国图象图形学报, 2021, 26(02): 265-286. [42]朱海琦, 李宏, 李定文, 李富. 基于生成对抗网络的单图像超分辨率重建[J]. 吉林大学学报(理学版), 2021, 59(06): 1491-1498. [45]胡高鹏, 陈子鎏, 王晓明, 张开放, 黄增喜, 杜亚军. 基于深度残差反投影注意力网络的图像超分辨率[J]. 计算机应用研究, 2020, 38(04): 1271-1276. [47]吴鹏, 林国强, 郭玉荣, 赵振兵. 自学习稀疏密集连接卷积神经网络图像分类方法[J]. 信号处理, 2019, 35(10): 1747-1752. [48]杨文霞, 王萌, 张亮. 基于密集连接块U-Net的语义人脸图像修复[J]. 计算机应用, 2020, 40(12): 3651-3657. [50]程哲, 白茜, 张浩, 王世普, 梁宇. 使用深层卷积神经网络提高Hi-C数据分辨率[J]. 计算机科学, 2020, 47(S1): 70-74. [51]薛英杰, 韩威, 周松斌, 刘忆森. 基于生成对抗单分类网络的异常声音检测[J]. 吉林大学学报(理学版), 2021, 59(06): 1517-1524. [52]黄淑英, 汪斌, 李红霞, 杨勇, 胡威.基于生成对抗网络的图像去雾算法[J]. 模式识别与人工智能, 2021, 34(11): 990-1003. [53]林志鹏, 曾立波, 吴琼水. 基于生成对抗网络的宫颈细胞图像数据增强[J]. 科学技术与工程, 2020, 20(28): 11672-11677. |
中图分类号: | TP391.4 |
开放日期: | 2022-06-21 |