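Thesis title (Chinese): | 复杂环境下的货物图像识别方法研究 |
Name: | |
Student ID: | 19308207004 |
Confidentiality level: | Public |
Thesis language: | chi |
Discipline code: | 085400 |
Discipline name: | Engineering - Electronic Information |
Student type: | Master's |
Degree level: | Master of Engineering |
Degree year: | 2022 |
Degree-granting institution: | Xi'an University of Science and Technology |
School/Department: | |
Major: | |
Research direction: | Computer graphics and image processing technology |
First supervisor name: | |
First supervisor affiliation: | |
Thesis submission date: | 2022-06-23 |
Thesis defense date: | 2022-06-07 |
Thesis title (English): | Research on Cargo Image Recognition Method in Complex Environment |
Keywords (Chinese): | |
Keywords (English): | Image recognition ; Deep learning ; ESRGAN ; CGAN ; Automatic patrol system |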
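Abstract (Chinese): |
The automatic inspection system is an important component of an integrated security system; it uses image recognition to protect important materials from external threats. These materials are stored in complex environments, and recognition is affected by shooting angle, illumination, and occlusion, so the captured images suffer from small targets, strong background interference, and low resolution, and recognition accuracy is insufficient. To address these problems, this thesis studies cargo image recognition methods for complex environments. The research comprises three parts: (1) An image recognition method that combines target detection with ESRGAN. To deal with the small targets, strong background interference, and low resolution of the proprietary image dataset Cargo-images, an image recognition method called TEResNet (Target-detection and ESRGAN before ResNet) is proposed. The method has three steps: first, a target detection method extracts the target image; next, an ESRGAN (Enhanced Super-Resolution Generative Adversarial Networks) model raises the resolution of the detected target image; finally, an improved ResNet model performs the recognition. Comparative experiments on three public datasets and one proprietary dataset show that TEResNet achieves higher recognition accuracy than four convolutional neural network models: ResNet, AlexNet, GoogLeNet, and MobileNet. (2) For images that TEResNet fails to recognize, a new post-recognition method called CISCGAN (Compute Image Similarity and Conditional Generative Adversarial Network) is proposed. The method has three steps: first, using three metrics, mean squared error (MSE), peak signal-to-noise ratio (PSNR), and structural similarity (SSIM), it selects from the correctly recognized samples in the training image library the one sample image most similar to the failed image; next, it feeds the selected sample image into a CGAN model to generate a new image; finally, it applies TEResNet to the image generated by the CGAN model. Comparative experiments on the proprietary dataset show that CISCGAN further improves recognition accuracy. (3) Building on these two parts, an automatic inspection system with image recognition was developed. The system was developed with Microsoft Visual Studio 2012 and a Microsoft SQL Server 2012 database, and the image recognition function is implemented with the deep learning framework PyTorch. System tests show that the recognition accuracy meets the design requirements. Through this work, a simulated-cargo image recognition model for an automatic inspection system in complex environments was built, fully automatic real-time inspection was achieved, and a more accurate and reliable means of judging the safety of important materials was provided. |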
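The three TEResNet steps named in the abstract (detect the target, raise its resolution, then classify) can be sketched as a simple function composition. This is only an illustration of the control flow, not the thesis implementation: `crop`, `upscale_nearest`, and `recognize` are hypothetical names, the detector and classifier are passed in as plain callables, and nearest-neighbour upscaling merely stands in for the ESRGAN model.

```python
from typing import Callable, List, Tuple

Image = List[List[float]]          # grayscale image as rows of pixel values
Box = Tuple[int, int, int, int]    # (top, left, bottom, right), end-exclusive


def crop(image: Image, box: Box) -> Image:
    """Cut the detected target region out of the full frame."""
    top, left, bottom, right = box
    return [row[left:right] for row in image[top:bottom]]


def upscale_nearest(image: Image, factor: int = 2) -> Image:
    """Placeholder for the super-resolution step (ESRGAN in the thesis).

    Nearest-neighbour upscaling is used here only so the sketch runs
    without a trained model; it adds no real detail.
    """
    out: Image = []
    for row in image:
        wide = [p for p in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def recognize(
    image: Image,
    detect: Callable[[Image], Box],
    super_resolve: Callable[[Image], Image],
    classify: Callable[[Image], str],
) -> str:
    """Run the three stages in order: detect, super-resolve, classify."""
    target = crop(image, detect(image))
    enhanced = super_resolve(target)
    return classify(enhanced)
```

Passing the stages in as callables keeps the ordering explicit and lets each stage be replaced by a real model (for example, PyTorch modules wrapped in functions) without changing `recognize`.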
Abstract (English): |
As an important component of an integrated security system, the automatic inspection system uses image recognition technology to protect important materials from external threats. The storage environment of the protected materials is complex, and recognition is affected by factors such as shooting angle, illumination, and occlusion, so the captured images suffer from small targets, strong background interference, low resolution, and insufficient recognition accuracy. To address these problems, this thesis studies methods for cargo image recognition in complex environments. It covers three main research topics: (1) An image recognition method that fuses target detection with ESRGAN was studied. To overcome the small targets, strong background interference, and low resolution of the proprietary image dataset Cargo-images, an image recognition method called TEResNet (Target-detection and ESRGAN before ResNet) was proposed. The method had three steps: first, the target image was obtained with a target detection method; then, an ESRGAN (Enhanced Super-Resolution Generative Adversarial Networks) model was used to improve the resolution of the detected target image; finally, an improved ResNet model was used for image recognition. Comparative experiments were conducted on three public datasets and one proprietary dataset. The experimental results showed that TEResNet had higher recognition accuracy than ResNet, AlexNet, GoogLeNet, and MobileNet. (2) A new post-recognition method, named CISCGAN (Compute Image Similarity and Conditional Generative Adversarial Network), was proposed for images that TEResNet failed to recognize.
CISCGAN had three steps: first, according to the mean squared error (MSE), peak signal-to-noise ratio (PSNR), and structural similarity (SSIM), the sample image most similar to the failed image was selected from the correctly recognized samples in the training image set; then, the selected image was fed into the CGAN model to generate a new image; finally, the image generated by the CGAN model was recognized with TEResNet. Comparative experiments were conducted on the proprietary dataset. The experimental results showed that CISCGAN further improved the accuracy of image recognition. (3) Based on these two research results, an automatic inspection system with an image recognition function was developed. The system was developed with Microsoft Visual Studio 2012 and a Microsoft SQL Server 2012 database. Image recognition was implemented with the deep learning framework PyTorch. The system test results showed that the image recognition accuracy met the design requirements. Through this research, a simulated-cargo image recognition model for the automatic inspection system in complex environments was constructed. The system inspects important materials automatically and in real time, and provides a more accurate and reliable way to judge their safety. |
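The first CISCGAN step, picking the stored sample most similar to a failed image by MSE, PSNR, and SSIM, can be sketched in plain Python as follows. Several points are assumptions rather than details from the thesis: images are flattened lists of 8-bit grayscale values of equal length, SSIM is computed over a single global window (the usual definition averages over local windows), and candidates are ranked by SSIM with PSNR as a tie-breaker, since the abstract does not say how the three metrics are combined. The function names are hypothetical.

```python
import math
from typing import Sequence


def mse(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean squared error between two equally sized flattened images."""
    if len(a) != len(b) or not a:
        raise ValueError("images must be non-empty and the same size")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def psnr(a: Sequence[float], b: Sequence[float], peak: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB; infinite for identical images."""
    err = mse(a, b)
    return math.inf if err == 0 else 10 * math.log10(peak ** 2 / err)


def global_ssim(a: Sequence[float], b: Sequence[float], peak: float = 255.0) -> float:
    """SSIM over one global window (a simplification of windowed SSIM)."""
    if len(a) != len(b) or not a:
        raise ValueError("images must be non-empty and the same size")
    n = len(a)
    mu_a, mu_b = sum(a) / n, sum(b) / n
    var_a = sum((x - mu_a) ** 2 for x in a) / n
    var_b = sum((y - mu_b) ** 2 for y in b) / n
    cov = sum((x - mu_a) * (y - mu_b) for x, y in zip(a, b)) / n
    c1, c2 = (0.01 * peak) ** 2, (0.03 * peak) ** 2  # standard stabilizers
    return ((2 * mu_a * mu_b + c1) * (2 * cov + c2)) / (
        (mu_a ** 2 + mu_b ** 2 + c1) * (var_a + var_b + c2)
    )


def most_similar(query: Sequence[float], candidates: Sequence[Sequence[float]]) -> int:
    """Index of the best candidate: highest SSIM, then highest PSNR.

    The ranking rule is an assumption made for this sketch.
    """
    if not candidates:
        raise ValueError("no candidates to compare against")
    return max(
        range(len(candidates)),
        key=lambda i: (global_ssim(query, candidates[i]), psnr(query, candidates[i])),
    )
```

Because PSNR is a monotonic function of MSE, ranking by one is equivalent to ranking by the other, so the tie-breaker above covers both error-based metrics. For real images, a windowed SSIM from an established image-processing library would be a better fit than the global variant shown here.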
Chinese Library Classification (CLC) number: | TP391.4 |
Release date: | 2022-06-24 |