论文中文题名: | 基于图像与视频数据的煤矸石快速识别方法研究 |
姓名: | |
学号: | 20206043044 |
保密级别: | 保密(1年后开放) |
论文语种: | chi |
学科代码: | 0811 |
学科名称: | 工学 - 控制科学与工程 |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2023 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 图像处理 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2023-06-19 |
论文答辩日期: | 2023-06-02 |
论文外文题名: | : Research on Rapid Recognition Method of Coal Gangue Based on Image and Video Data |
论文中文关键词: | 矸石 ; 图像目标识别 ; Tiny YOLOv3 ; 视频目标识别 ; 长短期存储 |
论文外文关键词: | Coal gangue ; Image object recognition ; Tiny YOLOv3 ; Video object recognition ; Long short-term storage |
论文中文摘要: |
煤炭开采中矸石的大量混入降低了煤炭的生产效率和经济效益。煤矸石识别和分 选是提高煤炭资源质量的有效途径,常用的识别方式存在诸多局限已不适用于逐渐信 息化、智能化的现代煤矿,探索更优性能的煤矸石识别方法成为研究重点。基于此, 本文对基于图像和视频数据的煤矸石识别方法开展研究,具体内容如下: 1. 针对煤矸石图像中特征图尺寸不一、重要通道权重低及卷积层参数量大的问题, 首先,本文设计多卷积核组合池化的空间金字塔池化网络来确保输入特征图被处理为 固定尺寸,设计压缩激励模块增强煤矸石图像中重要通道的被关注度,再引入空洞卷 积层捕获上下文图像信息以增大感受野;其次,提出一种基于改进 tiny YOLOv3 的煤 矸石快速识别模型;最后,实验表明本文图像模型识别精度为 99.4%,与 tiny YOLOv3 相比训练耗时降低了 7.41%,损失值提升了 53.01%,具有显著的性能优势。 2. 针对图像数据存在的偶然性、低效率等弊端,首先,本文设计时空关系网络对 煤矸石视频帧序列进行多尺度特征聚合,减少了冗余数据带来的计算负担;其次,设 计关键帧选取框架和注意力机制来筛选关键帧,构建长、短期视频帧,并调节不同视 频帧之间的权重来增强关键特征的被关注度;最后,设计了长短期存储模块对长、短 期视频帧特征进行存储,并在关键帧识别时进行融合以增强识别精度。在此基础上, 本文设计了基于长短期聚合特征存储的煤矸石视频快速识别模型。 3. 本文采集宁夏某选煤厂煤矸石真实分选场景视频构建煤矸石视频识别数据集。 此外,对基于长短期聚合特征存储的煤矸石视频快速识别模型分别在两个数据集上进 行模型性能验证,并与 MEGA、FGFA、RDN、DFF 等模型识别效果进行了对比。实验 表明本文所提视频识别模型的识别精度最高,在 ILSVRC2015 数据集和自建煤矸石视 频数据集上分别为 77.12%、81.97%,验证了该模型的识别可行性与性能优越性。 基于图像和视频数据的煤矸石快速识别方法在自建煤矸石图像数据集、自建煤矸 石视频数据集和 ILSVRC2015 数据集上分别取得了较好的识别效果,与同领域其他前 沿模型相比具有显著的性能优势。上述模型的设计与验证为煤矸石图像和视频识别方 法工业现场应用提供了理论基础和实验数据。 |
论文外文摘要: |
The massive mixing of gangue in coal mining reduces the production efficiency and economic benefits of coal. The recognition and separation of coal gangue is an effective way to improve the quality of coal resources, and the current recognition method has many limitations which are not applicable to the modern coal mines with gradual informatization and intelligence. Exploring the better performance of coal gangue recognition technology has become the focus of research. On the basis of this, the research on coal gangue recognition method based on image and video data is carried out in this paper, as follows: 1. To address the problems of varying feature map size, low weight of important channels and large number of convolutional layer parameters in coal gangue images. Firstly, this paper designs a spatial pyramid pooling network with multiple convolutional kernel combination pooling to ensure that the input feature map is processed to a fixed size, designs a squeeze-andexcitation module to enhance the attention of important channels in gangue images, and then introduces a dilated convolutional layer to capture contextual image information to increase the perceptual field; secondly, a fast recognition model based on improved tiny YOLOv3 is proposed; finally, experiments show that the recognition accuracy of this model is 99.4%, the training time is reduced by 7.41% and the loss value is improved by 53.01% compared with tiny YOLOv3, which has significant performance advantages. 2. To address the drawbacks of image data such as occasional and inefficient, firstly, this paper designs a temporal relationship network for multi-scale feature aggregation of coal gangue video frame sequences to reduce the computational burden caused by redundant data; Secondly, designing key frame selection framework and attention mechanism to filter key frames, Constructing long and short-term video frames and adjust the weights between different video frames to improve focus on key features; Finally, a long short-term storage module is designed to store long and short-term video frame features and fuse them in key frame recognition to enhance recognition accuracy. Based on this, this paper introduces a fast recognition model for coal gangue video based on long and short-term aggregation feature storage. 3. In this paper, real coal gangue sorting scenes in Ningxia coal processing plant has been collected to construct a coal gangue video recognition dataset. In addition, the performance of the fast recognition model of gangue video based on long and short-term aggregated feature storage has been verified on two data sets, and the recognition effect is compared with MEGA, FGFA, RDN, DFF. The experiment data shows that the recognition accuracy of the proposed video recognition model on the ILSVRC2015 dataset and self-built gangue video dataset is the highest, that is 77.12% and 81.97% respectively, which verifies the recognition feasibility and superior performance of the model. The fast recognition method of coal gangue based on image and video data is designed in this paper, which achieves better recognition results on self-built coal gangue image dataset and ILSVRC2015 dataset respectively, and has significant performance advantages compared with other state of the art models. The above models are designed and validated to provide theoretical basis and experimental data for the application of coal gangue image and video recognition in industrial fields. |
参考文献: |
[1] 谢和平, 吴立新, 郑德志. 2025 年中国能源消费及煤炭需求预测 [J]. 煤炭学报, 2019,44(07): 1949–1960. [2] 黄金凤, 张建喜, 于江涛, 等. 并联式选矸机器人路径规划研究 [J]. 工矿自动化,2022(08): 26–32+42. a Journal of Fuel Science, 2022, 319(1): 123837–123837. [6] Shu X. Hierarchical separating and grading method of coal Google Patents. US Patent 11,498,098. [7] 陈立, 杜文华, 曾志强, 等. 基于小波变换的煤矸石自动分选方法 [J]. 工矿自动化, 2018, 44(12): 60–64. [29] 胡璟皓, 高妍, 张红娟, 等. 基于深度学习的带式输送机非煤异物识别方法 [J]. 工矿自动化, 2021, 47(6): 57–62+90. [30] 张梦超, 周满山, 张媛, 等. 基于深度学习的矿用输送带损伤检测方法 [J]. 工矿自动化, 2021, 47(6): 51–56. [31] 蒋磊, 马六章, 杨克虎, 等. 基于 MFCC 和 FD-CNN 卷积神经网络的综放工作面煤矸智能识别 [J]. 煤炭学报, 2020, 46(S2): 1109–1117. [32] 许志, 李敬兆, 张传江, 等. 轻量化 CNN 及其在煤矿智能视频监控中的应用 [J]. 工矿自动化, 2020, 46(12): 13–19. [53] 高新宇. 基于机器视觉的煤矸智能分选系统设计 [D]. 太原: 太原理工大学, 2021. [56] 饶中钰, 吴景涛, 李明. 煤矸石图像分类方法 [J]. 工矿自动化, 2020, 46(3): 69–73. [57] 王鹏, 曹现刚, 夏晶, 等. 基于机器视觉的多机械臂煤矸石分拣机器人系统研究 [J]. 工矿自动化, 2019, 45(9): 47–53. [59] 赵杰, 孙伟, 徐中达, 等. 基于形态学预处理的数字图像相关方法研究 [J]. 实验力学, 2022, 37(5): 629–637. [60] 于正永, 唐万春. 一种新颖的高低阻抗带状线低通滤波器分析方法 [J]. 兰州理工大学学报, 2019, 45(3): 90–94. [61] 张雪峰, 闫慧. 基于中值滤波和分数阶滤波的图像去噪与增强算法 [J]. 东北大学学报 (自然科学版), 2020, 48(4): 37–46. [62] 林昌, 周海峰, 陈武. 基于双边滤波的高斯金字塔变换 Retinex 图像增强算法 [J]. 激光与光电子学进展, 2020, 57(16): 209–215. [63] 吴景涛. 机器视觉技术在煤矸石智能识别中的应用研究 [D]. 徐州: 中国矿业大学2019. [64] 孙继平. AQ 6201—2017《煤矿安全监控系统通用技术要求》(报批稿)[J]. 工矿自动化, 2017(7): 1–6. [65] 贾婷婷. 基于深度学习的视频预处理算法研究和优化实现 [D]. 杭州: 浙江工业大学,2020. [66] 江巨浪, 刘国明, 朱柱, 等. 基于快速模糊聚类的动态多直方图均衡化算法 [J]. 电子学报, 2022, 50(1): 167–176. |
中图分类号: | TP391 |
开放日期: | 2024-06-19 |