论文中文题名: | 基于YOWO的井下皮带运输系统违规跨越行为检测算法研究 |
姓名: | |
学号: | 21207040031 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 081002 |
学科名称: | 工学 - 信息与通信工程 - 信号与信息处理 |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2024 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 深度学习 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2024-06-13 |
论文答辩日期: | 2024-06-05 |
论文外文题名: | Research on The Detection Algorithm of Lllegal Crossing Behavior in Underground Belt Transportation System Based on YOWO |
论文中文关键词: | |
论文外文关键词: | Illegal crossing of belts ; Spatio-temporal behavior detection ; YOWO ; Lightweight ; Model deployment |
论文中文摘要: |
井下皮带运输系统不仅是煤矿运输中至关重要的一环,也是人员容易发生事故的重点作业区域,因此对皮带运输系统违规跨越皮带行为检测的研究非常重要。但现有的行为检测算法精度不高,且用于皮带运输系统违规跨越行为的专用检测算法较少。因此,本文研究了一种高精度轻量级的皮带运输系统违规跨越行为检测算法。 针对现有时空行为检测算法将定位和分类任务解耦为两阶段,导致模型无法统一优化的问题,本文基于单阶段时空行为检测算法YOWO进行改进,提出YOWO-CCAA算法。首先,改进了YOWO的时空特征提取网络,通过3D CNN分支提取动作的运动特征,通过2D CNN分支提取动作主体的空间特征。其次,针对YOWO算法仅在通道维度对双分支特征进行融合,提出多级特征融合模块CCAA,在通道和空间层面进行特征融合,有效提升模型的检测精度。最后,采用SIoU作为边界框回归损失函数,加快模型收敛速度。实验结果表明,在自建的矿工跨越皮带数据集上,改进后的算法F-mAP达到93.83%,在UCF101-24和JHMDB-51公开数据集上较YOWO算法提升了2.51%和1.17%。 针对YOWO-CCAA算法参数量大,无法在嵌入式设备端部署进行实时检测的问题,本文基于GhostNetV2轻量级网络,设计Ghost-YOWO-CCAA轻量化算法。首先,将2D-GhostNetV2扩展为3D-GhostNetV2,重构时空提取网络的运动分支。其次,设计轻量级的特征提取结构C3GhostV2,重构时空特征提取网络的目标分支。轻量化后的模型为80MB,相较原模型体积压缩了4.5倍,在服务器端的推理速度达到41.4帧/秒,提升了1.62倍,F-mAP达到91.70%。 最后对轻量化后的模型进行模型转换和量化处理,部署在以RK3399pro为主处理器的嵌入式设备上进行应用测试。在嵌入式设备上的检测帧率约为26 FPS,满足了实际应用中25帧/秒的实时检测需求。尽管检测精度有微小下降,但达到了90%以上的通用性要求。研究结果表明,本文所提出的算法对井下皮带运输系统违规跨越行为检测方面具有一定的参考价值。 |
论文外文摘要: |
The underground belt transportation system is not only a crucial part of coal mine transportation, but also a key operation area where personnel are prone to accidents, so the research on the detection of illegal crossing behavior of the belt transportation system is very important. However, the existing behavior detection algorithms have low accuracy and fewer dedicated detection algorithms are used for the illegal crossing behavior of the belt transportation system, so this thesis investigates a high-precision and lightweight detection algorithm for the illegal crossing behavior of the belt transportation system. Aiming at the problem that the existing spatio-temporal behavior detection algorithm decouples the localization and classification tasks into two stages, resulting in the model not being able to be optimized uniformly, this thesis improves the single-stage spatio-temporal behavior detection algorithm YOWO based on the single-stage spatio-temporal behavior detection algorithm, and proposes the YOWO-CCAA algorithm. First, the spatio-temporal feature extraction network of YOWO is improved to extract the motion features of the action through 3D CNN branches and the spatial features of the action subject through 2D CNN branches. Second, for the YOWO algorithm which only fuses the dual-branch features in the channel dimension, a multi-level feature fusion module, CCAA, is proposed to perform feature fusion in the channel and spatial dimensions, which effectively improves the model detection accuracy. Finally, SIoU is adopted as the bounding box regression loss function to accelerate the model convergence speed. The experimental results show that the improved algorithm F-mAP reaches 93.83% on the self-built miner crossing belt dataset, and improves 2.51% and 1.17% over the YOWO algorithm on the UCF101-24 and JHMDB-51 public datasets. Aiming at the problem that the YOWO-CCAA algorithm has a large number of parameters and cannot be deployed on the embedded device side for real-time detection, this thesis designs the Ghost-YOWO-CCAA lightweight algorithm based on the GhostNetV2 lightweight network. First, 2D-GhostNetV2 is extended to 3D-GhostNetV2 to reconstruct the motion branch of the spatio-temporal extraction network. Second, the lightweight feature extraction structure C3GhostV2 is designed to reconstruct the target branch of the spatio-temporal feature extraction network. The lightweight model is 80MB, which is 4.5 times compressed compared with the original model volume, and the inference speed on the server side reaches 41.4 frames/sec, which is 1.62 times improved, and the F-mAP reaches 91.70%. Finally, the lightweight model is subjected to model transformation and quantization, and deployed on an embedded device with RK3399pro as the main processor for application testing. The detection frame rate on the embedded device is about 26 FPS, which meets the real-time detection requirement of 25 fps in practical applications. Despite the slight decrease in detection accuracy, the generality requirement of more than 90% is achieved, and the results show that the algorithm proposed in this thesis has certain reference value for the detection of illegal crossing behavior in underground belt transportation system. |
参考文献: |
[1]赵亚军, 张志男, 贾廷贵. 2010—2021年我国煤矿安全事故分析及安全对策研究[J]. 煤炭技术, 2023, 42 (08): 128-131. [34]程叶群, 王艳, 范裕莹, 等. 基于卷积神经网络的轻量化目标检测网络[J]. Laser & Optoelectronics Progress, 2021, 58(16): 1610023. [39]张泰, 张为, 刘艳艳. 周界视频监控中人员翻越行为检测算法[J]. 西安交通大学学报, 2016, 50 (06): 47-53. [40]倪焱. 基于机器视觉的露台人体危险行为检测[D]. 长春理工大学, 2019. [41]李瑞. 面向智能监控的异常行为实时识别[D]. 哈尔滨工程大学, 2021. [42]杨源. 博物馆游客违规行为识别系统设计与实现[D]. 哈尔滨工业大学, 2021. [43]李逸辰. 基于Kinect的地铁乘客异常行为检测[D]. 中国矿业大学, 2021. [45]王志鹏, 王涛. 基于Faster RCNN的穿越围栏违规行为检测[J]. 计算机系统应用, 2022, 31 (04): 346-351. [46]周巧瑜, 曹扬, 詹瑾瑜, 等. 基于Yolo和GOTURN的景区游客翻越行为识别[J]. 计算机技术与发展, 2022, 32 (01): 134-140. |
中图分类号: | TP391.41 |
开放日期: | 2024-06-14 |