论文中文题名: | 基于FCOS的煤矿井下人员目标检测与跟踪方法研究 |
姓名: | |
学号: | 19208049005 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 0812 |
学科名称: | 工学 - 计算机科学与技术(可授工学、理学学位) |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2022 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 计算机视觉与可视化 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2022-06-23 |
论文答辩日期: | 2022-06-07 |
论文外文题名: | Research on target detection and tracking method of under-ground coal mine personnel based on FCOS |
论文中文关键词: | |
论文外文关键词: | Underground pedestrian detection in coal mine ; Multi-object tracking ; Deep learning ; Neural network ; Anchor-free detection |
论文中文摘要: |
煤炭是我国工业发展的重要能源,每年都有着非常大的开采规模。但由于煤矿井下亮度低、粉尘含量高、作业环境恶劣,事故时有发生。因此,矿井工作人员的安全保障尤为重要,煤矿井下行人检测与跟踪对煤矿的安全生产具有重要意义。本文对矿井下监控装置采集到的视频图像进行手动标注,构建了小型的煤矿井下行人检测与跟踪数据集,并在此基础上对煤矿井下行人检测与跟踪问题进行研究,主要内容如下: (1)针对煤矿井下行人检测精度不足、实时性要求高、环境条件差、行人状态复杂等问题,提出一种改进的FCOS煤矿井下行人检测算法。该模型使用轻量级卷积神经网络ShuffleNet V2替换FCOS检测算法中的骨干网络ResNet-50,将原始网络中的特征金字塔结构改进为自上而下和自下而上的路径聚合网络,同时利用由两组深度可分离卷积组成的轻量化检测头替换原始FCOS网络的检测头。在实验训练过程中,通过对井下行人检测数据进行尺度和颜色数据增强来提升模型的泛化能力与鲁棒性。实验结果显示,改进的FCOS可以更好地实现精度与速度之间的平衡,该算法在基本不损失精度的情况下,mAP达51.9%,FPS可以达到100帧/s。 (2)基于改进的FCOS目标检测器的输出结果,使用卡尔曼滤波器进行预测与更新,并利用改进的匈牙利匹配算法对跟踪轨迹与检测结果进行数据关联与匹配。具体来说,为了充分利用因遮挡等问题导致检测器所输出的一部分置信度较低的检测框,将检测器的输出结果通过阈值设定的方式,分为高分检测框集合和低分检测框集合。首先将高分检测框集合与原跟踪轨迹进行关联和匹配,然后将未能成功与高分检测框匹配的跟踪轨迹与低分检测框集合进行关联与匹配,挖掘出低分检测框中有价值的目标信息,从而降低漏检并提高跟踪轨迹的连续性。通过在自构建的多个场景的煤矿井下数据集上进行实验,结果显示,改进的跟踪算法能够显著的提升跟踪器的跟踪性能。 |
论文外文摘要: |
Coal is an important energy source for China's industrial development and has a very large mining scale every year. However, due to low brightness, high dust content and bad working environment, accidents often occur in coal mines. Therefore, the safety guarantee of mine workers is particularly important, and the detection and tracking of underground pedestrians is of great significance to the safety production of coal mines. In this paper, the video images collected by the underground monitoring device are manually annotated, and a small data set of underground coal mine pedestrian detection and tracking is constructed. Based on this, the detection and tracking of underground coal mine pedestrian is studied. The main contents are as follows: (1) An improved FCOS pedestrian detection algorithm is proposed to solve the problems of insufficient detection accuracy, high real-time requirement, poor environmental conditions and complex pedestrian status in underground coal mine. In this model, a lightweight convolutional neural network ShuffleNet V2 is used to replace the backbone network ResNET-50 in FCOS detection algorithm, and the feature pyramid structure in the original network is improved into a top-down and bottom-up path aggregation network. At the same time, the detection head of the original FCOS network is replaced by a lightweight detection head composed of two sets of depth-separable convolution. In the course of experimental training, the generalization ability and robustness of the model are improved by enhancing the scale and color data of downhole pedestrian detection data. Experimental results show that the improved FCOS algorithm can achieve a better balance between accuracy and speed. The mAP of the algorithm can reach 51.9% and the FPS can reach 100 frames /s without losing accuracy. (2) Based on the output results of the improved FCOS target detector, Kalman filter is used for prediction and update. The improved Hungarian matching algorithm is used to correlate and match the tracking trajectories and detection results. Specifically, in order to make full use of some detection frames with low confidence output by the detector due to occlusion and other problems, the output results of the detector are divided into high-score detection frame set and low-score detection frame set through threshold setting. Firstly, the high-score detection frame set is associated and matched with the original tracking track, and then the tracking track of the high-score detection frame that has not been matched successfully before is associated and matched with the low-score detection frame set, mining the valuable target information in the low-score detection frame, so as to reduce the missed detection and improve the continuity of tracking track. Experimental results show that the improved tracking algorithm can significantly improve the tracking performance of the tracker. |
参考文献: |
[1] 董茜茜. 矿井环境监测中无线传感网分簇路由协议研究[D]. 南京: 南京邮电大学, 2014. [4] 刘文江. 矿井人员井下位置跟踪监控系统的实现方案比较研究[J]. 科技创新与应用, 2012 (08Z): 29-30. [5] 李晓建. 矿井人员目标检测与跟踪算法的研究与实现[D]. 山东: 山东科技大学, 2020. [6] 刘贝. 矿井人员目标检测与跟踪研究[D]. 西安: 西安科技大学, 2020. [7] 耿蒲龙, 宋建成, 刘旭飞, 等. 基于RGB颜色空间的矿井运动目标检测及跟踪方法[J]. 太原理工大学学报, 2017, 48(6): 963-968. [8] 杨铮. 基于视频矿井下的人员计数算法研究[D]. 武汉: 武汉理工大学, 2013. [9] 李首滨. 煤炭工业互联网及其关键技术[J]. 煤炭科学技术, 2020, 48(07): 98-108. [10] 张帆, 李闯. 面向智能矿山与新工科的数字孪生技术研究[J]. 工矿自动化, 2020, 46(5): 15-20. [11] 胡青松, 杨维, 丁恩杰, 等. 煤矿应急救援通信技术的现状与趋势[J]. 通信学报, 2019, 40(05): 163-179. [12] 牛化康, 何小海, 汪晓飞, 等. 一种改进的ViBe目标检测算法[J]. 四川大学学报 (工程科学版), 2014, 46(S2): 104-108. [28] 刘美枝, 杨磊, 高海. 结合角点特征的CamShift 目标跟踪算法研究[J]. 山西大同大学学报: 自然科学版, 2019, 35(5): 14-18. [29] 翟卫欣, 程承旗. 基于Kalman滤波的Camshift运动跟踪算法[J]. 北京大学学报: 自然科学版, 2015(5): 799-804. [32] 董观利, 宋春林. 基于视频的矿井行人越界检测系统[J]. 工矿自动化, 2017(2): 29-34. [33] 原磊明, 王海斌, 刘旭飞,等. 矿井人身安全视频防护系统设计[J]. 煤炭技术, 2018, 37(11):3. [34] 邹斐. 煤矿井下运动目标检测与跟踪研究[D]. 西安: 西安科技大学, 2018. [43] 孙继平. 煤矿安全监控系统联网技术研究[J]. 煤炭学报, 2009, 34(11): 1546-1549. [44] 李伟山, 卫晨, 王琳. 改进的 Faster RCNN 煤矿井下行人检测算法[J]. 计算机工程与应用, 2019, 55(4): 200-207. [53] 王娇娇, 刘政怡, 李辉. 特征融合与objectness加强的显著目标检测[J]. 计算机工程与应用, 2017, 53(2): 195-200. [54] A Welch G F. Kalman filter[J]. Computer Vision: A Reference Guide, 2020: 1-3. |
中图分类号: | TD76 |
开放日期: | 2022-06-23 |