论文中文题名: | 基于深度学习的车辆跟踪算法的研究 |
姓名: | |
学号: | 19208208052 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 085212 |
学科名称: | 工学 - 工程 - 软件工程 |
学生类型: | 硕士 |
学位级别: | 工程硕士 |
学位年度: | 2022 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 机器学习与计算机视觉 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2022-06-22 |
论文答辩日期: | 2022-06-06 |
论文外文题名: | Research on Vehicle Tracking Algorithm Based on Deep Learning |
论文中文关键词: | |
论文外文关键词: | object detection ; object tracking ; k-means ; PANet ; YOLOv4 ; DeepSort |
论文中文摘要: |
随着我国城市化进程不断加快,私家车持有率不断提高,汽车行驶过程中产生的交通管理、路径规划、车辆运输等大量的行为信息,为交通信息服务、交通管理、交通运输安全等服务系统的高效智能化提供了基础数据。交通的管控决策、导航软件中的实时路况、自动化车辆控制等关键系统都依赖道路节点的车辆运行数据。车辆目标跟踪在智慧交通系统中起着重要的作用。随着深度学习在机器视觉方面的应用,各种基于计算机视觉的算法被大量用于目标检测和跟踪,如何高效、准确地检测和跟踪车辆成为了该领域的研究热点。由于实际路况中易出现距离摄像头较远时存在车辆较多,同时在图片或视频中像素占比较少等情况导致检测和跟踪准确率较低以及车辆跟踪算法如何实现精度和速度的最佳平衡,本文针对上述问题,所做的具体工作如下: (1)本文提出一种基于改进YOLOv4(You Only Look Once v4)算法的目标检测器,通过二分k-means聚类算法替换K-means算法来获取更具代表性的先验框,从而能够让训练更有针对性,可以加快模型的收敛。同时由于在YOLOv4网络中,目标特征通过自底向上的卷积操作进行传递,但是由于经过反复的卷积操作,目标特征中有关小目标及遮挡目标的特征及位置信息等数据会逐渐丢失,针对上述存在的问题,本文对YOLOv4中PAN进行改进,在PAN中将CSPDarkNet53的上一层ResBlock输出的特征与下一层进行跳跃拼接,增大了特征检测尺度,从而在高层特征空间中保留不同层级空间的特征,通过综合指标mAP值与召回率进行比较,在多云、夜间、晴天和雨天测试场景下,本文算法的mAP值与YOLOv4相比提高了0.23%-2.07%,在混合场景下提高了1.97%,召回率与YOLOv4相比提高了1.88%-4.03%,在混合场景下提高了2.24%。 (2)由于DeepSort的深度外观模型是在人的重识别数据集上训练得到的,本文针对车辆的实际特点,对DeepSort外观特征提取网络进行修改,同时根据车辆的外观特征,对网络模型的输入进行了修改。通过以上两处实验参数的修改使其更适合进行车辆目标跟踪。 (3)DeepSort目标跟踪算法在实时目标跟踪过程中,会将目标车辆的表观特征进行近邻匹配,每帧通过跟踪和级联匹配后会进行特征提取保存和对比,由于每一帧都进行特征提取保存和对比会耗费大量时间,所以会降低目标跟踪的速度。而Sort目标跟踪算法使用简单的卡尔曼滤波器和匈牙利匹配在高帧速率下的性能,但忽略了被检测物体的表面特征,如果物体间发生遮挡,会影响跟踪准确率。本文根据对数据集中车辆遮挡的分析,提出将DeepSort算法与混合一定比例的Sort算法来进行车辆跟踪。通过相应的实验,证实所提出的方法提高了FPS,而准确性几乎没有损失。最后与其他车辆跟踪算法相比,本文提出的改进DeepSort算法同时将改进的YOLOv4作为检测器的车辆跟踪算法无论是跟踪的准确性、跟踪的精确度还是车辆的检测速度上均有提升,从而可以实现精度和速度的最佳平衡。 |
论文外文摘要: |
As urbanization process continues to accelerate, the rate of private car ownership continues to increase, the car driving process produces a large number of traffic management, route planning, vehicle transportation and other behavioral information, for the traffic information services, traffic management, transportation safety and other services such as intelligent and efficient system to provide the basic data. Traffic control and decision-making, real-time road conditions in navigation software, automated vehicle control and other key systems are dependent on the vehicle operation data of road nodes. Vehicle target tracking plays an important role in the intelligent transportation system. With the application of deep learning in machine vision, various computer vision-based algorithms are used in large numbers for target detection and tracking, and how to efficiently and accurately detect and track vehicles has become a hot research topic in this field. As the actual road conditions are prone to the presence of more vehicles when the distance from the camera is far, while the pixel share in the picture or video is relatively small, etc. leading to low detection and tracking accuracy and how the vehicle tracking algorithm achieves the best balance of accuracy and speed, the specific work done in this paper to address the above issues is as follows. (1)This paper proposes a target detector based on the improved YOLOv4(You Only Look Once v4) algorithm, which replaces the K-means algorithm by the dichotomous k-means clustering algorithm to obtain more representative prior frames, thus enabling more targeted training and can accelerate the convergence of the model.In the YOLOv4 network, the target features are passed through the bottom-up convolution operation, but due to the repeated convolution operation, the data about the small targets and the features and location information of the occluded targets in the target features will be gradually lost. In response to the above existing problems, this paper improves the PAN in YOLOv4, in which the upper layer of CSPDarkNet53 This paper improves the PAN in YOLOv4 by jump splicing the features output from CSPDarkNet53 with the next layer to increase the feature detection scale, thus preserving the features of different layer spaces in the high level feature space. scenarios by 1.97% and the recall rate by 1.88%-4.03% compared to YOLOv4 and 2.24% in mixed scenarios. (2)Since DeepSort's deep appearance model is trained on the human re-identification dataset, this paper modifies the DeepSort appearance feature extraction network for the actual characteristics of the vehicles, and also modifies the input of the network model according to the appearance features of the vehicles. The above two experimental parameters are modified to make it more suitable for performing vehicle target tracking. (3)DeepSort target tracking algorithm will match the apparent features of the target vehicle in the nearest neighbor during the real-time target tracking process, and will perform feature extraction and saving and comparison after each frame through tracking and cascade matching, which will reduce the speed of target tracking because it will take a lot of time to perform feature extraction and saving and comparison in each frame. And Sort target tracking algorithm uses simple Kalman filter and Hungarian matching for performance at high frame rate, but ignores the surface features of detected objects, which will affect the tracking accuracy if occlusion occurs between objects. In this paper, based on the analysis of vehicle occlusion in the dataset, we propose to mix DeepSort algorithm with a certain percentage of Sort algorithm for vehicle tracking. Through corresponding experiments, it is confirmed that the proposed method improves the FPS with almost no loss of accuracy. Finally, compared with other vehicle tracking algorithms, the improved DeepSort algorithm proposed in this paper while using the improved YOLOv4 as a detector for vehicle tracking improves both the accuracy of tracking, the precision of tracking and the speed of vehicle detection, so that the best balance of accuracy and speed can be achieved. |
参考文献: |
[1]岳倩.汽车保有量超3亿辆新能源汽车同比增59.25%[N].中国质量报,2022-01-13(006).DOI:10.28164/n.cnki.nczlb.2022.000141. [2]明界日. 基于深度学习和核相关滤波的车辆跟踪研究[D].南昌大学,2021.DOI:10.27232/d.cnki.gnchu.2021.002560. [3]朱凌志. 基于深度学习的复杂场景监控视频车流量统计方法设计[D].南京邮电大学,2021.DOI:10.27251/d.cnki.gnjdc.2021.000319. [4]赵永强,饶元,董世鹏,张君毅.深度学习目标检测方法综述[J].中国图象图形学报,2020,25(04):629-654. [7]吴国强, 黄坤, 刘新世, 等. 基于YOLOv4的无人机实时人脸检测[J]. 中国航天电子技术研究院科学技术委员会 2020 年学术年会优秀论文集, 2020. [17]徐涛,马克,刘才华.基于深度学习的行人多目标跟踪方法[J].吉林大学学报(工学版),2021,51(01):27-38.DOI:10.13229/j.cnki.jdxbgxb20200509. [20]赵诚诚. 基于卷积神经网络的图像分类改进算法的研究[D].南京邮电大学,2020.DOI:10.27251/d.cnki.gnjdc.2020.001191. [32]JOCHER G. Yolov5[EB/OL].[2021-10-14]. Code repository https://github.com/ultralytics/yolov5. [44]王嘉琳. 基于YOLOv5和DeepSORT的多目标跟踪算法研究与应用[D].山东大学,2021.DOI:10.27272/d.cnki.gshdu.2021.000983. [48]缪佳妮,杨金龙,程小雪,葛洪伟.运动信息优化相关滤波的多目标跟踪算法[J].计算机科学与探索,2021,15(07):1310-1321. |
中图分类号: | TP391.41 |
开放日期: | 2022-06-22 |