论文中文题名: | 基于YOLOv3的目标检测算法研究 |
姓名: | |
学号: | 18207041016 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 081001 |
学科名称: | 工学 - 信息与通信工程 - 通信与信息系统 |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2021 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 数字图像处理 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2021-06-18 |
论文答辩日期: | 2021-06-04 |
论文外文题名: | Research on Object Detection Algorithm Based on YOLOv3 |
论文中文关键词: | |
论文外文关键词: | Object detection ; YOLOv3 ; Attention ; Data augmentation ; Model compression |
论文中文摘要: |
目标检测是图像分析与理解等大量高层次视觉任务的基础,YOLOv3作为目标检测技术中最受欢迎的算法之一,由于具有良好的泛化能力,应用广泛。为了满足终端设备高检测精度、低内存占用的算法部署需求,本文从提高检测精度、压缩模型参数两个方面对目标检测算法进行研究,最终以YOLOv3为基准对其进行改进。 在提高检测精度方面,从可嵌入到模型中的注意力方法和模型训练时使用的数据增强方法两个角度入手。针对注意力,结合YOLOv3算法的检测原理,将锚框信息作为先验知识引入到注意力方法中,得到改进的注意力方法。对比实验表明,使用了改进的注意力方法的YOLOv3算法相比于原始的YOLOv3算法在VOC2007和VOC2012数据集上分别有0.7%和0.4%的mAP提升。针对数据增强,将MixUp和Mosaic两种方法结合得到改进的数据增强方法。为验证改进方法的通用效果,使用基准网络PyramidNet在CIFAR-100和CIFAR-10数据集上进行对比实验,结果表明,模型训练时使用改进的数据增强方法之后,Top-1错误率相比于未使用之前分别降低2.47%和1.31%。 在模型压缩方面,提出将零激活占比大于阈值的卷积核过滤掉之后再进行FPGM剪枝的改进策略。为验证改进方法的通用效果,以ResNet-56为基准网络在CIFAR-10数据集上进行对比实验,结果表明,ResNet-56使用改进的FPGM剪枝策略后,相比于未剪枝之前FLOPs减少53%左右,准确率相比于未剪枝之前提高0.05%左右。 综合以上提出的改进方法,在MS COCO数据集上进行对比实验。结果表明,YOLOv3算法网络结构中添加改进的注意力方法,训练时使用改进的数据增强方法,得到推理模型后再用改进的FPGM剪枝策略进行剪枝,最终AP可达到37.9%,相比于未改进之前提高6.9%,网络参数量由59.6M降低到33.1M。改进的YOLOv3算法相比其他性能优异的算法虽有一些差距,但相比YOLOv3本身仍有较大的性能提升。 |
论文外文摘要: |
Object detection is the basis of a large number of high-level vision tasks such as image analysis and understanding. As one of the most popular algorithms in object detection technology, YOLOv3 has a wide range of applications due to its good generalization ability. In order to meet the requirements of algorithm deployment for terminal equipment with high detection accuracy and low memory usage, this dissertation studies the object detection algorithm from two aspects: improving detection accuracy and compressing model parameters, and finally improves it based on YOLOv3. In terms of improving the detection accuracy, we start from two perspectives: the attention method that can be embedded in the model and the data augmentation method used in model training. For attention, combined with the detection principle of the YOLOv3 algorithm, the anchor frame information is introduced into the attention method as prior knowledge, and an improved attention method is obtained. Comparative experiments show that compared with the original YOLOv3 algorithm, the YOLOv3 algorithm using the improved attention method has 0.7% and 0.4% mAP improvements on the VOC2007 and VOC2012 data sets, respectively. For data augmentation, the two methods of MixUp and Mosaic are combined to get an improved data augmentation method. In order to verify the general effect of the improved method, the benchmark network PyramidNet is used to conduct comparative experiments on the CIFAR-100 and CIFAR-10 data sets. The results show that after the improved data augmentation method is used during model training, the Top-1 error rate is compared to Before use, it was reduced by 2.47% and 1.31%, respectively. In terms of model compression, an improved strategy is proposed to filter out the convolution kernels with a proportion of zero activations greater than the threshold before performing FPGM pruning. In order to verify the general effect of the improved method, a comparative experiment was conducted on the CIFAR-10 data set with ResNet-56 as the benchmark network. The results show that after ResNet-56 uses the improved FPGM pruning strategy, FLOPs are reduced by about 53% compared to before pruning, and the accuracy rate is increased by about 0.05% compared with that before pruning. Based on the improved methods proposed above, comparative experiments are carried out on the MS COCO data set. The results show that the improved attention method is added to the network structure of the YOLOv3 algorithm, and the improved data augmentation method is used during training. After the inference model is obtained, the improved FPGM pruning strategy is used for pruning. The final AP can reach 37.9%, compared with Before improvement, it increased by 6.9%, and the amount of network parameters was reduced from 59.6M to 33.1M. Although the improved YOLOv3 algorithm has some gaps compared with other algorithms with excellent performance, it still has a large performance improvement compared to YOLOv3 itself. |
参考文献: | ﹀ |
中图分类号: | TP391 |
开放日期: | 2021-06-18 |