论文中文题名: | 基于深度学习的烟雾识别与分割研究 |
姓名: | |
学号: | 21208223046 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 085400 |
学科名称: | 工学 - 电子信息 |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2024 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 图像处理 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2024-06-17 |
论文答辩日期: | 2024-05-30 |
论文外文题名: | Deep Learning Based Smoke Recognition and Segmentation Research |
论文中文关键词: | 深度学习 ; Transformer ; 烟雾识别与分割 ; DeepLabV3+ ; 卷积神经网络 |
论文外文关键词: | Deep Learning ; Transformer ; Smoke Recognition and Segmentation ; DeepLabV3+ ; Convolutional Neural Networks |
论文中文摘要: |
火灾是一种极具破坏性的灾害,给人们的生命、财产和环境都带来了严重的威胁。火灾发生的初期会产生大量的烟雾,而在火灾中期才会有火焰的产生。所以为了提前预知火灾并尽早的采取救援措施,烟雾的提前检测显得尤为重要。针对烟雾识别过程中误报率较高以及在烟雾分割中大目标烟雾边缘和小目标分割不理想的问题,本文提出了改进的基于深度学习的烟雾识别与分割方法,旨在提高火灾预警的准确性和效率。 本文主要研究内容如下: (1)针对当前烟雾识别算法存在误报率较高的问题,本文提出了一种结合Inception和Transformer结构的双分支烟雾识别方法TCF-Net。将卷积神经网络学习局部信息的能力与Transformer中的自注意力机制学习全局上下文信息的能力相结合,其次,通过Inception结构使卷积核种类多样化,丰富了特征种类,又减少了通道数的冗余,同时在特征提取过程中嵌入了特征耦合模块(FCU),连续地对双分支中的局部特征和全局信息进行交互,以最大程度保留双分支中的局部信息和全局信息,提高该算法的性能。改进后的网络可以更好的提取烟雾的特征,降低了烟雾识别的误报率,并且将准确率提升至97.8%,证实了该算法有较好的性能。 (2)针对当前大多数烟雾分割算法对大目标烟雾边缘和小目标烟雾分割不理想导致精度较低的问题,本文提出了一种基于改进的DeepLabV3+的轻量化烟雾分割方法。本文将主干网络替换成了MobileNetV2,同时对空洞卷积金字塔池化模块(ASPP)进行了改进,将ASPP的空洞率设置为4、8、12、16以提高对多尺度信息的提取能力,进一步在空洞卷积模块中引入了串联结构来更好的融合多尺度特征,并且在编码部分嵌入了CBAM通道与空间注意力机制,提高了对特征融合的尺度和对小目标的关注程度。改进后的模型相较于原算法,平均交并比(mIoU)和平均像素精确度(mPA)分别提高了4.81%和2.03%。实验结果表明,与DeepLabV3+模型相比,本文方法提升了烟雾分割的速率和精确度。 (3)烟雾识别与分割系统的实现。本研究基于深度学习技术,针对烟雾分类与分割任务,设计并开发了一个系统。该系统结合了本文的两种模型,实现了对烟雾的快速、准确的分类与分割的可视化。 |
论文外文摘要: |
Fire is a very destructive disaster that poses a serious threat to people's lives, property and the environment. Fires produce large amounts of smoke in the early stages of a fire, while flames are not produced until the middle stages of a fire. Therefore, in order to anticipate fires and take rescue measures as early as possible, the early detection of smoke is particularly important. Aiming at the problems of high false alarm rate during smoke recognition and unsatisfactory segmentation of large target smoke edges and small targets in smoke segmentation, this paper proposes an improved deep-learning based smoke recognition and segmentation method, which aims to improve the accuracy and efficiency of fire warning. The main research of this paper is as follows: (1) Aiming at the problem of high false alarm rate of current smoke recognition algorithms, this paper proposes a two-branch smoke recognition method TCF-Net that combines Inception and Transformer structures. combines the ability of convolutional neural network to learn local information with the ability of the self-attention mechanism in Transformer to learn global contextual information, and, secondly, through the Inception structure to diversify the types of convolutional kernels, which enriches the feature variety and reduces the redundancy of the number of channels, and at the same time, the feature coupling module (FCU) is embedded in the process of feature extraction, which continuously interacts with the local features and global information in the two-branch to maximize the retention of local and global information in the two-branch to improve the performance of this algorithm. The improved network can better extract the features of smoke, reduce the false alarm rate of smoke recognition, and increase the accuracy to 97.8%, which confirms that the algorithm has better performance. (2) Aiming at the problem that most current smoke segmentation algorithms are not ideal for large target smoke edges and small target smoke segmentation resulting in low accuracy, this paper proposes a lightweight smoke segmentation method based on the improved DeepLabV3+. In this paper, the backbone network is replaced with MobileNetV2, while the null convolution pyramid pooling module (ASPP) is improved by setting the null rate of the ASPP to 4, 8, 12, and 16 in order to improve the ability of extracting multiscale information, further introducing a tandem structure in the null convolution module for better fusion of the multiscale features and embedding the CBAM in the coding part of the channel with spatial attention mechanism in the coding part to improve the scale of feature fusion and the attention to small targets. The improved model improves the mean intersection and merger ratio (mIoU) and mean pixel accuracy (mPA) by 4.81% and 2.03%, respectively, compared to the original algorithm. The experimental results show that the method in this paper improves the rate and accuracy of smoke segmentation compared to the DeepLabV3+ model. (3) Implementation of a smoke recognition and segmentation system. In this study, a system is designed and developed for smoke classification and segmentation tasks based on deep learning techniques. The system combines the two models in this paper to achieve fast and accurate classification and segmentation visualization of smoke. |
参考文献: |
[2]史劲亭,袁非牛,夏雪.视频烟雾检测研究进展[J].中国图象图形学报,2018,23(03):303-322. [5]冯路佳,王慧琴,王可等.基于目标区域的卷积神经网络火灾烟雾识别[J].激光与光电子学进展,2020,57(16):83-91. [15]王文朋,毛文涛,何建樑等.基于深度迁移学习的烟雾识别方法[J].计算机应用,2017,37(11):3176-3181+3193. [16]陈俊周,汪子杰,陈洪瀚等.基于级联卷积神经网络的视频动态烟雾检测[J].电子科技大学学报,2016,45(06):992-996. [20]朱家辉, 赵志瑛, 贾静静. 基于 YUV 色彩空间的烟雾区域提取方法[J]. 电子技术与软件工程, 2021 (7): 116-117. [23]王浩远,梁煜,张为.融合多分辨率表征的实时烟雾分割算法[J].浙江大学学报(工学版),2021,55(12):2334-2341. [34]刘志赢,谢春思,李进军等.基于改进Deeplabv3+的烟雾区域分割识别算法[J].系统工程与电子技术,2021,43(02):328-335. [39]Oloyede MO, Hancke GP, Myburgh H C. 人脸识别系统综述:最新方法和挑战[J]. 多媒体工具与应用, 2020, 79(37): 27891-27922. [61]Yuan F, Zhang L, Xia X, et al. Deep smoke segmentation[J]. Neurocomputing, 2019, 357: 248-260. |
中图分类号: | TP391.4 |
开放日期: | 2024-06-17 |