论文中文题名: | 基于时空信息交互的夜间人体动作识别方法研究与实现 |
姓名: | |
学号: | 20208223066 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 0854 |
学科名称: | 工学 - 电子信息 |
学生类型: | 硕士 |
学位级别: | 工程硕士 |
学位年度: | 2023 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 图形图像处理 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2023-06-15 |
论文答辩日期: | 2023-06-06 |
论文外文题名: | Research and Implementation of Night Human Motion Recognition Method Based on Space-time Information Interaction |
论文中文关键词: | |
论文外文关键词: | Image Enhancement ; Motion Recognition ; Illumination Curve Estimation ; Spatiotemporal Information Interaction ; Attention Mechanism |
论文中文摘要: |
随着计算机视觉领域的迅猛发展,深度学习技术已经在图像处理、目标识别等任务中取得了成就。然而,研究正逐渐转向视频,因为生活和工作环境中随处可见监控摄像头,如果仅靠人工监控,每时每刻产生的大量监控视频将耗费大量人力、财力和物力。人体动作识别任务也成为了监控视频下的一个重要任务,人体动作识别任务的场景通常是在良好的视觉条件下进行的,而在夜间视觉场景下研究较少。因此,本文针对现有的夜间人体动作识别算法准确率低的问题,对相关技术进行了研究与应用。本课题完成的主要工作与创新如下: (1)针对传统图像增强算法中参数固定而导致增强后的图像各个区域无法得到有效提升,且基于深度学习的图像增强算法太过于依赖于配对训练的数据集等问题,本文提出了一种基于MDIFE-Net曲线估计的夜间图像增强算法。首先,基于灰度变换方法设计了一种光照估计曲线,通过光照估计曲线对图像进行像素级的调整,将夜间低光图像域映射到增强图像域,有效消除光照不足所带来的影响;其次,提出了基于Mish函数的深度光照特征提取网络(Mish Deep Illumination Feature Extraction Network, MDIFE-Net)提取图像特征,去掉了无参考深度曲线估计网络模型所有的下采样层和批处理归一化层,防止其破坏相邻像素之间的关系,用更加平滑的Mish激活函数代替了Relu激活函数,从而可以使参数更好地进行更新;最后,设计了一种联合多项损失的光照估计损失函数来驱动夜间图像增强算法,解决了成对数据集难以构建的问题。实验结果表明,本文算法在夜间ARID数据集上的NIQE和STD指标结果分别达到了12.283和67.472,相较于新颖的Zero-DCE算法,分别降低和提升了1.866和13.605,能够有效提升夜间图像的清晰度和对比度,为后续人体动作识别提供了良好的基础。 (2)针对深度学习领域中,人体动作识别算法对时间信息、空间信息以及背景信息总是进行同等处理,而造成人体动作识别算法精度不高的问题,本文提出了一种基于时空信息交互的人体动作识别算法。首先,提出了一个双路径网络以不同的刷新率分别学习空间和时间信息,包括一个在低帧率下运行以捕获空间语义信息的稀疏路径,以及一个并行的在高帧率下运行以捕获时序运动信息的密集路径;其次,为了从视频中提取更具有区分性的特征,提出了交叉双注意力交互模型将注意力集中在视频片段的重点区域,并在两条路径之间明确的交换时空信息。实验结果表明,本文算法在UCF101数据集和HMDB51数据集上的准确率分别达到了97.6%和78.4%,相较于新颖的Slowfast算法分别提升了1.8%和1.4%,取得了更高的准确率。结合基于MDIFE-Net曲线估计的夜间图像增强算法在夜间ARID数据集上的准确率达到了83.2%,比图像增强前的动作识别准确率提升了22.9%,能够有效的识别夜间人体动作,具有良好的实战意义。 (3)本文将所提出的夜间图像增强模型与人体动作识别模型进行实际应用。通过系统的需求分析,设计并实现了一套基于B/S架构的夜间人体动作识别系统,并对结果进行了可视化的展示,最后对该系统进行了功能测试,得到了能够满足用户需求的夜间人体动作识别系统。 综上所述,本文的工作主要从夜间图像增强和人体动作识别两个方向展开研究,针对夜间人体动作识别算法准确率低的问题,在夜间图像增强算法和人体动作识别算法上进行了改进和优化,搭建了相应的网络结构,通过实验进行了验证,达到了预期的研究目标,并将所提出的算法落地实用,搭建了一套基于B/S架构的夜间人体动作识别系统。 |
论文外文摘要: |
With the rapid development of computer vision, deep learning technology has made achievements in image processing, target recognition and other tasks. However, research is gradually turning to video because surveillance cameras can be seen everywhere in your life and work environment. If you only rely on manual surveillance, the large amount of surveillance video generated at any time will consume a lot of manpower, money and material resources. Human motion recognition task has also become an important task under surveillance video. Scenes of human motion recognition task are usually performed under good visual conditions, but less studied under night vision. Therefore, in order to solve the problem of low accuracy of the existing night motion recognition algorithms, the related technologies are studied and applied. The main work and innovations completed in this project are as follows: (1)A night image enhancement algorithm based on MDIFE-Net curve estimation is proposed to solve the problem that the parameters of traditional image enhancement algorithms are fixed, resulting in the enhancement of each area of the image cannot be effectively improved, and the deep learning-based image enhancement algorithm is too dependent on paired training datasets. First, an illumination estimation curve is designed based on the gray transformation method. By adjusting the pixel level of the image with the illumination estimation curve, the night low-light image domain is mapped to the enhanced image domain, which effectively eliminates the impact of the insufficient illumination. Secondly, a Mish Deep Illumination Feature Extraction Network (MDIFE-Net) based on Mish function is proposed to extract image features, eliminating all down-sampling layers and batch normalization layers of the network model without reference depth curve estimation, so as to prevent them from destroying the relationship between adjacent pixels, and replacing the Relu activation function with a smoother Mish activation function, so that the parameters can be updated better. Finally, a light loss estimation function combined with multiple losses is designed to drive the night image enhancement algorithm, which solves the problem that paired datasets are difficult to build. The experimental results show that the NIQE and STD indices obtained by this algorithm on night ARID dataset are 12.283 and 67.472, respectively. Compared with the novel Zero-DCE algorithm, the NIQE and STD indices are reduced and improved by 1.866 and 13.605, respectively, which can effectively improve the sharpness and contrast of night images, and provide a good basis for subsequent human motion recognition. (2)In the field of deep learning, human motion recognition algorithms always process time information, spatial information and background information equally, which results in low accuracy of human motion recognition algorithms. This paper presents a human motion recognition algorithm based on space-time information interaction. First, a two-path network is proposed to learn spatial and temporal information at different refresh rates, including a sparse path running at a low frame rate to capture spatial semantic information and a dense parallel path running at a high frame rate to capture temporal motion information. Secondly, in order to extract more distinctive features from the video, a cross-bi-attention interaction model is proposed, which focuses attention on the key areas of the video clips and explicitly exchanges space-time information between the two paths. The experimental results show that the accuracy of this algorithm on UCF101 and HMDB51 datasets is 97.6% and 78.4%, respectively, which is 1.8% and 1.4% higher than that of the novel Slowfast algorithm. The night image enhancement algorithm combined with MDIFE-Net curve estimation achieves 83.2% accuracy on night ARID dataset and 22.9% higher accuracy than motion recognition before image enhancement. It can effectively recognize night human movements and has good practical significance. (3)The night image enhancement model and human motion recognition model proposed in this paper are applied in practice. Through the system requirements analysis, a night human motion recognition system based on B/S architecture is designed and implemented, and the results are visualized. Finally, the function of the system is tested, and a night human motion recognition system that can meet the needs of users is obtained. In summary, this paper mainly studies night image enhancement and human motion recognition. To solve the problem of low accuracy of night human motion recognition algorithm, the night image enhancement algorithm and human motion recognition algorithm are improved and optimized, and the corresponding network structure is built. The experimental results verify that the expected research goals are achieved, and the proposed algorithm is practical. A night motion recognition system based on B/S architecture is built. |
参考文献: |
[2]闫航. 康复训练场景下的动作与行为识别方法研究[D]. 郑州: 郑州大学, 2020. [4]乔阳阳. 面向智能家居的动作识别关键技术研究[D]. 辽宁: 辽宁科技大学, 2021. [6]黄勇康, 梁美玉, 王笑笑, 陈徵, 曹晓雯. 基于深度时空残差卷积神经网络的课堂教学视频中多人课堂行为识别[J]. 计算机应用, 2022, 42(3): 736-742. [7]贺斌. 基于深度学习的考场作弊行为分析与研究[D], 成都: 电子科技大学, 2021. [8]陆晴. 基于深度学习的异常行为识别算法研究[D], 哈尔滨: 哈尔滨工业大学, 2018. |
中图分类号: | TP391.41 |
开放日期: | 2023-06-19 |