论文中文题名: | 基于三维卷积神经网络的微表情识别方法研究 |
姓名: | |
学号: | 20208223072 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 085400 |
学科名称: | 工学 - 电子信息 |
学生类型: | 硕士 |
学位级别: | 工程硕士 |
学位年度: | 2023 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 图形图像处理 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2023-06-19 |
论文答辩日期: | 2023-06-05 |
论文外文题名: | Research on Micro-expression Recognition Method Based on 3D Convolutional Neural Network |
论文中文关键词: | |
论文外文关键词: | Micro-expression Recognition Method ; Optical Flow Method ; Double Flow Structure ; Multiscale Feature Extraction |
论文中文摘要: |
微表情是一种当人们试图控制和隐藏自身强烈的情感时,快速呈现出的表情,可以准确反映人们内心想法的真实写照。基于这种性质,微表情研究在临床治疗、犯罪调查、商业谈判等领域都有着巨大的应用潜力。微表情识别属于细粒度视频分类的范畴,与一般的图像分类相比,视频分类更为复杂,还需要考虑时间信息等因素,再加上微表情的脸部肌肉运动幅度非常小且持续时间很短(0.04s-0.2s),使得准确识别微表情变得非常困难。本文将微表情识别分为两个方面,结合上述问题做了如下研究: 在连续、快速微表情识别方面,例如商业谈判、课题教学中,微表情的出现通常间隔比较短,要求算法在确保一定准确率的前提下,还要有较低的识别耗时。所以本文提出了一种基于双流三维卷积神经网络的微表情识别算法。算法采用C3D作为主干网络,并将其第3、4、5层的双层卷积结构改为单层卷积结构,加快模型的推理速度;构建并行双流网络,同时传入微表情人脸图片和微表情光流图片,兼顾人脸空间属性和光流运动信息,在不改变网络深度的情况下获得了一定的识别精度提升;加入轻量级的3D时空注意力机制增强模型细节感知能力。在公开数据集SMIC和CASMEⅡ上进行算法性能评估,结果证明所提出的算法在保证准确率的同时识别速度提审升了约15%,可以有效解决快速微表情识别问题。 在更为严谨的、高精度的微表情识别方面,例如临床心理治疗、刑事审讯中,微表情识别有较为充裕的判断时间,不需要有很快的识别速度,但是识别错误的代价会比较大,要求算法有较高的识别精度。所以本文提出了一种基于多尺度三维残差卷积神经网络的微表情识别算法。算法采用3D-ResNet50作为主干网络,为了提高算法的特征提取能力,空间上,在网络中加入不同尺寸的多尺度卷积模块,整合全局信息和局部信息;时间上,将模型不同层数得到的特征图使用注意力特征模块进行融合,增强模型的上下文感知能力,再将融合结果分层输出,选择准确率最高的结果作为模型的输出。在公开数据集SMIC和CASMEⅡ上进行算法性能评估,均取得最高准确率,在高精度微表情识别方面可以作为重要参考依据。 开发设计微表情识别系统,该系统包括用户模块和微表情模块,用户模块主要有用户登录和用户数据管理两个功能,微表情模块主要有微表情识别和微表情数据管理两个功能,微表情识别功能集成上述两种微表情识别算法,面对不同场景,用户可以选择不同的微表情识别算法。 |
论文外文摘要: |
Micro-expression is a kind of expression that appears quickly when people try to control and hide their strong emotions, which can accurately reflect the true reflection of people's inner thoughts. Based on this nature, microexpression research has great potential in clinical treatment, crime investigation, business negotiation and other fields. Micro-expression recognition belongs to the category of fine-grained video classification. compared with general image classification, video classification is more complex, and factors such as time information need to be taken into account. in addition, the range of facial muscle movement of micro-expression is very small and the duration is very short (0.04s-0.2s), which makes it very difficult to identify micro-expression accurately. In this paper, microexpression recognition is divided into two aspects, combined with the above problems to do the following research: In the aspect of continuous and fast micro-expression recognition, such as business negotiation and project teaching, the interval between the occurrence of micro-expression is usually relatively short, which requires the algorithm to have a lower recognition time on the premise of ensuring a certain accuracy. Therefore, a microexpression recognition algorithm based on double-flow 3D convolution neural network is proposed in this paper. In the algorithm, C3D is used as the backbone network, and the double-layer convolution structure of layer 3, 4 and 5 is changed into single-layer convolution structure to speed up the reasoning speed of the model. At the same time, micro-expression facial images and micro-expression optical flow images are introduced, and the recognition accuracy is improved without changing the depth of the network. A lightweight 3D spatio-temporal attention mechanism is added to enhance the detail perception of the model. The performance of the algorithm is evaluated on the open data sets SMIC and CASME Ⅱ, and the results show that the proposed algorithm improves the recognition speed by about 15% while ensuring the accuracy, which can effectively solve the problem of fast micro-expression recognition. In the aspect of more rigorous and high-precision micro-expression recognition, such as clinical psychotherapy and criminal interrogation, micro-expression recognition has plenty of judgment time and does not need to have fast recognition speed, but the cost of recognition errors will be high. the algorithm is required to have high recognition accuracy. Therefore, a microexpression recognition algorithm based on multi-scale 3D residual convolution neural network is proposed in this paper. The algorithm uses 3D-ResNet50 as the backbone network. In order to improve the feature extraction ability of the algorithm, multi-scale convolution modules of different sizes are added to the network to integrate global information and local information. In terms of time, the feature images obtained from different layers of the model are fused with the attention feature module to enhance the context awareness of the model, and then output the fusion results layer by layer, and select the result with the highest accuracy as the output of the model. The performance of the algorithm is evaluated on the open data sets SMIC and CASME II, and the highest accuracy is achieved, which can be used as an important reference in high-precision microexpression recognition. Develop and design a micro-expression recognition system. The system includes a user module and a micro-expression module. The user module mainly has two functions of user login and user management. The micro-expression module mainly has two functions of micro-expression recognition and micro-expression data management. Micro-expression recognition function Integrating the above two micro-expression recognition algorithms, users can choose the corresponding micro-expression recognition algorithm for different scenarios. |
参考文献: |
[9] Ekman P. Lie catching and microexpressions[J]. The philosophy of deception, 2009, 1(2): 5-5. [10] 何景琳,梁正友,孙宇,等. 结合C3D与光流法的微表情自动识别[J].计算机系统应用,2021,30(1):221-227. [11] 吴奇,申寻兵,傅小兰. 微表情研究及其应用[J]. 心理科学进展,2010,18(9): 1359-1368. [12] 魏文辉. 基于代表性 AU 区域的微表情识别算法研究 [D]. 济南:山东大学,2021 [13] 张学森,贾静平.基于三维卷积神经网络和峰值帧光流的微表情识别算法[J].模式识别与人工智能,2021,34(5) : 423-433. [16] 周伟航,肖正清,钱育蓉,等.微表情自动分析方法研究综述[J]. 计算机应用研究,2022, 39(07):1921-1932 [20] 许诗琪. 基于深度学习的微表情识别算法研究[D].北方工业大学,2022. [32] 马浩原, 安高云, 阮秋琦. 平均光流方向直方图描述的微表情识别[J]. 信号处理, 2018, 34(3): 279-288 [36] Cortes, C., and V apnik, V. Support-vector networks[J]. Mach. Learn. 1995:20, 273– 297. [44] 李星燃,张立言,姚树婧.结合特征融合和注意力机制的微表情识别方法[J].计算机科 学,2022,49(02):4-11 [45] 林宇凌, 金晓宏, 王中任. 基于LK光流法的微流控芯片中流体速度检测[J]. 激光与 红外, 2020(008):50-63. [46] 刘子琦. 基于计算机视觉的高铁桥梁结构位移测量方法研究[D].哈尔滨工业大 学,2020. [55] 朱威,屈景怡,吴仁彪.结合批归一化的直通卷积神经网络图像分类算法[J].计算机 辅助设计与图形学学报,2017,29(9):1650-1657 [57] 刘建伟,赵会丹,罗雄麟.深度学习批归一化及其相关算法研究进展[J].自动化学 报,2020,46(6):1090-1120. |
中图分类号: | TP391.41 |
开放日期: | 2023-06-19 |