论文中文题名: | 基于多模态融合的体征状态识别方法研究 |
姓名: | |
学号: | 20206223076 |
保密级别: | 保密(1年后开放) |
论文语种: | chi |
学科代码: | 085400 |
学科名称: | 工学 - 电子信息 |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2032 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 人工智能与模式识别 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2023-06-19 |
论文答辩日期: | 2023-06-02 |
论文外文题名: | Research of Body Sign Recognition Based on Multimodal Fusion |
论文中文关键词: | 体征状态识别 ; 多模态融合 ; 小波包分解 ; 门控循环单元网络 ; Transformer+ 模型 |
论文外文关键词: | Body sign recognition ; Multimodal fusion ; Wavelet packet decomposition ; Gate recurrent unit ; Transformer+ model |
论文中文摘要: |
随着体域网技术和通信技术的发展,体征状态识别的实际应用需求已不满足于对基本生命体征状态的监测,而是期望能识别出更加全面且复杂的体征状态。本研究以构建科研人员体征状态的智能识别系统为背景,研究了人体基本生命体征状态、情绪状态、疲劳状态和综合体征状态这4 种体征状态的识别方法。主要研究工作如下: |
论文外文摘要: |
With the development of body area network technology and communication technology, the practical application requirements of body sign recognition are no longer limited to monitoring basic body sign, but rather expect to recognize more comprehensive and complex body sign. This research focuses on building an intelligent recognition system for the body sign of scientific researchers, and studies the recognition methods of four body signs: basic body sign, emotional state, fatigue state, and comprehensive body sign. The main research work includes: 1. To solve the problem of missing data set of relevant signs, this study selected ECG signal, skin electrical signal, body temperature, oxygen saturation, blood pressure and facial video, and collected and preprocessed them, thus realizing the construction of body sign dataset. 2. Aiming at the noise and redundant information of physiological signals, this paper proposes a physiological signal feature extraction (PSFE) model: firstly, the wavelet packet decomposition method is used to denoise and preliminary feature extraction of physiological signals, and then its time domain features and frequency domain features are further extracted. Aiming at the static features and dynamic features of facial video, this paper proposes a Time and Space Feature Extraction (TSFE) model: firstly, the SE-ResNeXt-50 network is used to extract the static spatial features of facial video. Secondly, the gated recurrent unit network is used to extract the dynamic temporal features of facial video. 3. Aiming at the complex association between multimodal data and body sign, this paper proposes a learning framework of body sign recognition based on multimodal fusion (BSRMMF): firstly, the PSFE model and TSFE model are used to extract the feature information of each modal data; Secondly, the model layer fusion method in parallel form is selected as the feature fusion method. Thirdly, the Transformer+ model is proposed as a specific feature fusion model, so as to effectively fuse multimodal feature information. Finally, the fully connected layer and the Softmax activation function are used to identify the corresponding physical states. For the BSRM-MF learning framework proposed in this paper, this paper conducts relevant experimental verification: firstly, the ablation experiment of different input data verifies the reliability and effectiveness of the BSRM-MF learning framework, and the recognition accuracy can be achieved up to 94.62%. Secondly, the comparative experiment of the feature fusion model verifies that the Transformer+ model has better performance than the graph attention model. Finally, model verification experiments on the public dataset RAMAS further verify the effectiveness and reliability of the BSRM-MF learning framework. 4. In view of the application requirements of body sign recognition, this paper conducts relevant research and design of the body sign recognition system, and carries out modular + hierarchical design of the system according to the dimension of data processing: for the first time, different data processing processes are packaged into modules, so as to realize a low-coupling, expandable and maintainable sign state recognition system; Secondly, the logical architecture of the system is divided into perception layer, network layer, and application layer, and data collection, transmission, recognition and visual display are carried out respectively, so as to provide services for human body sign recognition. In this paper, by constructing a dataset, learning framework and system for body sign recognition based on multimodal fusion, this paper realizes the accurate recognition of the body sign of researchers. The research results can provide guidance for the daily life of researchers, so as to maintain the physical and mental health of researchers and further improve the quality of scientific research of researchers. In addition, in the context of the gradual maturity of wearable devices, the research can also be applied to telemedicine and health monitoring. This study provides a research basis for the practical application of body sign recognition, and has important research value and practical significance. |
参考文献: |
[2] 石长慧, 李睿婕, 何光喜, 等. 我国科研人员身心健康状况及干预对策研究[J]. 中国科技人才, 2022, 67(5): 51–59. [4] 刘昌鑫. 基于多生理信号融合的体征识别技术研究与实现[D]. 南京: 东南大学, 2020. [5] 邓丽娜, 王晓亮. 基于生理信号的情绪识别研究综述[J]. 物联网技术, 2021, 11(7): 33–41. [18] 刘军, 景晓军, 孙松林, 等. 一种用于人脸识别的基于主导近邻像素的局部 Gabor 空间直方图特征 [J]. 北京邮电大学学报, 2015, 38(1): 51–54. [21] 刘涛, 周先春, 严锡君. 基于光流特征与高斯 LDA 的面部表情识别算法 [J]. 计算机科学, 2018, 45(10): 286–290+319. [22] 张鹏, 孔韦韦, 滕金保. 基于多尺度特征注意力机制的人脸表情识别 [J]. 计算机工程与应用, 2022, 58(1): 182–189. [24] Elman J L. Finding structure in time[J]. Cognitive Science, 1990, 14(2): 179–211. [42] 赵小明, 杨轶娇, 张石清. 面向深度学习的多模态情感识别研究进展[J]. 计算机科学与探索, 2022(1479-1503). [43] 王黎, 韩清鹏. 人体生理信号的非线性分析方法[M]. 北京: 科学出版社, 2011: 1–96. [44] 马永昊. 基于心电的非接触精神疲劳监测系统的设计与研究[D]. 兰州: 兰州大学, 2019. [45] 张启飞. 皮肤电信号下学习焦虑的识别与调节技术研究[D]. 重庆: 西南大学, 2017. [52] 王黎, 韩清鹏. 人体生理信号的非线性分析方法[M]. 北京: 科学出版社, 2011: 1–96. [60] 邱锡鹏. 神经网络与深度学习[M]. 北京: 清华大学出版社, 2019: 145–147. [63] 刘超, 朱波. 融合画像和文本信息的轻量级关系图注意推荐模型[J]. 计算机应用研究, 2022, 40(4): 1037–1043. |
中图分类号: | TP391 |
开放日期: | 2024-06-20 |