论文中文题名: | 基于表面肌电信号的手势动作识别算法研究 |
姓名: | |
学号: | 21208088027 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 083500 |
学科名称: | 工学 - 软件工程 |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2024 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 人工智能 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2024-06-12 |
论文答辩日期: | 2024-05-30 |
论文外文题名: | Research on gesture action recognition algorithm based on surface EMG signals |
论文中文关键词: | |
论文外文关键词: | Surface EMG Signal ; Feature Dimensionality Reduction ; Convolutional Neural Networks ; Classification Algorithms ; Gesture Classification |
论文中文摘要: |
表面肌电信号作为一种记录肌肉收缩活动的生物电信号,为研究和分析人体肌肉功能提供了一种非侵入式的手段。随着人工智能的发展,基于肌电信号的手势识别被广泛应用于工业自动化、康复医学以及虚拟现实等领域,其中工业自动化包含控制机器人、远程控制生产设备,康复医学包含假肢控制、运动辅助,虚拟现实包含手势导航或控制、手势交互游戏。在手势识别的研究过程中,采集数据、预处理、特征提取和分类是实现高效、准确识别的关键步骤。本文主要在特征提取、分类识别进行了深入研究,并提出了改进方法。主要的研究工作如下: (1)在机械手臂的手势识别中,针对特征提取方法提取到的特征维度高的问题,本文提出了基于相关性热力图的特征降维方法,旨在通过降低特征维度的同时提取到数据的核心信息,以提高后续分类任务的效率和准确性。该方法的核心思想是利用统计学中的协方差和方差来量化sEMG中不同通道之间的相关性,从而筛选出具有代表性的特征。为了验证所提方法的有效性,在特征降维后,本文采用了三种传统的机器学习分类算法对降维后的特征进行了分类处理,并对这三种特征降维方法的性能进行了全面评估。实验结果表明,通过相关性热力图降维后,使用随机森林分类的准确率为81.6%,证明了所提出的降维方法在有效减少特征数量的同时,能够显著提升表面肌电信号分类的准确性。 (2)针对手势识别算法中模型分类准确率低、网络结构冗余的问题,本文提出了基于双卷积神经网络的分类算法。首先,通过小波阈值去噪对原始肌电信号进行预处理,这是因为肌电信号采集过程中会受到各种噪声的影响,这些噪声会严重影响信号的质量和后续处理的准确性;其次,为了能够提取到更全面、更抽象的特征信息,将三层双卷积层和最大池化层结合,从输入的原始信号中精确提取、优化过滤出有明显差别的特征图;最后,在特征提取和优化过程完成之后,通过引入全连接层的设计,将之前双卷积层和最大池化层处理后的高维特征映射转化为一维向量形式。全连接层的加入,进一步提高了网络处理不同信号分类问题时的适应性和效率,确保整体模型在高维特征空间中的映射能力和训练效率。实验结果表明,DUAL-CNN在单通道上的平均准确率为95.91%,与使用同一数据集的其他算法相比,该算法提高了整体手势分类的准确率。 本文所提出的特征降维和分类识别算法,有效提升了基于表面肌电信号的手势动作识别的性能,这不仅为残疾人士提供了新的希望,还展示了科技进步给社会带来的积极影响,推动了人机交互技术的发展。未来,基于表面肌电信号的手势识别技术将在更多领域发挥更大的作用。 |
论文外文摘要: |
Surface electromyography, as a kind of bioelectrical signals that record muscle contraction activities, provide a non-invasive means to study and analyze human muscle functions. With the development of artificial intelligence, gesture recognition based on EMG signals is widely used in the fields of industrial automation, rehabilitation medicine, and virtual reality, where industrial automation includes controlling robots and remotely controlling production equipment, rehabilitation medicine includes prosthetic limb control and motion assistance, and virtual reality includes gesture navigation or control, and gesture-interactive games. In the research process of gesture recognition, data acquisition, preprocessing, feature extraction and classification are the key steps to realize efficient and accurate recognition. In this paper, two methods are improved in feature extraction and classification recognition to improve the accuracy and efficiency of gesture recognition, and the main research work is as follows: (1) In gesture recognition of robotic arms, for the problem of high dimensionality of features extracted by feature extraction methods, this paper proposes a feature dimensionality reduction method based on correlation heat map, which aims to extract the core information of the data by reducing the dimensionality of the features at the same time, in order to improve the efficiency and accuracy of the subsequent classification tasks. The core idea of the method is to utilize the covariance and variance in statistics to quantify the correlation between different channels in the sEMG, so as to filter out representative features. In order to verify the effectiveness of the proposed method, after feature dimensionality reduction, the paper employs three traditional machine learning classification algorithms to categorize the dimensionality reduced features and comprehensively evaluates the performance of these three feature dimensionality reduction methods. The experimental results show that the accuracy of classification using random forest is 81.6% after dimensionality reduction by correlation heat map, which proves that the proposed dimensionality reduction method can significantly improve the accuracy of sEMG signal classification while effectively reducing the number of features. (2)Aiming at the problems of low model classification accuracy and redundant network structure in gesture recognition algorithms, the paper proposes a classification algorithm based on dual convolutional neural networks. Firstly, the original EMG signal is preprocessed by wavelet threshold denoising, because the EMG signal acquisition process will be affected by a variety of noises, which will seriously affect the quality of the signal and the accuracy of the subsequent processing. Secondly, in order to be able to extract a more comprehensive and higher abstract feature information, the three-layer dual convolutional layer and the maximal pooling layer are combined, and the feature maps are accurately extracted, optimized and filtered out from the input original signal. Finally, after the feature extraction and optimization process is completed, the feature maps with significant differences are extracted and optimized by the dual convolutional neural networks, filter out the feature maps with obvious differences. After the feature extraction and optimization process is completed, the high-dimensional feature maps previously processed by the dual convolutional layer and the maximum pooling layer are transformed into a one-dimensional vector form by the introduction of the fully connected layer design. The addition of the fully connected layer further improves the adaptability and efficiency of the network when dealing with different signal classification problems, and ensures the mapping ability and training efficiency of the overall model in the high-dimensional feature space. The experimental results show that DUAL-CNN has an average accuracy of 95.91% on a single channel, which improves the overall gesture classification accuracy compared to other algorithms using the same dataset. The feature reduction and classification recognition algorithm proposed in the paper effectively improves the performance of gesture action recognition based on sEMG signal, which not only provides a new hope for people with disabilities, but also promotes the development of human-computer interaction technology and demonstrates the positive impact of scientific and technological progress on society. In the future, sEMG signal-based gesture recognition technology will play a greater role in more fields. |
参考文献: |
[1]苏芳茵. 表面肌电信号分析及运动模式分类研究[D]. 厦门: 华侨大学, 2016. [2]张志勇. 肌电信号采集与肌电假肢控制的研究[D]. 哈尔滨: 哈尔滨工业大学, 2010. [5]李阳. 智能仿生手臂肌电信号—运动模型化与模式识别理论方法研究[D]. 长春: 吉林大学, 2012. [6]丁其川, 熊安斌, 赵新刚, 等. 基于表面肌电的运动意图识别方法研究及应用综述[J].自动化学报, 2016, 42(1): 13-25. [7]周杨. 基于深度学习的肌电手势识别算法研究[D]. 南京: 南京邮电大学, 2023. [20]周志华. 机器学习[M]. 北京: 清华大学出版社, 2016: 133-140. [27]仝润泽. 面向手势识别的表面肌电信号分类算法研究[D]. 浙江: 浙江工业大学, 2020. [43]Li X, Samuel O W, Zhang X, et al. A motion-classification strategy based on [44]杨彬, 黄鹏程, 杨庆华, 等. 基于肌电控制的灵巧手及其控制系统设计[J]. 机电工程, 2016, 33(4): 388-394. [45]尤波, 刘意, 赵汗青, 等. 人体肌电信号的生理层模型[J]. 计算机仿真, 2015, 32(1): 367-371. [48]朱昊, 辛长宇, 吉小军, 等. 表面肌电信号前端处理电路与采集系统设计[J]. 测控技术, 2008, 27(3): 37-39. [49]侯文生, 万莎, 吴小鹰, 等. 手指运动姿态检测及假肢手指动作实时控制[J]. 重庆大学学报, 2013, 36(4): 103-109. [50]王聪. 基于RGB-D相机的AGV三维环境感知重建和路径规划研究[D]. 南昌: 南昌大学, 2022. [52]钱晓进, 郑如冰, 王传林. 高性能肌电检测前置放大器的设计[J]. 现代科学仪器, 2003, (3): 50-52. [53]陈斌. 基于表面肌电信号的手势识别研究[D]. 南昌: 南昌大学, 2021. [54]韦晔. 基于Myo的手势和手臂运动识别研究[D]. 沈阳: 沈阳工业大学, 2021. [58]张廷尉, 陈红, 王磊. 基于 Matlab的巴特沃斯数字低通滤波器的设计[J]. 鞍山师范学院学报, 2012, 14(2): 13-15. [59]卢朝林. 表面肌电信号采集臂带的设计与手势识别应用[D]. 成都: 电子科技大学, 2020. [61]于然, 付周兴, 王清亮. 基于MATLAB的电弧建模仿真及故障分析[J]. 高压电器, 2011, (9): 95-99. [62]李航. 统计学习方法[M]. 北京: 清华大学出版社, 2012: 95-135. [64]黄晓娟, 张莉. 改进的多类支持向量机递归特征消除在癌症多分类中的应用[J]. 计算机应用, 2015, 35(10): 2798-2802. [67]唐胡鑫. 网站运营与数据分析双色版[M]. 北京: 航空工业出版社, 2016: 81-82. [68]黄勤龙. 云计算数据安全[M], 北京: 北京邮电大学出版社, 2018: 95. [70]Vapnik,V. Statistical learning theory[J]. Wiley, 1998, 3: 401-492. [71]边肇祺. 模式识别[M]. 清华大学出版社有限公司, 2000. [73]Breiman L. Random forests[J]. Machine learning, 2001, 45: 5-32. [78]窦慧, 张凌茗, 韩峰, 等. 卷积神经网络的可解释性研究综述[J]. 软件学报, 2023, 35(1): 159-184. [80]黄妤宁. 非合作通信环境下飞机声信号类型识别的方法研究[D]. 哈尔滨: 哈尔滨工程大学, 2019. [82]陈永生, 喻玲娟, 谢晓春. 基于全卷积神经网络的 SAR 图像目标分类[J]. 雷达科学与技术, 2018, 16(3): 242-248. [84]Zhi-Hua, Zhou, Ji, et al. Deep forest[J]. National Science Review, 2019, 6(1): 74-86. |
中图分类号: | TP391 |
开放日期: | 2024-06-13 |