论文中文题名: | 基于Attention-CNN的文本情感分类研究 |
姓名: | |
学号: | 18208207038 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 085211 |
学科名称: | 工学 - 工程 - 计算机技术 |
学生类型: | 硕士 |
学位级别: | 工程硕士 |
学位年度: | 2021 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 软件开发与测试工程 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2021-06-22 |
论文答辩日期: | 2021-06-04 |
论文外文题名: | Research on Text Sentiment Classification Based on Attention-CNN |
论文中文关键词: | |
论文外文关键词: | word vector ; convolutional neural network ; Gated neural network ; attention mechanism ; sentiment classification |
论文中文摘要: |
文本情感分类是利用计算机技术对含有情感特征的数据进行处理,它能够识别和提取文本中的主观信息。由于基于传统方法的文本情感分类对特征信息提取的不够充分,而基于深度学习的文本情感分类方法可以改善这一问题,因此,深度学习成为了文本情感分类的主要研究方法。论文为了提取更多的特征信息,提高分类准确率,提出了非静态的FastText词向量模型和Attention-CNN的融合模型。论文的主要研究内容如下: (1)提出了一种基于非静态的FastText词向量的方法。该方法通过FastText模型进行词向量训练得到初始的词向量矩阵,并在训练过程中作为参数训练,通过不断进行调整和更新,使得词向量更加适用于该文本情感分类任务,获得更多词语之间的语义和语法关系,从而提高分类准确率。在卷积神经网络进行文本情感分类实验,与静态的FastText、Word2Vec和Doc2vec模型进行相比,非静态的FastText模型在公开的评论数据集上,准确率提高了2.54%、6.22%和4.82%;在酒店评论语料库上,准确率提高了1.84%、6.22%和5.10%。 (2)提出了一种基于Attention-CNN的融合模型,该模型结合了卷积神经网络、门控神经网络和注意力机制,针对卷积神经网络无法捕获长距离依赖关系,提取的特征不够充分的问题,首先通过实验确定了相关超参数的取值,包括 Dropout值、batch-size参数和激活函数等;然后进行了模型对比实验,与CNN、LSTM和GRU相比,基于Attention-CNN的融合模型在公开的评论数据集上,准确率提高了4.70% 、3.66%和2.95%,在酒店评论语料库上,准确率提高了3.61%、2.88%和2.03%。 (3)构建了文本内容情感分类系统。该系统验证了基于非静态FastText词向量和基于Attention-CNN的融合模型的有效性,用Python对该系统进行实现,通过可视化操作可以清晰地展示分类结果的具体细节,在生活中有一定使用价值。 |
论文外文摘要: |
Text sentiment classification is the use lor='red'>of computer technology to process data containing emotional features, which can identify and extract subjective information in text. Because the text sentiment classification based on traditional methods is not sufficient for feature information extraction, and the text sentiment classification method based on deep learning can improve this problem, deep learning has become the main research method lor='red'>of text sentiment classification. In order to extract more feature information and improve classification accuracy, the paper proposes a non-static FastText word vector model and Attention-CNN fusion model. The main research contents lor='red'>of the thesis are as follows: (1) A method based on non-static FastText word vectors is proposed. This method uses the FastText model to train the word vector to obtain the initial word vector matrix, which is used as parameter training during the training process. Through continuous adjustment and update, the word vector is more suitable for the text emotion classification task, and more words are obtained. The semantic and grammatical relationship lor='red'>of the, so as to improve the classification accuracy. Perform text sentiment classification experiments on convolutional neural networks. Compared with the static FastText, Word2Vec and Doc2vec models, the non-static FastText model has increased accuracy by 2.54%, 6.22% and 4.82% on the public comment data set; On the hotel review corpus, the accuracy rate increased by 1.84%, 6.22% and 5.10%. (2) A fusion model based on Attention-CNN is proposed. The model combines convolutional neural network, gated neural network and attention mechanism. Aiming at the inability lor='red'>of convolutional neural network to capture long-distance dependencies, the extracted features are not sufficient For the problem, firstly determine the value lor='red'>of related hyperparameters through experiments, including Dropout value, batch-size parameter and activation function, etc; then conduct model comparison experiments, compared with CNN, LSTM and GRU, based on Attention-CNN fusion On the public review data set, the accuracy lor='red'>of the model increased by 4.70%, 3.66%, and 2.95%, and on the hotel review corpus, the accuracy increased by 3.61%, 2.88%, and 2.03%. (3) Constructed a text content sentiment classification system. The system verifies the effectiveness lor='red'>of the non-static FastText word vector and the Attention-CNN-based fusion model. The system is implemented in Python, through visual operation, the specific details lor='red'>of the classification results can be clearly displayed, which has certain use value in life. |
参考文献: |
[1] 黄琼霞.基于深度学习的文本情感分析研究[D].福建农林大学,2018. [6] Kobayashi N, Inui K, Matsumoto Y, et al. Collecting Evaluative Expressions for Opinion Extraction[J]. Journal of Natural Language Processing, 2004, 12: 596-605. [11] 徐琳宏,林鸿飞,赵晶.情感语料库的构建和分析[J].中文信息学报,2008(01):116-122. [12] 柳位平,朱艳辉,栗春亮,向华政,文志强.中文基础情感词词典构建方法研究[J].计算机应用,2009,29(10):2875-2877. [13] 杨超,冯时,王大玲,杨楠,于戈.基于情感词典扩展技术的网络舆情倾向性分析[J].小型微型计算机系统,2010,31(04):691-695. [14] 孙本旺,田芳,藏文.情感词典的构建及微博情感计算研究[J]. 计算机技术与发展,2018,28( 11) : 218-222. [15] 王志涛,於志文,郭斌.基于词典和规则集的中文微博情感分析[J]. 计算机工程与应用,2015,51( 8) : 218-225. [21] 张莹. 在线新闻评论的情感分析研究[D]. 南开大学, 2013. [22] 杨经,林世平.基于SVM的文本词句情感分析[J].计算机应用与软件,2011, 28(09): 225-228. [23] 李婷婷,姬东鸿.基于SVM和CRF多特征组合的微博情感分析[J].计算机应用研究, 2015, 32(4):978-981. [24] 徐军,丁宇新,王晓龙.使用机器学习方法进行新闻的情感自动分类[J].中文信息学报,2007, 21(6): 95-100. [26] Kim Y . Convolutional Neural Networks for Sentence Classification[J]. Eprint Arxiv, 2014. [37] 刘龙飞,杨亮,张绍武,等.基于卷积神经网络的微博情感倾向性分析[J].中文信息学报, 2015, 29(6):159-165. [38] 冯兴杰,张志伟,史金钏.基于卷积神经网络和注意力模型的文本情感分析[J].计算机应用研究, 2018(5). [39] 何炎祥,孙松涛,牛菲菲,等.用于微博情感分析的一种情感语义增强的深度学习模型[J].计算机学报, 2017, 40(4):773-790. [40] 李科.基于多元特征融合和LSTM神经网络的中文评论情感分析[D].太原理工大学,2017. [41] 吴小华,陈莉,魏甜甜,范婷婷.基于Self-Attention和Bi-LSTM的中文短文本情感分析[J].中文信息学报,2019,33(6): 100-107. [42] 杨东,王移芝.基于 Attention-based C-GRU神经网络的文本分类[J]计算机与现代化 2018 (2):96-100. [43] 孙明敏.基于GRU-Attention的中文文本分类[J]现代信息科技2019,3(3):10-12. [44] 线岩团,相艳,余正涛,文永华,王红斌,张亚飞.用于文本分类的均值原型网络[J]中文信息学报. 2020,34(6):74-80. [47] 汪岿,刘柏嵩.文本分类研究综述[J].数据通信,2019(03):37-47. [51] Tao L , Yu Z . Training RNNs as Fast as CNNs[J]. 2017. [55] Duchi J , Hazan E , Singer Y , et al. dearly. Adaptive Subgradient Methods Adaptive |
中图分类号: | TP391 |
开放日期: | 2021-06-22 |