论文中文题名: | 基于RF-MITSO-LSSVR的股价预测模型建立及应用研究 |
姓名: | |
学号: | 21201221063 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 025200 |
学科名称: | 经济学 - 应用统计 |
学生类型: | 硕士 |
学位级别: | 经济学硕士 |
学位年度: | 2024 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 数据挖掘 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2024-06-10 |
论文答辩日期: | 2024-06-04 |
论文外文题名: | Research on the Establishment and Application of a Stock Price Forecasting Model Based on the RF-MITSO-LSSVR |
论文中文关键词: | 股价预测 ; 随机森林 ; 金枪鱼群优化算法 ; 最小二乘支持向量回归 |
论文外文关键词: | Stock price forecasting ; Random forest ; Tuna swarm optimization algorithm ; Least Squares Support Vector Regression |
论文中文摘要: |
金融业已成为我国经济的关键支柱,证券市场作为其重要组成部分,对经济发展有着深远影响。近年来,我国经济持续高质高效增长,政府加大证券市场支持力度,企业和个人纷纷参与证券贸易,推动了证券市场繁荣发展。股票作为证券市场的核心,其价格变化与多种因素密切相关,具有很大的不确定性和复杂性,在此背景下,股票价格预测问题也成了研究热点之一,采用有效的方法,实现股票价格的精准预测具有重要意义。针对股价预测研究在大型综合股指中存在的预测精度不高和拟合效果较差等问题,本文构建了新的RF-MITSO-LSSVR股价预测模型并对其进行了系统研究,具体如下。 首先,针对标准金枪鱼群优化算法(TSO)易早期收敛和陷入局部最优解的缺陷,提出了一种将佳点集种群初始化,莱维飞行改进螺旋觅食和自适应概率阈值三种改进策略相结合的多策略改进金枪鱼群优化算法(MITSO)。并与TSO算法、GA算法、PSO算法和GWO算法在基准测试函数上进行了性能对比,证明了MITSO算法具备更优秀的搜索机制和摆脱局部最优解的能力。 其次,选取包括股权风险溢价率等风险指标在内的16个股价相关指标作为预测模型输入变量,并采用随机森林(RF)对输入变量进行降维处理以消除指标间多重共线性的影响,提升模型训练效率。同时,针对输入变量中存在的非线性关系、多特征耦合等特点,选择了LSSVR作为基准模型,并且针对机器学习中人工设置超参数的局限性,采用本文提出的MITSO算法对最小二乘支持向量机回归(LSSVR)的正则化参数和核函数参数进行了优化,建立了基于RF-MITSO-LSSVR的股价预测模型。 最后,选择沪深300指数这一典型综合股指的历史数据进行了三组对比实验,对模型的准确性和有效性进行了检验,结果表明:相比于基准模型,RF-MITSO-LSSVR 模型具有更好的预测性能。同时,与TSO、GA、PAO、GWO算法相比,MITSO算法在LSSVR超参数的搜索上效果更好且对预测效果有一定提升。此外,与常用于股价预测的模型ARIMA和LSTM相比,本文构建的RF-MITSO-LSSVR 模型在综合型股指价格预测方面的精度与耗时上取得了平衡,具有一定的优势。 |
论文外文摘要: |
The financial industry has become a key pillar of China’s economy, with the stock market as an important component having a profound impact on economic development. In recent years, China’s economy has continued to grow with high quality and efficiency. The government has increased its support for the stock market, and both enterprises and individuals have actively participated in securities trading, promoting the prosperity and development of the stock market. As the core of the stock market, stock prices are closely related to various factors, characterized by great uncertainty and complexity. Against this backdrop, stock price prediction has also become one of the hot research topics. It is of great significance to adopt effective methods to achieve accurate prediction of stock prices. In response to the issues of low prediction accuracy and poor fitting effects in large comprehensive stock indices, this paper constructs a new RF-MITSO-LSSVR stock price prediction model and systematically studies it, as follows. Firstly, in order to solve the shortcomings of the standard tuna swarm optimization algorithm (TSO), which is easy to converge early and fall into the local optimal solution, a multi-strategy improved tuna swarm optimization algorithm (MITSO) was proposed, which combined three improved strategies: population initialization of the best spot set, improved spiral foraging by Levy flight and adaptive probability threshold. The performance of the MITSO algorithm,the TSO algorithm,the GA algorithm, the PSO algorithm and the GWO algorithm is compared with the benchmark function, which proves that the MITSO algorithm has a better search mechanism and the ability to get rid of the local optimal solution. Secondly, 16 stock price-related indicators, including equity risk premium rate and other risk indicators, were selected as the input variables of the prediction model, and random forest (RF) was used to reduce the dimensionality of the input variables to eliminate the influence of multicollinearity between indicators and improve the training efficiency of the model. At the same time, in view of the nonlinear relationship and multi-feature coupling in the input variables, LSSVR was selected as the benchmark model, and in view of the limitations of manual hyperparameters in machine learning, the regularization parameters and kernel function parameters of Least Squares Support Vector Machine Regression (LSSVR) were optimized by using the MITSO algorithm proposed in this paper, and a stock price prediction model based on RF-MITSO-LSSSVR was established. Finally, historical data of the CSI 300 Index, a typical comprehensive stock index, are selected for three sets of comparative experiments to test the accuracy and effectiveness of the model. The results show that the RF-MITSO-LSSVR model has better predictive performance compared to the baseline model. Moreover, compared with TSO, GA, PAO, and GWO algorithms, MITSO algorithm is more efficient in searching for LSSVR hyperparameters and has improved the prediction effect to a certain extent. In addition, compared with the commonly used stock price prediction models ARIMA and LSTM, the RF-MITSO-LSSVR model constructed in this paper achieves a balance between accuracy and time consumption in the prediction of comprehensive stock index prices, demonstrating certain advantages. |
参考文献: |
[3]卜林,孙丽玲,李政.地缘政治风险、经济政策不确定性与股票市场波动[J].南开经济研究,2020(05):185-205.DOI:10.14116/j.nkes.2020.05.010. [4]李斌,龙真.中国股票市场可预测性研究:基于机器学习的视角[J].管理科学学报,2023,26(10):138-158.DOI:10.19920/j.cnki.jmsc.2023.10.008. [16]冯宇旭,李裕梅.基于LSTM神经网络的沪深300指数预测模型研究[J].数学的实践与认识,2019,49(07):308-315. [22]王冰玉;刘勇军. 基于信噪比的KPCA-SVM-KNN算法的股价预测研究 [J]. 计算机与数字工程, 2022, 50 (04): 685-690. [24]张倩. 基于降维和机器学习模型的沪深300指数预测[D].江西:江西财经大学,2023.DOI:10.27175/d.cnki.gjxcu.2022.001769. [29]孙秋韵,刘金清,刘引等.基于改进GA参数优化的SVR股价预测模型[J].计算机系统应用,2015,24(09):29-34. [33]陈晓姨.基于贝叶斯优化的CNN-LSTM股价预测模型[D].兰州:兰州大学,2023.DOI:10.27204/d.cnki.glzhu.2023.002202. [35]杨青沙. 基于粒子群优化算法和LSSVM模型的股价预测研究[D].厦门:厦门大学,2020.DOI:10.27424/d.cnki.gxmdu.2019.000130. [43]Breiman L. Random forests[J]. Machine learning, 2001, 45: 5-32. [44]Cortes C, Vapnik V. Support-vector networks[J]. Machine learning, 1995, 20: 273-297. [48]Holland J H. Genetic algorithms[J]. Scientific american, 1992, 267(1): 66-73. [52]华罗庚, 王元. 数论在近似分析中的应用 [ M ]. 北京科学出版社, 1978. [53]李涵,李文敬.混合策略改进的金枪鱼群优化算法[J].广西科学,2023,30(01):208-218.DOI:10.13656/j.cnki.gxkx.20230308.022. [62]李程,张恒玮.经济政策不确定性、国债利率期限结构与股市收益率[J].金融市场研究,2024(03):98-111.DOI:10.20134/j.cnki.fmr.2024.03.005. |
中图分类号: | F832.5 |
开放日期: | 2024-06-11 |