论文中文题名: | 关中地区空气质量多模式数值预报优化方法研究 |
姓名: | |
学号: | 18210063040 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 081602 |
学科名称: | 工学 - 测绘科学与技术 - 摄影测量与遥感 |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2021 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 大气污染数值模拟 |
第一导师姓名: | |
第一导师单位: | |
第二导师姓名: | |
论文提交日期: | 2021-06-16 |
论文答辩日期: | 2021-06-01 |
论文外文题名: | Optimization Method of Multi-model Forecast system of Air Quality in Guanzhong area |
论文中文关键词: | |
论文外文关键词: | Multimodal ; Air quality model ; Machine learning ; Regional prediction optimization |
论文中文摘要: |
关中地区地处渭河盆地,近年来社会经济快速发展导致大气问题突出,因此进行空气质量预报预警工作以获取准确及时的大气污染信息也显得尤为重要。相比于传统的统计预报,区域空气质量模式以大气动力学为基础,考虑多种物理和化学过程,进而定量描述污染物的扩散和输送规律,被广泛应用于空气质量预报以及污染过程分析中。但由于现阶段的排放清单具有时间滞后性与低空间分辨率性,且模式本身化学机理以及气象模式也存在不确定性等问题,导致空气质量模式的模拟结果与实际观测值之间存在误差。为了提高关中地区污染物浓度预测精度,本研究针对区域空气质量模型现存问题,结合多模式数值预报与机器学习算法,开展关中地区区域空气质量模式优化与订正方法研究。本文主要研究内容总结如下: (1)基于大气化学数值模式建立关中地区空气质量多模式预报预警系统。利用WRF、CMAQ及CAMx模式进行关中地区气象数值模拟以及空气质量预报及再分析工作,获取关中地区的数值模拟再分析数据以及WRF模式气象数据,并对WRF、CMAQ及CAMx模式进行相关精度验证。结果表明,WRF模式能准确模拟区域气象因子的时空演变,对温度以及海平面气压的模拟精度较高,相关性系数分别为0.94、0.91;对风速及相对湿度的模拟精度稍低。CAMx与CMAQ模式模拟结果与观测值时序变化趋势一致,模拟结果存在PM2.5高估、O3低估现象,其中CAMx模拟精度较CMAQ低。 (2)利用机器学习算法实现单站点模拟结果优化以及优化评价。由于目前大多数研究集中在单个机器学习模型对预报值进行优化,而对多种机器学习模型优化结果的对比以及模型对不同污染物优化效果评估的研究相对较少。因此基于空气质量模式模拟结果,本文提出利用多种机器学习算法对数值模拟结果进行订正与优化,选取温度、相对湿度、边界层高度、海平面气压以及风速等气象因子,验证多种机器学习算法对于关中五市的数值模拟优化结果,分析不同算法对于单站数值模拟的优化性能。结果显示,随机森林算法对PM2.5优化精度最高,优化后模拟值与观测值之间的相关性系数为0.74~0.8;而支持向量机算法对O3优化结果最好,优化后相关性系数提高至0.79~0.88。 (3)建立区域优化网络,实现关中地区区域模拟结果优化并进行验证。由于当前机器学习算法研究主要集中在城市站点优化方面,而利用算法结合集合预报实现数值模型的区域优化较少。为实现数值模式的区域优化,本文在单站优化的基础上,加入关中五市33个国控监测站点数据、DEM及土地利用数据,搭建关中地区区域优化网络,验证对比XGBoost算法与LSTM深度学习神经网络算法的优化性能。研究结果显示,XGBoost 算法在区域优化方面优于LSTM算法,XGBoost与LSTM算法优化后的PM2.5模拟值与观测值相关性系数分别为0.93~0.99、0.73~0.82,O3模拟值与观测值相关性系数分别0.85~0.97、0.7~0.84。在区域验证方面,算法能明显改善模式PM2.5高估以及O3低估的现象,并为区域提供较为准确的污染物浓度值。 (4)利用XGBoost算法实现关中地区未来七天的区域数值模拟优化预报。基于算法区域优化的可行性,本文利用算法实现对区域预报数据的订正优化。结果表明,关中地区PM2.5预报优化结果的RMSE值约为14.2~22.3μg·m-3,O3预报优化结果的RMSE值约为5.5~10.6μg·m-3,同时PM2.5 及O3的浓度演变过程与单站观测结果时序变化结果一致。因此,通过算法优化,可以提高对未来空气质量预报的准确度,对于没有空气质量监测站点的地区也可以实现大气污染物的准确预报,进而为污染管控及防治提供参考。 |
论文外文摘要: |
Guanzhong area is located in the Weihe Basin. In recent years, the rapid development of social economy has led to prominent atmospheric problems. Therefore, it is particularly important to carry out air quality forecast and early warning to obtain accurate and timely air pollution information. Compared with the traditional statistical forecast, the regional air quality model is based on atmospheric dynamics, considering a variety of physical and chemical processes, and then quantitatively describes the diffusion and transport laws of pollutants, which is widely used in air quality forecast and pollution process analysis. However, due to the time lag and low spatial resolution of the emission inventory at the present stage, as well as the uncertainties of the chemical mechanism of the model and the meteorological model, there are errors between the simulated results of the air quality model and the actual observed values. In order to improve the prediction accuracy of pollutant concentration in Guanzhong region, this study aimed at the existing problems of regional air quality model in Guanzhong region, combined with multi-model numerical prediction and machine learning algorithm, carried out optimization and correction method research of regional air quality model. The main research contents of this paper are summarized as follows: (1) A multi-model air quality forecast and early warning system in Guanzhong region was established based on atmospheric chemistry numerical model. In this paper, WRF, CMAQ and CAMX models were used to carry out meteorological numerical simulation and air quality forecast and reanalysis in Guanzhong region. The numerical simulation reanalysis data and WRF model meteorological data in Guanzhong region were obtained, and the related accuracy of WRF, CMAQ and CAMX models was verified. The results show that the WRF model can accurately simulate the spatiotemporal evolution of regional meteorological factors, and the simulation accuracy of temperature and sea level pressure were relatively high, with correlation coefficients of 0.94 and 0.91, respectively. The simulation accuracy of wind speed and relative humidity were lower. The simulation results of CAMX and CMAQ modes were consistent with the observation values in time series, and PM2.5 overestimation and O3 underestimation existed in the simulation results. The simulation accuracy of CAMX was lower than that of CMAQ. (2) The machine learning algorithm was used to optimize the simulation results of single site and optimize the evaluation. At present, most of the researches focused on the optimization of forecast values by a single machine learning model, while there were relatively few researches on the comparison of optimization results of multiple machine learning models and the evaluation of optimization effects of different pollutants by models. Therefore, based on the air quality model simulation results, this paper proposed to use a variety of machine learning algorithms to correct and optimize the numerical simulation results. Meteorological factors such as temperature, relative humidity, boundary layer height, sea level pressure and wind speed were selected to verify the numerical simulation optimization results of a variety of machine learning algorithms for the five cities in Guanzhong, and to analyze the optimization performance of different algorithms for the numerical simulation of a single station. The results showed that the random forest algorithm had the highest optimization accuracy for PM2.5, and the correlation coefficient between the simulated value and the observed value after optimization was 0.74~0.8. The support vector machine algorithm had the best result for O3 optimization, and the correlation coefficient was improved to 0.79~0.88 after optimization. (3) The regional optimization network was established to optimize and verify the simulation results in Guanzhong region. At present, the research of machine learning algorithms mainly focused on the optimization of urban sites, while the regional optimization of numerical models based on algorithm combined with ensemble prediction was less. In order to realize the regional optimization of numerical model, based on the single station optimization, this paper added the data of 33 state-controlled monitoring stations, DEM and land use data in five cities of Guanzhong to build a regional optimization network, and verified and compared the optimization performance of XGBoost algorithm and LSTM deep learning neural network algorithm. The results showed that the XGBoost algorithm was superior to the LSTM algorithm in terms of regional optimization. The correlation coefficients between the simulated and observed PM2.5 values optimized by XGBoost and LSTM algorithms were 0.93-0.99 and 0.73-0.82, respectively. The correlation coefficients between the simulated and observed values of O3 were 0.85~0.97 and 0.70 ~0.84, respectively. In terms of regional verification, the algorithm could significantly improve the phenomenon of overestimation of PM2.5 and underestimation of O3, providing more accurate pollutant concentration value for the region. (4) The XGBoost algorithm was used to achieve the regional numerical simulation and optimization forecast of Guanzhong region in the next seven days. Based on the feasibility of regional optimization algorithm, this paper used the algorithm to revise and optimize the regional forecast data. The results showed that the RMSE value of PM2.5 prediction and optimization results in Guanzhong area was about 14.2~22.3μg·m-3, and the RMSE value of O3 prediction and optimization results was about 5.5~10.6μg·m-3. Meanwhile, the evolution process of PM2.5 and O3 concentration were consistent with the time series variation results of single station observation results. Therefore, the algorithm optimization could improve the accuracy of air quality forecast in the future, and could also achieve accurate forecast of air pollutants in areas without air quality monitoring stations, thus providing reference for pollution control and prevention. |
参考文献: |
[10] 张小曳,孙俊英,王亚强,等. 我国雾-霾成因及其治理的思考[J]. 科学通报, 2013, 58(13): 1178-1187. [11] 生态环境部. 2019 年中国生态环境状况公报[R]. 北京: 生态环境部, 2020. [12] 陕西省统计局. 陕西省统计年鉴(2019年)[M]. 北京: 中国统计出版社, 2020. [13] 陕西省环保厅. 2019年陕西省生态环境状况公报. 2020. [25] 万显烈,杨凤林,王慧卿.利用人工神经网络对空气中O3浓度进行预测[J].中国环境科学,2003(01):111-113. [26] 周杉杉,李文静,乔俊飞.基于MRMR-PSO的递归模糊神经网络PM2.5预测模型[J].计算机与应用化学,2018,35(10):783-791. [27] 康俊锋,黄烈星,张春艳,等.多机器学习模型下逐小时PM2.5预测及对比分析[J].中国环境科学,2020,40(05):1895-1905. [28] 曲悦,钱旭,宋洪庆,等.基于机器学习的北京市PM2.5浓度预测模型及模拟分析[J].工程科学学报,2019,41(03):401-407. [41] 薛文博,王金南,杨金田,等.国内外空气质量模型研究进展[J].环境与可持续发展,2013,38(03):14-20. [44] 胡翠娟,丁峰,李时蓓,等.国内外环境空气质量模型法规化现状与对比研究[J].环境工程,2015,33(01):132-136. [45] 王淑芳,陈家宜,刘式达.北京城近郊区大气污染评价的气质模式[J]. 中国环境科学, 1981, 1(1):0-0. [46] 王自发,谢付莹,王喜全,等.嵌套网格空气质量预报模式系统的发展与应用[J].大气科学,2006(05):778-790. [47] 王自发,吴其重,晏平仲,等.北京空气质量多模式集成预报系统的建立及 初步应用[J]. 南京信息工程大学学报: 自然科学版, 2009, 1(1): 19-26. [48] 黄思,唐晓,徐文帅,等.利用多模式集合和多元 线性回归改进北京PM10预报[J].环境科学学报,2015,35(01):56-64. [49] 程兴宏,刁志刚,胡江凯,等.基于CMAQ模式和自适应偏最小二乘回归法的中国地区PM2.5浓度动力-统计预报方法研究[J].环境科学学报,2016,36(8):2771-2782. [50] 赵俊日,肖昕,吴涛,等.空气质量数值预报优化方法研究[J].中国环境科学,2018,38(6):2047-2054. [51] 汤静,王春林,谭浩波,等.利用PCA-kNN方法改进广州市空气质量模式PM2.5预报[J].热带气象学报,2019,35(1):125-134. [52] 张伟,王自发,安俊岭,等.利用BP神经网络提高奥运会空气质量实时预报系统预报效果[J].气候与环境研究,2010,15(5):595-601. [53] 王茜,吴剑斌,林燕芬.CMAQ模式及其修正技术在上海市PM2.5预报中的应用检验[J].环境科学学报,2015,35(6):1651-1656. [54] 张岳军,张怀德,朱凌云,等.太原市PM2.5预报统计修正模型及其应用检验[J].环境科学研究,2018,31(7):1207-1213. [55] 潘锦秀,晏平仲,孙峰,等.多元线性回归方法对北京地区PM2.5预报的改进应用[J].中国环境监测,2019,35(2):43-52. [56] 张恒德,张庭玉,李涛,等.基于BP神经网络的污染物浓度多模式集成预报[J].中国环境科学,2018,38(4):1243-1256. [57] 戴李杰,张长江,马雷鸣.基于机器学习的PM2.5短期浓度动态预报模型[J].计算机应用,2017,37(11):3057-3063. [58] 马井会,曹钰,余钟奇,等.深度学习方法在上海市PM2.5 浓度预报中的应用[J].中国环境科学,2020,40(2):530-538. [59] 陕西省地方志编纂委员会. 陕西省志·地理志[M]. 西安: 陕西人民出版社, 2000. [60] 生态环境部.环境空气质量指数(AQI)技术规定(试行)(633-2012)[S]. 北京: 生态环境部, 2012. [63] 李颖若,汪君霞,韩婷婷,等.利用多元线性回归方法评估气象条件和控制措施对APEC期间北京空气质量的影响[J].环境科学,2019,40(03):1024-1034. [64] Breiman, L. Random Forests, Mach. Learn., 45, 5–32. [67] 张学工.关于统计学习理论与支持向量机 [J]. 自动化学报, 2000, 26(1):36-46. [77] 颜鹏,李维亮,秦瑜.近年来大气气溶胶模式研究综述[J].应用气象学报,2004(05):629-640. [79] 陈云波,徐峻,何友江,等.北京市冬季典型重污染时段PM2.5污染来源模式解析[J].环境科学研究,2016,29(05):627-636. [80] 何斌,梅士龙,陆琛莉,等.MEIC排放清单在空气质量模式中的应用研究[J].中国环境科学,2017,37(10):3658-3668. [82] 沈劲,王雪松,李金凤,等.Models-3/CMAQ和CAMx对珠江三角洲臭氧污染模拟的比较分析[J].中国科学:化学,2011,41(11):1750-1762. [94] 李娟,尉鹏,戴学之,等.基于机器学习方法的西安市数值模拟优化研究[J].环境科学研究.2021,34(4):34-43. [104] 杜金龙,朱记伟,解建仓,等.近25a关中地区土地利用及其景观格局变化[J].干旱区研究,2018,35(01):217-226. |
中图分类号: | X51 |
开放日期: | 2021-06-16 |