题名: | 煤矿井下移动机器人视觉SLAM关键技术研究 |
作者: | |
学号: | 21210061039 |
保密级别: | 保密(4年后开放) |
语种: | chi |
学科代码: | 081601 |
学科: | 工学 - 测绘科学与技术 - 大地测量学与测量工程 |
学生类型: | 硕士 |
学位: | 工学硕士 |
学位年度: | 2024 |
学校: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 视觉SLAM |
导师姓名: | |
导师单位: | |
提交日期: | 2024-06-13 |
答辩日期: | 2024-06-03 |
外文题名: | Research on the Key Technologies of Visual SLAM for Underground Mobile Robots in Coal Mines |
关键词: | |
外文关键词: | Intelligent Underground Coal Mine ; Mobile Robot ; Visual SLAM ; Keyframe Selection ; Dense Reconstruction |
摘要: |
近年来,国家能源安全战略持续升级,智能化技术与煤炭企业深度融合,煤矿智能化成为煤炭企业高质量发展的技术支撑和迫切需求。移动机器人是实现煤矿智能化和无人化的关键,对提升煤矿企业安全性、减轻矿工劳动负担和实现高效管理具有划时代意义。视觉同时定位与建图(Visual Simultaneous Localization and Mapping,VSLAM)凭借高帧率、低成本、色彩信息丰富以及在GNSS信号拒止环境下的出色表现,成为煤矿井下移动机器人实现精确定位与导航的关键技术。然而,面对煤矿井下空间狭小、能见度低、纹理信息稀缺的恶劣环境,现有视觉SLAM难以达到理想的定位精度和建图效果。为此,本文聚焦于提升煤矿井下移动机器人视觉SLAM精度和鲁棒性,研究煤矿井下图像增强算法、关键帧选取方法以及三维稠密建图,旨在推动煤矿企业向智能化和无人化方向发展。论文主要研究内容总结如下: (1)煤矿井下低照度使得双目立体匹配面临着极大的挑战,因此,本文在双目立体匹配之前需要对图像质量进行增强。本文提出基于双边滤波的Retinex图像增强算法,首先,将输入的RGB(Red,Green,Blue)图像转换至HSI(Hue,Saturation,Intensity)色彩空间,以便更好地处理图像的色相、饱和度和亮度信息。其次,采用双边滤波函数替代传统Retinex算法中的高斯核函数,估计更加精准的反射分量。最后,经过处理后的图像再从HSI色彩空间转换到RGB色彩空间,得到对比度显著提升且亮度分布均匀的增强图像。通过实验分析,相较于Retinex算法和多尺度Retinex算法,本文算法处理后的图像未出现过亮及光晕现象,图像质量得到了明显提升。 (2)针对现有依赖启发式阈值进行关键帧选取方法无法满足煤矿井下视觉SLAM的有效性和定位精度,本文提出多重约束的视觉SLAM关键帧选取方法。采用几何结构约束的自适应阈值取代静态启发式阈值进行关键帧选取,实现了关键帧选取方法的有效性;通过重心平衡原则对有效特征点分布进行均匀化处理,确保关键帧选取方法的稳定性以及创建地图点的稠密性;利用航向角阈值对转向处做进一步约束,降低视角突变对关键帧选取精度的影响。为了验证该方法的效果,自主设计集成了移动机器人数据采集平台,在室内场景和煤矿井下分别进行实验,并比于启发式视觉SLAM关键帧选取方法,本文提出的方法在室内场景和煤矿井下的RMSE分别提升了32%和37%,具有较高的定位精度和鲁棒性。 (3)视觉SLAM构建的稀疏点云地图未能完整表达煤矿井下场景信息,如井下地形、障碍物、设备布局等,无法为煤矿井下导航、路径规划、避障等任务提供有力支持。稠密点云地图通过构建更多的环境细节,能够提供丰富的场景信息,为煤矿井下的各类应用提供数据支撑。为此,本文基于ORB-SLAM3框架,提出了煤矿井下关键帧图像快速稠密建图方法。采用ELAS算法估计关键帧视觉点云,通过估计的位姿实现局部点云数据融合,全局光束法平差(Bundle Adjustment,BA)对整体点云进行优化。煤矿井下实验表明:所构建的点云地图具有较好的全局一致性和几何结构真实性,验证了本文方法的有效性。 |
外文摘要: |
In recent years, the implementation of the national energy security strategy has been upgraded, the intelligent technology has been deeply integrated with the coal enterprises, and coal mine intelligence has become the technical support and urgent demand for the high-quality development of coal enterprises. The mobile robot is the key to realize coal mine intelligence and unmanned operations, which is of epoch-making significance in enhancing the safety of coal mining enterprises, reducing the labor burden of miners and realizing efficient management. Visual Simultaneous Localization and Mapping (VSLAM), with its high frame rate, low cost, rich color information and excellent performance in GNSS signal rejection environments, has become the key technology to achieve accurate positioning and navigation of mobile robots in underground coal mines. However, in the context of the harsh environment of narrow spaces, low visibility and scarce texture information in underground coal mines, the existing visual SLAM is unable to achieve the ideal positioning accuracy and map building effect. This paper therefore focuses on improving the accuracy and robustness of VSLAM for mobile robots in underground coal mines. It also examines image enhancement algorithms, key frame selection methods, and 3D dense map building in underground coal mines. The aim is to promote the development of coal mining enterprises in the direction of intelligence and unmanned development. The main research content of the thesis is summarized as follows: (1) The existing key frame selection method, which relies on heuristic thresholds, is unable to achieve the effectiveness and positioning accuracy required for VSLAM in underground coal mines. This paper proposes a multiple-constraint VSLAM key frame selection method to enable real-time robust position estimation for mobile robots in underground coal mines. Adaptive thresholds based on geometric structure constraints are employed to supplant static heuristic thresholds for key frame selection, thereby achieving the efficacy and resilience of key frame selection method. The distribution of effective feature points is homogenized by the principle of center of gravity balance to guarantee stability. Furthermore, heading angle thresholds are employed to further constrain the steering position and reduce the impact of sudden changes in perspective on key frame selection and the accuracy of key frames. The impact of sudden changes in view angle on the accuracy of key frame selection is reduced. The experiments are conducted in indoor scenarios and underground coal mines using the autonomous integrated mobile robot data acquisition platform. Quantitative and qualitative evaluations are conducted in terms of Absolute Pose Error and Root Mean Square Error. The experimental results demonstrate that the proposed method outperforms the heuristic visual SLAM keyframe selection method in terms of RMSE, with improvements of 32% and 37% observed in indoor and underground coal mine scenes, respectively. Furthermore, the method exhibits high localization accuracy and robustness. (2) This paper proposes a multiple-constraint VSLAM key frame selection method as an alternative to the existing key frame selection method relying on heuristic thresholds, which has been demonstrated to be ineffective and inaccurate in the context of visual SLAM in underground coal mines. Adaptive thresholds with geometric structure constraints are employed to supplant static heuristic thresholds for key frame selection, thereby achieving the efficacy of key frame selection. The distribution of effective feature points is homogenized by the principle of center of gravity balance, which ensures the stability of the method. The key frame selection method is also constrained by the density of the created map points, and further constraints are imposed on the steering place by using heading angle thresholds, which reduces the effect of sudden changes in perspective on the accuracy of key frame selection. In order to verify the effectiveness of the method, a mobile robot data acquisition platform is designed and integrated independently, and experiments are carried out in indoor scenes and coal mine underground, respectively. These are then compared with the heuristic VSLAM key frame selection method. The results demonstrate that the method proposed in this paper improves the RMSE in indoor scenes and coal mine underground by 32% and 37%, respectively. Furthermore, it has high positioning accuracy and robustness. (3) The sparse point cloud map constructed by VSLAM is inadequate for fully representing the underground coal mine scene information, including underground terrain, obstacles, and equipment layout. Consequently, it is unable to provide sufficient support for navigation, path planning, and obstacle avoidance in the underground coal mine. By incorporating more environmental details, the dense point cloud map can provide richer scene information and serve as a valuable data source for a range of applications in coal mine underground environments. This paper proposes a fast dense map building method for key frame images of coal mine underground based on the ORB-SLAM3 framework. The ELAS algorithm is used to estimate the visual point cloud of key frames, the local point cloud data fusion is achieved by the estimated bitmap, and the global beam method levelling Bundle Adjustment (BA) is used to optimize the overall point cloud. Experiments conducted in a coal mine underground have demonstrated that the constructed point cloud map exhibits satisfactory global consistency and geometric structure realism, thereby corroborating the efficacy of the methodology proposed in this paper. |
参考文献: |
[1] 葛世荣, 樊静丽, 刘淑琴等. 低碳化现代煤基能源技术体系及开发战略[J/OL]. 煤炭学报:1-26[2024-02-28]. [2] 葛世荣, 胡而已, 李允旺. 煤矿机器人技术新进展及新方向[J]. 煤炭学报, 2023, 48(01): 54-73. [3] 陈国良, 时洪涛, 汪云甲等. 矿山地质环境“天—空—地—人”协同监测与多要素智能感知[J]. 金属矿山, 2023 (01): 9-16. [4] 王国法, 庞义辉, 许永祥等. 厚煤层智能绿色高效开采技术与装备研发进展[J]. 采矿与安全工程学报, 2023, 40(05): 882-893. [5] 贺飞, 鲁义强, 代恩虎等. 煤矿岩巷TBM适应性与新技术发展[J]. 煤炭科学技术, 2023, 51(S1): 351-361. [6] 王国法. 煤矿智能化最新技术进展与问题探讨[J]. 煤炭科学技术, 2022, 50(01):1-27. [8] 马宏伟, 孙思雅, 王川伟等. 多机械臂多钻机协作的煤矿巷道钻锚机器人关键技术[J]. 煤炭学报, 2023, 48(01): 497-509. [9] 杨春雨, 张鑫. 煤矿机器人环境感知与路径规划关键技术[J]. 煤炭学报, 2022, 47(07): 2844-2872. [11] 张旭辉, 杨文娟, 薛旭升等. 煤矿远程智能掘进面临的挑战与研究进展[J]. 煤炭学报, 2022, 47(01): 579-597. [12] 杨必胜, 陈一平, 邹勤. 从大模型看测绘时空信息智能处理的机遇和挑战[J]. 武汉大学学报(信息科学版), 2023, 48(11): 1756-1768. [13] 廖志伟, 杨真, 贺晓峰等. 煤矿井下机器人研发应用现状及发展趋势研究[J]. 中国煤炭, 2023, 49(S2): 13-23. [18] 孙海波, 童紫原, 唐守锋等. 基于卡尔曼滤波与粒子滤波的SLAM研究综述[J]. 软件导刊, 2018, 17(12): 1-3+7. [19] 邸凯昌, 万文辉, 赵红颖等. 视觉SLAM技术的进展与应用[J]. 测绘学报, 2018, 47(06): 770-779. [20] 高翔, 张涛, 刘毅, 等. 视觉SLAM十四讲:从理论到实践[M]. 北京:电子工业出版社, 2019:153-154, 239-242, 365-366. [53] Engel J, Koltun V, Cremers D. Direct Sparse Odometry[J]. arXiv e-prints, 2016. [62] 王云峰, 吴炜, 余小亮等. 基于自适应权重AD-Census变换的双目立体匹配[J]. 工程科学与技术, 2018, 50(04): 153-160. [65] 粟序明, 方成刚, 洪荣晶等. 基于机器视觉的轴类零件定位与测量系统[J]. 机械设计与制造, 2020, (07): 250-254. [72] Nocedal J, Wright S J. Quadratic programming[J]. Numerical optimization, 2006: 448-492. |
中图分类号: | TD676 |
开放日期: | 2028-06-13 |