多维度视觉感知增强的煤矿机器人高精度三维高斯溅射SLAM方法

Multidimensional visual perception-enhanced high-precision 3DGS SLAM for underground coal mine robots

  • 摘要: 三维高斯溅射作为辐射场建模的前沿技术为煤矿井下数字化进程开辟了新路径,已成为煤矿机器人自主导航系统与数字孪生体系构建的关键支撑。然而,现有煤矿机器人SLAM(Simultaneous Localization and Mapping)方法在面对动态光照、深度感知缺失与复杂几何结构等因素时面临着挑战。鉴于此,提出一种多维度视觉感知增强的高精度三维高斯溅射(3D Gaussian Splatting, 3DGS)SLAM方法。首先,提出基于HIS(Hue-Intensity-Saturation)空间的多阶段图像增强校正算法,融合MSRCR(Multi-Scale Retinex with Color Restoration)、SWF(Side Window Filtering)和归一化伽马校正技术,通过非线性光照补偿与局部对比度优化实现RGB图像细节增强。构建彩色图−深度图特征引导的深度补全网络以填充深度数据空洞并抑制噪声干扰,提升原始数据质量。然后,设计混合欧几里德距离与重叠度结合的双阶段关键帧选择机制,通过空间差异性约束与视角覆盖冗余度控制,在保障场景表征完整性的前提下筛选最小必要关键帧集,实现精确高效关键帧选取。最后,设计透明度与时空加权观测频次双重约束的高斯管理策略,剔除低透明度区域的无效高斯椭球并对长期稳定观测的高斯椭球进行置信度强化,结合回环检测与位姿图优化实现全局一致的高质量建图。为验证所提方法有效性,依托自主设计集成的移动机器人平台,在采掘巷道等典型煤矿场景中进行了大量对比实验,结果表明:该方法在煤矿井下复杂场景中实现厘米级轨迹跟踪精度,其平均跟踪误差较当前最优基准方法降低12.4%;三维建图峰值信噪比(Peak Signal-to-noise Ratio,PSNR)达22.4 dB并保持较高实时性,显著优于主流算法,为构建数字孪生矿山提供了可靠技术支撑。

     

    Abstract: 3D Gaussian Splatting has emerged as a cutting-edge technology in radiation field modeling, offering a novel approach to the digitalization of underground coal mines. This advancement has become a pivotal component in developing autonomous navigation systems and digital twins for coal mine robots. However, existing SLAM methods for coal mine robots face challenges when facing factors such as dynamic illumination, lack of depth perception, and complex geometric structures. To address these issues, a high-precision 3D Gaussian Splatting SLAM method for multi-dimensional visual perception enhancement is proposed. First, a multi-stage image enhancement correction algorithm based on HIS space is proposed, which integrates MSRCR, SWF, and normalized gamma correction techniques, achieving RGB image detail enhancement through nonlinear illumination compensation and local contrast optimization. A color map-depth map feature-guided depth-completion network is constructed to fill depth data voids and suppress noise interference, thereby enhancing the quality of raw data. Then, a two-stage keyframe selection mechanism combining hybrid Euclidean distance with overlap ratio is designed to filter the minimum necessary keyframe set, achieving accurate and efficient keyframe selection under the premise of guaranteeing the integrity of the scene representation by using spatial disparity constraints and redundancy control of view coverage. Finally, a Gaussian management strategy with double constraints on transparency and spatiotemporally weighted observation frequency is designed to eliminate invalid Gaussian ellipsoids in low-transparency regions and strengthen the confidence of Gaussian ellipsoids observed stably over a long period, achieving globally consistent and high-quality mapping by combining loop closure detection and pose graph optimization. To verify the effectiveness of the proposed method, extensive comparative experiments are conducted in typical coal mine scenarios, such as mining roadways, relying on an independently designed and integrated mobile robot platform. The results show that the proposed method achieves centimeter-level trajectory tracking accuracy in complex underground coal mine scenarios, reducing the average tracking error by 12.4% compared to the current state-of-the-art baseline method. The Peak Signal-to-Noise Ratio of 3D mapping reaches 22.4 dB while maintaining high real-time performance, which is significantly better than mainstream algorithms, providing reliable technical support for the construction of digital twin mines.

     

/

返回文章
返回