吉林大学学报(信息科学版) ›› 2023, Vol. 41 ›› Issue (6): 1041-1047.

• • 上一篇    下一篇

基于动态语义特征的视觉 SLAM 系统

任伟建1a,1b , 张志强1a , 康朝海1a,1b , 霍凤财1a,1b , 孙勤江2 , 陈建玲2    

  1. 1. 东北石油大学 a. 电气信息工程学院; b. 黑龙江省网络化与智能控制重点实验室, 黑龙江 大庆 163318; 2. 中海石油(中国)有限公司 天津分公司, 天津 300450
  • 收稿日期:2023-03-17 出版日期:2023-11-30 发布日期:2023-12-01
  • 通讯作者: 康朝海(1976— ), 男, 黑龙江望奎人, 东北石油大学副教授, 硕士生导师, 主要从事智能算法与智能控制研究, (Tel)86-15603690883 E-mail:kangchaohai@ 126. com
  • 作者简介:任伟建(1963— ), 女, 黑龙江泰来人, 东北石油大学教授, 博士生导师, 主要从事油气集输过程故障诊断研究, (Tel) 86-15765988699(E-mail)1064619284@ qq. com
  • 基金资助:
    国家自然科学基金资助项目(61933007;61873058)

Visual SLAM System Based on Dynamic Semantic Features 

REN Weijian 1a,1b , ZHANG Zhiqiang 1a , KANG Chaohai 1a,1b , HUO Fengcai 1a,1b , SUN Qinjiang 2 , CHEN Jianling   

  1. 1a. Department of Electrical and Information Engineering; 1b. Heilongjiang Provincial Key Laboratory of Networking and Intelligent Control, Northeast Petroleum University, Daqing 163318, China; 2. Tianjin Branch, China National Offshore Oil Corporation, Tianjin 300450, China
  • Received:2023-03-17 Online:2023-11-30 Published:2023-12-01

摘要:  针对视觉 SLAM( Simultaneous Localization and Mapping) 在真实场景下出现动态物体( 如行人, 车辆、 动物)等影响算法定位和建图精确性的问题, 基于 ORB-SLAM3(Oriented FAST and Rotated BRIEF-Simultaneous Localization and Mapping 3)提出了 YOLOv3-ORB-SLAM3 算法。 该算法在 ORB-SLAM3 的基础上增加了语义线 程, 采用动态和静态场景特征提取双线程机制: 语义线程使用 YOLOv3 对场景中动态物体进行语义识别目标检 测, 同时对提取的动态区域特征点进行离群点剔除; 跟踪线程通过 ORB 特征提取场景区域特征, 结合语义信 息获得静态场景特征送入后端, 从而消除动态场景对系统的干扰, 提升视觉 SLAM 算法定位精度。 利用 TUM (Technical University of Munich)数据集验证, 结果表明 YOLOv3-ORB-SLAM3 算法在单目模式下动态序列相比 ORB-SLAM3 算法 ATE(Average Treatment Effect)指标下降30% 左右, RGB-D(Red, Green and Blue-Depth)模式下 动态序列 ATE 指标下降 10% , 静态序列未有明显下降。

关键词:  目标检测, ORB-SLAM3 算法, 动态场景, 单目相机, 深度相机 

Abstract: Aiming at the problems that dynamic objects (such as pedestrins, vehicles, animals) appear in visual SLAM(Simultaneous Localization and Mapping) in real scenes, affect the accuracy of algorithm positioning and mapping, the YOLOv3-ORB-SLAM3(Oriented FAST and Rotated BRIEF-Simultaneous Localization and Mapping 3) algorithm is proposed based on ORB-SLAM3. The algorithm adds a semantic thread on the basis of ORB- SLAM3, and the thread uses YOLOv3 to perform semantic recognition target detection on dynamic objects in the scene. The outliers are removed from the extracted feature points on the tracking thread, and the static environment area extracted by the ORB feature, thereby the positioning accuracy of the visual SLAM algorithm is improved. The TUM(Technical University of Munich) data set is used to verify the positioning accuracy of the algorithm in monocular and RGB-D(Red, Green and Blue-Depth) modes. The verification results show that the dynamic sequence of the YOLOv3-ORB-SLAM3 algorithm in monocular mode is about 30% lower than that of the ORB-SLAM3 algorithm in RGB-D mode, the dynamic sequence decreases by 10% , and the static sequence does not decrease significantly.

Key words: you only look once v3(YOLOv3), oriented FAST and rotated BRIEF-simultaneous localization and mapping 3(ORB-SLAM3), dynamic scene, monocular camera, red, green and blue-depth(RGB-D)

中图分类号: 

  • TP391