Journal of Jilin University (Information Science Edition) ›› 2025, Vol. 43 ›› Issue (5): 1006-1113.

Previous Articles     Next Articles

Semantic SLAM System Based on Improved YOLACT++ 

REN Weijian1, SHEN Wenxu1, REN Lu2, ZHANG Yongfeng3    

  1. 1. School of Electrical and Information Engineering, Northeast Petroleum University, Daqing 163318, China; 2. Offshore Oil Engineering Company Limited, Tianjin 300450, China; 3. Oil Production Plant, Daqing Oilfield Company Limited, Daqing 163414, China
  • Received:2024-05-09 Online:2025-09-28 Published:2025-11-19

Abstract: SLAM(Simultaneous Localization and Mapping) technology is a camera pose estimation based on static scene features. It is susceptible to dynamic objects in the process of feature calculation and matching at its front end. Therefore a method of instance segmentation combined with multi-view geometric constraints is proposed to improve the front-end feature processing of visual SLAM and eliminate the interference of dynamic information. Specifically, in the front end of the ORB-SLAM3(Oriented FAST and Rotated BRIEF-Simultaneous Localization and Mapping3) framework, the YOLACT + + (You Only Look At CoefficienTs + +) instance segmentation thread is paralleled, and the segmented results are used to supplement the multi-view geometric constraint method testing the dynamic consistency of feature points. The EfficientNetV2 network is used to replace the original backbone network of YOLACT + +, and the TensorRT is used to quantify the instance segmentation model to reduce the front-end computing pressure of the algorithm. The test of TUM data set shows that the positioning accuracy of the proposed algorithm in high dynamic environment is 80. 6% higher than that of ORB-SLAM3 algorithm. 

Key words: semantic simultaneous localization and mapping(SLAM), YOLACT++ segmentation algorithm, multi-view geometric constraints, dynamic scene

CLC Number: 

  • TP391