Visual SLAM System Based on Dynamic Semantic Features

Journal of Jilin University (Information Science Edition) ›› 2023, Vol. 41 ›› Issue (6): 1041-1047.

REN Weijian^1a,1b, ZHANG Zhiqiang^1a, KANG Chaohai^1a,1b, HUO Fengcai^1a,1b, SUN Qinjiang^2, CHEN Jianling^2

1a. Department of Electrical and Information Engineering; 1b. Heilongjiang Provincial Key Laboratory of Networking and Intelligent Control, Northeast Petroleum University, Daqing 163318, China; 2. Tianjin Branch, China National Offshore Oil Corporation, Tianjin 300450, China

Received: 2023-03-17 Online: 2023-11-30 Published: 2023-12-01
Corresponding author: KANG Chaohai (b. 1976), male, from Wangkui, Heilongjiang; associate professor and master's supervisor at Northeast Petroleum University; research interests: intelligent algorithms and intelligent control. (Tel) 86-15603690883 (E-mail) kangchaohai@126.com
About the author: REN Weijian (b. 1963), female, from Tailai, Heilongjiang; professor and doctoral supervisor at Northeast Petroleum University; research interests: fault diagnosis of oil and gas gathering and transportation processes. (Tel) 86-15765988699 (E-mail) 1064619284@qq.com
Supported by:
National Natural Science Foundation of China (61933007; 61873058)

Abstract

To address the problem that dynamic objects in real scenes (such as pedestrians, vehicles, and animals) degrade the localization and mapping accuracy of visual SLAM (Simultaneous Localization and Mapping), the YOLOv3-ORB-SLAM3 algorithm is proposed based on ORB-SLAM3 (Oriented FAST and Rotated BRIEF-Simultaneous Localization and Mapping 3). The algorithm adds a semantic thread to ORB-SLAM3 and extracts dynamic and static scene features in two threads. The semantic thread uses YOLOv3 to detect and semantically label dynamic objects in the scene, and removes the feature points extracted in the dynamic regions as outliers. The tracking thread extracts scene features with ORB and combines them with the semantic information to obtain static scene features, which are passed to the back end. This eliminates the interference of dynamic content and improves the localization accuracy of the visual SLAM algorithm. Validation on the TUM (Technical University of Munich) dataset shows that, compared with ORB-SLAM3, YOLOv3-ORB-SLAM3 reduces the ATE (absolute trajectory error) by about 30% on dynamic sequences in monocular mode and by 10% on dynamic sequences in RGB-D (Red, Green and Blue-Depth) mode, with no significant decrease on static sequences.

Key words: you only look once v3 (YOLOv3); oriented FAST and rotated BRIEF-simultaneous localization and mapping 3 (ORB-SLAM3); dynamic scene; monocular camera; red, green and blue-depth (RGB-D)
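The outlier removal described in the abstract, discarding feature points that fall inside detected dynamic regions, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `filter_static_keypoints`, the label set `DYNAMIC_LABELS`, and the detection tuple layout are assumptions made for illustration, and the detector and ORB extractor are replaced by plain arrays so the example stays self-contained.

```python
import numpy as np

# Hypothetical label set; the abstract names pedestrians, vehicles and
# animals only as examples of dynamic objects.
DYNAMIC_LABELS = {"person", "car", "dog"}


def filter_static_keypoints(keypoints, detections, dynamic_labels=DYNAMIC_LABELS):
    """Return the keypoints that lie outside every dynamic bounding box.

    keypoints:  array-like of shape (N, 2) holding (x, y) pixel coordinates.
    detections: iterable of (label, x_min, y_min, x_max, y_max) tuples,
                an assumed layout for the object detector's output.
    """
    pts = np.asarray(keypoints, dtype=float).reshape(-1, 2)
    is_dynamic = np.zeros(len(pts), dtype=bool)
    for label, x0, y0, x1, y1 in detections:
        if label not in dynamic_labels:
            continue  # boxes of static classes do not mask any keypoints
        inside = (
            (pts[:, 0] >= x0) & (pts[:, 0] <= x1)
            & (pts[:, 1] >= y0) & (pts[:, 1] <= y1)
        )
        is_dynamic |= inside
    # Only the remaining (static) points would be handed to tracking.
    return pts[~is_dynamic]
```

In this sketch a keypoint is rejected as soon as it lies inside any box whose class is treated as dynamic. A box-level mask is coarse and also discards static background points that happen to fall inside the box; whether the paper refines this step is not stated in the abstract.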

CLC Number: TP391

REN Weijian, ZHANG Zhiqiang, KANG Chaohai, HUO Fengcai, SUN Qinjiang, CHEN Jianling. Visual SLAM System Based on Dynamic Semantic Features [J]. Journal of Jilin University (Information Science Edition), 2023, 41(6): 1041-1047.
