Journal of Jilin University (Science Edition) ›› 2023, Vol. 61 ›› Issue (4): 883-889.


  • Corresponding author: LI Wenhui, E-mail: liwh@jlu.edu.cn

Instance Segmentation Method Based on Compressed Representation

LI Wenju, LI Wenhui   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received:2022-03-07 Online:2023-07-26 Published:2023-07-26



Abstract: To address the high complexity of mask representation in instance segmentation, we proposed a new mask representation method for image instances, which used three representation units, independent of any prior information, to represent and predict masks, and restored the mask in the form of nonlinear decoding. The method could significantly reduce the representation complexity and inference computation of image instance masks. Based on this representation, we constructed an efficient single-shot instance segmentation model. Experimental results show that, compared with other single-shot instance segmentation models, the proposed model achieves better performance at essentially the same time cost. Additionally, we embedded the representation method into the classic BlendMask model with minimal modifications to reconstruct its attention maps; the improved model infers faster than the original, and the mask average precision improves by 1.5%, indicating that the representation method generalizes well.
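As a rough illustration of the general idea of compressing a mask into a few representation units and restoring it by nonlinear decoding (not the paper's actual method, whose learned units and decoder are not specified here), the toy sketch below stores a blob-shaped binary mask as three hypothetical scalar units (center coordinates and an equal-area radius) and decodes them through a sigmoid:

```python
import numpy as np

def decode_mask(units, size=32):
    """Nonlinearly decode three scalar representation units into a binary mask.

    Toy analogue only: the units (cx, cy, r) describe a circular mask, and a
    sigmoid over the distance field serves as the nonlinear decoder.
    """
    cx, cy, r = units
    ys, xs = np.mgrid[0:size, 0:size]
    dist = np.sqrt((xs - cx) ** 2 + (ys - cy) ** 2)
    soft = 1.0 / (1.0 + np.exp(dist - r))   # smooth nonlinear decoding
    return (soft > 0.5).astype(np.uint8)    # threshold to a binary mask

def encode_mask(mask):
    """Compress a blob-shaped binary mask into three representation units."""
    ys, xs = np.nonzero(mask)
    cx, cy = xs.mean(), ys.mean()           # mask centroid
    r = np.sqrt(mask.sum() / np.pi)         # radius of an equal-area circle
    return np.array([cx, cy, r])

# Round trip: a 32x32 mask (1024 values) is stored as just 3 numbers.
original = decode_mask(np.array([16.0, 16.0, 6.0]))
units = encode_mask(original)
restored = decode_mask(units)
iou = (original & restored).sum() / (original | restored).sum()
```

The point of the sketch is the asymmetry in cost: prediction only needs a few units per instance instead of a full-resolution mask, and the decoder recovers the dense mask afterwards; in the paper this compression is learned rather than hand-crafted.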

Key words: deep learning, instance segmentation, compressed representation, representation unit

CLC number: 

  • TP391.4