吉林大学学报(信息科学版) ›› 2023, Vol. 41 ›› Issue (6): 1030-1040.

• • 上一篇    下一篇

面向垃圾图像分类的残差语义强化网络 

苏 雯, 徐鑫林, 胡宇超, 黄博涵, 周佩廷   

  1. 浙江理工大学 信息科学与工程学院, 杭州 310018
  • 收稿日期:2023-04-25 出版日期:2023-11-30 发布日期:2023-12-01
  • 作者简介:苏雯(1992— ), 女, 陕西汉中人, 浙江理工大学讲师, 博士, 主要从事语义分割、 单目深度估计和三维场景理解等研究, (Tel)86-13750876801(E-mail)wensu@ zstu. edu. cn
  • 基金资助:
    国家自然科学基金资助项目(62006209)

Network of Residual Semantic Enhancement for Garbage Image Classification

SU Wen, XU Xinlin, HU Yuchao, HUANG Bohan, ZHOU Peiting   

  1. School of Information Science and Engineering, Zhejiang Sci-Tech University, Hangzhou 310018, China
  • Received:2023-04-25 Online:2023-11-30 Published:2023-12-01

摘要: 为更好地保护生态环境并提高可回收垃圾的经济价值, 针对现有的垃圾识别方法面临的分类背景复杂、 垃圾目标形态变化多样等问题, 提出一种面向垃圾图像分类的残差语义强化网络, 能从复杂背景中剥离前景 语义目标。 该网络以骨干残差网络为基础, 利用视觉概念采样、 推理以及调制模块实现视觉语义的提取, 并通过注意力模块消除语义层次和空间分辨率与视觉概念特征的差距, 从而对垃圾目标形态变化更加具有 鲁棒性。 通过在 Kaggle 开源的 12 分类垃圾数据集及 TrashNet 数据集上进行实验, 结果表明, 相较于骨干网络 ResNeXt-50 和其他深层网络, 该算法均取得了性能的提升, 在垃圾图像分类任务上有较好表现。

关键词: 模式识别与智能系统, 垃圾分类, 视觉概念, 视觉采样, 概念推理, 注意力机制 

Abstract: In order to better protect the ecological environment and increase the economic value of recyclable waste, to solve the problems faced by the existing garbage identification methods, such as the complex classification background and the variety of garbage target forms, a residual semantic enhancement network for garbage image classification is proposed, which can strip foreground semantic objects from complex backgrounds. Based on the backbone residual network, the network uses visual concept sampling, inference and modulation modules to achieve visual semantic extraction, and eliminates the gap between semantic level and spatial resolution and visual concept features through the attention module, so as to be more robust to the morphological changes of garbage targets. Through experiments on the Kaggle open source 12 classified garbage dataset and TrashNet dataset, the results show that compared with the backbone network ResNeXt-50 and some other deep networks, the proposed algorithms have improved performance and have good performance in garbage image classification. 

Key words: pattern recognition and intelligent system, garbage classification, visual concept, visual sampling, concept reasoning, attention mechanism

中图分类号: 

  • TP391