基于背景抑制与噪声监督的人群计数方法

吉林大学学报(信息科学版) ›› 2025, Vol. 43 ›› Issue (3): 615-623.

基于背景抑制与噪声监督的人群计数方法

洪蕾,杨明

西南大学计算机与信息科学学院,重庆400700

收稿日期:2024-05-27 出版日期:2025-06-19 发布日期:2025-06-19
通讯作者: 杨明(1970— ), 女, 山东泰安人, 西南大学副教授, 硕士生导师, 主要从事机器学习、人工智能研究,(Tel)86-13883870736(E-mail)yangming@ swu. edu. cn。
作者简介:洪蕾(1999— ), 女, 郑州人, 西南大学硕士研究生, 主要从事计算机视觉研究, (Tel)86-15136208651(E-mail)roogko@ email. swu. edu. cn
基金资助:
重庆市技术创新与应用发展专项重点基金资助项目(CSTB2023TIAD-KPX0064)

Crowd Counting Method Based on Background Suppression and Noise Supervision

HONG Lei, YANG Ming

College of Computer and Information Science, Southwest University, Chongqing 400700, China

Received:2024-05-27 Online:2025-06-19 Published:2025-06-19

摘要/Abstract

摘要： 针对人群的大尺度变化、复杂的背景、以及标签噪声对计数精准度产生严重影响的问题,提出了一种基于背景抑制与噪声监督的人群计数模型。该模型在编码阶段使用VGG16_bn的前13层作为主干网络, 将初步提取到的特征输入到双分支特征提取模块与背景信息聚合模块,分别缓解人群大尺度变化并提高背景的可辨性。最后融合两个模块所处理的信息, 使用解码器回归生成预测密度图,并与ground truth密度图进行监督以实现对噪声的抑制。与其他算法相比结果表明,该模型的计数精准度有所提升,在ShanghaiTech PartA 上的MAE(Mean Absolute Error)和 MSE(Mean Squared Error)分别为58.1 和95.9; 在 ShanghaiTech PartA 上进行的消融实验也验证了各模块的有效性。该算法能有效地提高人群计数的精度。

关键词: 人群计数, 密度图, 卷积神经网络, 深度学习, 噪声监督

Abstract: A crowd counting model based on background suppression and noise monitoring is proposed to solve the problems of large-scale change of crowd, complex background, and label noise. In the coding stage, the first 13 layers of VGG16_bn are used as the backbone, and the initially extracted features are sent to the two-branch feature extraction module and the background information aggregation module respectively, to mitigate the large- scale changes of the population and improve the discriminability of the background. Finally, the information processed by the two modules is fused, and the predictive density map is generated by decoder regression, which is supervised with the ground truth density map to achieve noise suppression. Compared with other algorithms, the counting accuracy of this model has been improved. MAE(Mean Absolute Error) and MSE(Mean Squared Error) on ShanghaiTech PartA are 58. 1 and 95. 9 respectively. Ablation experiments conducted on ShanghaiTech PartA also verified the effectiveness of the modules. Experimental results show that the algorithm can effectively improve the accuracy of crowd counting.

Key words: crowd counting, density map, convolutional neural network, deep learning, noise supervision

中图分类号:

TP391

洪蕾, 杨明. 基于背景抑制与噪声监督的人群计数方法[J]. 吉林大学学报(信息科学版), 2025, 43(3): 615-623.

HONG Lei, YANG Ming. Crowd Counting Method Based on Background Suppression and Noise Supervision[J]. Journal of Jilin University (Information Science Edition), 2025, 43(3): 615-623.

[1]	朱彦华. 基于改进CNN 的弱边缘超声图像分割方法[J]. 吉林大学学报(信息科学版), 2024, 42(6): 1018-1024.
[2]	周丰丰, 董广宇, 李柯薇. 多头注意力引导卷积网络检测阿尔兹海默症[J]. 吉林大学学报(信息科学版), 2024, 42(6): 1074-1089.
[3]	梅健, 孙珈玥, 邹青宇. 基于人体关键点的滑雪动作评分方法研究 [J]. 吉林大学学报(信息科学版), 2024, 42(5): 866-873.
[4]	刘樱琪, 宋杨, 李梓木, 罗维, 黄新睿, 王昊丰. 基于深度学习的心电信号分析检测系统 [J]. 吉林大学学报(信息科学版), 2023, 41(6): 1135-1142.
[5]	张峻豪, 吴洵, 吴宁, 曲瑞权, 孟凡儒, 张晨松. 基于 Jetson Nano 的智能果蔬采摘机器人设计[J]. 吉林大学学报(信息科学版), 2023, 41(4): 759-766.
[6]	杨莉, 张帅, 鹿卓慧. 基于卷积神经网络的抽油机井故障诊断研究[J]. 吉林大学学报(信息科学版), 2023, 41(4): 646-652.
[7]	任爽, 田振川, 林光辉, 杨凯, 商继财. 改良 GoogLeNet 的电机滚动轴承故障诊断[J]. 吉林大学学报(信息科学版), 2022, 40(3): 371-378.