吉林大学学报(工学版) ›› 2025, Vol. 55 ›› Issue (7): 2425-2433.doi: 10.13229/j.cnki.jdxbgxb.20240744

• 通信与控制工程 • 上一篇    

基于自适应参数化非极大值抑制的二维人体姿态估计算法

李家宝1,2(),王成军1,2,苏文杭1   

  1. 1.安徽理工大学 人工智能学院,安徽 淮南 232001
    2.合肥综合性国家科学中心 人工智能研究院,合肥 230026
  • 收稿日期:2024-07-05 出版日期:2025-07-01 发布日期:2025-09-12
  • 作者简介:李家宝(1998-),男,博士研究生.研究方向:人体姿态估计,人工智能. E-mail: 3196177554@qq.com
  • 基金资助:
    安徽省高校协同创新项目(GXXT-2022-053);国家创新方法工作专项项目(2018IM010500);淮南市科技计划项目(2021A242)

2D human pose estimation algorithm based on adaptive parameterized non-maximum suppression

Jia-bao LI1,2(),Cheng-jun WANG1,2,Wen-hang SU1   

  1. 1.School of Artificial Intelligence,Anhui University of Science and Technology,Huainan 232001,China
    2.Institute of Artificial Intelligence,Hefei Comprehensive National Science Center,Hefei 230026,China
  • Received:2024-07-05 Online:2025-07-01 Published:2025-09-12

摘要:

针对检测器的准确率低和检测结果导致的关键点冗余两个问题,提出了基于自适应参数化NMS的二维人体姿态估计算法。首先,采用CenterNet检测器取代原有检测器,提高了人体检测的性能,并为后续姿态估计打下基础。然后,提出了自适应参数化的PoseNMS算法,引入样本外观相似度实现样本自适应度量调节,使NMS中的过滤条件更加灵活。最后,提出了一种基于检测置信度的非均匀采样方法,在训练过程中保证了样本的有效性,实现了困难样本的发现与挖掘。在MSCOCO 2017数据集上采用本文检测器输出的检测结果作为后续目标框的条件取得了71.9%的mAP。另外,本文算法同样在MPII和MSCOCO 2015数据集上进行了大量实验,定量实验与可视化结果说明本文方法有效解决了上述两个问题,实现了更准确的姿态估计。

关键词: CenterNet, 关键点检测, 非极大值抑制, 姿态估计, 非均匀采样

Abstract:

To address the two problems of low detector accuracy and redundant keypoints in detection results, a two-dimensional human pose estimation algorithm was proposed based on adaptive parameterized NMS. Replacing the original detector with CenterNet detector improves the performance of human detection and lays the foundation for subsequent pose estimation. Propose an adaptive parameterized PoseNMS algorithm that introduces sample appearance similarity to achieve sample adaptive measurement adjustment, making the filtering conditions in NMS more flexible. A non-uniform sampling method based on detection confidence was proposed, which ensured the effectiveness of the samples during the training process and achieved the discovery and mining of difficult samples. Verified on three datasets, a 71.9% mAP was achieved on the MSCOCO 2017 dataset under the condition of using the detection results output by the detector in this paper as the subsequent target box. In addition, the algorithm proposed in this paper has also been extensively experimented on MPII and MSCOCO 2015, and the quantitative and visual results show that the proposed method effectively solves the above two problems and achieves more accurate attitude estimation.

Key words: CenterNet, key point detection, non-maximum suppression, pose estimation, non-uniform sampling

中图分类号: 

  • TP391

图1

网络结构"

图2

区域多人姿态估计(RMPE)框架"

图3

CenterNet网络的框架图"

图4

自适应位姿距离度量网络框架图"

图5

基于检测置信度的非均匀采样方法与高斯分布的采样方法对比"

图6

MPII数据集关键点描述"

表1

COCO2017测试集上的对比结果说明"

方法mAP模型参数量/M推理时间/ms
文献[2]方法61.8%25150
文献[3]方法65.5%2575
文献[4]方法71.0%3025
本文方法71.9%3022

图7

本文方法在MPII数据集上的可视化结果"

图8

本文方法在MSCOCO 2015测试集上的可视化结果"

图9

本文方法在MSCOCO 2017测试集上的可视化结果"

[1] 张宇, 温光照, 米思娅, 等. 基于深度学习的二维人体姿态估计综述[J].软件学报, 2022, 33(11): 4173-4191.
Zhang Yu, Wen Guang-zhao, Mi Si-ya, et al. A review of two-dimensional human pose estimation based on deep learning[J]. Journal of Software, 2022,33(11): 4173-4191.
[2] 马皖宜, 张德平. 基于多谱注意力高分辨率网络的人体姿态估计[J]. 计算机辅助设计与图形学学报,2022, 34(8): 1283-1292.
Ma Wan-yi, Zhang De-ping. Human pose estimation based on multi-spectral attention high-resolution network[J]. Journal of Computer-Aided Design and Computer Graphics, 2022,34(8):1283-1292.
[3] 吉斌, 潘烨, 金小刚, 等. 用于视频流人体姿态估计的时空信息感知网络[J]. 计算机辅助设计与图形学学报, 2022, 34(2): 189-197.
Ji Bin, Pan Ye, Jin Xiao-gang, et al. Spatio-temporal information perception network for human pose estimation in video streams[J]. Journal of Computer-Aided Design and Computer Graphics, 2022, 34(2):189-197.
[4] Cao Z, Hidalgo G, Simon T,et al. Openpose: realtime multi-person 2D pose estimation using part affinity fields[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 43(1):172-186.
[5] 程旭, 宋晨, 史金钢, 等.基于深度学习的通用目标检测研究综述[J].电子学报, 2021, 49(7): 1428-1438.
Cheng Xu, Song Chen, Shi Jin-gang, et al. A review of general object detection based on deep learning[J]. Acta Electronica Sinica, 2021, 49(7): 1428-1438.
[6] 邵延华, 张铎, 楚红雨, 等. 基于深度学习的YOLO目标检测综述[J]. 电子与信息学报, 2022, 44(10): 3697-3708.
Shao Yan-hua, Zhang Duo, Chu Hong-yu, et al. A review of YOLO object detection based on deep learning[J]. Journal of Electronics & Information Technology, 2022, 44(10): 3697-3708.
[7] 陈丽, 王世勇, 高思莉, 等. Sentinel-2卫星的多光谱轻量级船舶目标检测算法[J]. 光谱学与光谱分析, 2022, 42(9): 2862-2869.
Chen Li, Wang Shi-yong, Gao Si-li, et al. Multispectral lightweight ship detection algorithm for Sentinel-2 satellite[J]. Spectroscopy and Spectral Analysis, 2022, 42(9): 2862-2869.
[8] 陈科圻, 朱志亮, 邓小明, 等.多尺度目标检测的深度学习研究综述[J]. 软件学报, 2021, 32(4): 1201-1227.
Chen Ke-qi, Zhu Zhi-liang, Deng Xiao-ming, et al. A review of deep learning research on multi-scale object detection[J]. Journal of Software, 2021,32(4):1201-1227.
[9] Law H, Deng J. CornerNet: detecting objects as paired keypoints[J]. International Journal of Computer Vision, 2020, 128(3): 642-656.
[10] 曲优, 李文辉. 基于锚框变换的单阶段旋转目标检测方法[J]. 吉林大学学报: 工学版, 2022, 52(1):162-173.
Qu You, Li Wen-hui. Single-stage rotated object detection method based on anchor box transformation[J]. Journal of Jilin University(Engineering and Technology Edition), 2022,52(1): 162-173.
[11] 朱毅琳, 肖秦琨, 杨梦薇. 基于语义图卷积神经网络的三维人体姿态估计[J]. 计算机仿真, 2023, 40(11): 207-211, 402.
Zhu Yi-lin, Xiao Qin-kun, Yang Meng-wei. 3D human pose estimation based on semantic graph convolutional neural network[J]. Computer Simulation, 2023, 40(11): 207-211, 402.
[1] 才华,朱瑞昆,付强,王伟刚,马智勇,孙俊喜. 基于隐式关键点互联的人体姿态估计矫正器算法[J]. 吉林大学学报(工学版), 2025, 55(3): 1061-1071.
[2] 才华,郑延阳,付强,王晟宇,王伟刚,马智勇. 基于多尺度候选融合与优化的三维目标检测算法[J]. 吉林大学学报(工学版), 2025, 55(2): 709-721.
[3] 王宇,赵凯. 基于亚像素定位的人体姿态热图后处理[J]. 吉林大学学报(工学版), 2024, 54(5): 1385-1392.
[4] 陈仁祥,胡超超,胡小林,杨黎霞,张军,何家乐. 基于改进YOLOv5的驾驶员分心驾驶检测[J]. 吉林大学学报(工学版), 2024, 54(4): 959-968.
[5] 王连明,吴鑫. 基于姿态估计的物体3D运动参数测量方法[J]. 吉林大学学报(工学版), 2023, 53(7): 2099-2108.
[6] 管乃彦,郭娟利. 基于姿态估计算法的组件感知自适应模型[J]. 吉林大学学报(工学版), 2020, 50(5): 1850-1855.
[7] 林金花, 王延杰, 王璐, 姚禹. 全局相机姿态优化下的快速表面重建[J]. 吉林大学学报(工学版), 2018, 48(3): 909-918.
[8] 田文飚, 芮国胜, 张海波, 王林. 下非均匀信息采集及重构[J]. 吉林大学学报(工学版), 2014, 44(4): 1209-1214.
[9] 宋怀波, 史建强. 应用PCA理论进行多人脸姿态估计的方法[J]. 吉林大学学报(工学版), 2013, 43(增刊1): 43-46.
[10] 刘长英, 高乐, 高印寒, 车仁生. 单目视觉坐标测量方法[J]. 吉林大学学报(工学版), 2010, 40(05): 1278-1282.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!