Journal of Jilin University(Engineering and Technology Edition) ›› 2025, Vol. 55 ›› Issue (7): 2425-2433.doi: 10.13229/j.cnki.jdxbgxb.20240744

Previous Articles    

2D human pose estimation algorithm based on adaptive parameterized non-maximum suppression

Jia-bao LI1,2(),Cheng-jun WANG1,2,Wen-hang SU1   

  1. 1.School of Artificial Intelligence,Anhui University of Science and Technology,Huainan 232001,China
    2.Institute of Artificial Intelligence,Hefei Comprehensive National Science Center,Hefei 230026,China
  • Received:2024-07-05 Online:2025-07-01 Published:2025-09-12

Abstract:

To address the two problems of low detector accuracy and redundant keypoints in detection results, a two-dimensional human pose estimation algorithm was proposed based on adaptive parameterized NMS. Replacing the original detector with CenterNet detector improves the performance of human detection and lays the foundation for subsequent pose estimation. Propose an adaptive parameterized PoseNMS algorithm that introduces sample appearance similarity to achieve sample adaptive measurement adjustment, making the filtering conditions in NMS more flexible. A non-uniform sampling method based on detection confidence was proposed, which ensured the effectiveness of the samples during the training process and achieved the discovery and mining of difficult samples. Verified on three datasets, a 71.9% mAP was achieved on the MSCOCO 2017 dataset under the condition of using the detection results output by the detector in this paper as the subsequent target box. In addition, the algorithm proposed in this paper has also been extensively experimented on MPII and MSCOCO 2015, and the quantitative and visual results show that the proposed method effectively solves the above two problems and achieves more accurate attitude estimation.

Key words: CenterNet, key point detection, non-maximum suppression, pose estimation, non-uniform sampling

CLC Number: 

  • TP391

Fig.1

Network structure"

Fig.2

Regional multi person attitude estimation(RMPE) framework"

Fig.3

Framework diagram of CenterNet network"

Fig.4

Framework diagram of adaptive pose distance measurement network"

Fig.5

Comparison between non-uniform sampling method based on detection confidence and sampling method based on Gaussian distribution"

Fig.6

Description of key points in MPII dataset"

Table 1

Comparison results on COCO 2017 test set"

方法mAP模型参数量/M推理时间/ms
文献[2]方法61.8%25150
文献[3]方法65.5%2575
文献[4]方法71.0%3025
本文方法71.9%3022

Fig.7

Visualization results of proposed method on MPII dataset"

Fig.8

Visualization results of proposed method on MSCOCO 2015 test set"

Fig.9

Visualization results of proposed method on MSCOCO 2017 test set"

[1] 张宇, 温光照, 米思娅, 等. 基于深度学习的二维人体姿态估计综述[J].软件学报, 2022, 33(11): 4173-4191.
Zhang Yu, Wen Guang-zhao, Mi Si-ya, et al. A review of two-dimensional human pose estimation based on deep learning[J]. Journal of Software, 2022,33(11): 4173-4191.
[2] 马皖宜, 张德平. 基于多谱注意力高分辨率网络的人体姿态估计[J]. 计算机辅助设计与图形学学报,2022, 34(8): 1283-1292.
Ma Wan-yi, Zhang De-ping. Human pose estimation based on multi-spectral attention high-resolution network[J]. Journal of Computer-Aided Design and Computer Graphics, 2022,34(8):1283-1292.
[3] 吉斌, 潘烨, 金小刚, 等. 用于视频流人体姿态估计的时空信息感知网络[J]. 计算机辅助设计与图形学学报, 2022, 34(2): 189-197.
Ji Bin, Pan Ye, Jin Xiao-gang, et al. Spatio-temporal information perception network for human pose estimation in video streams[J]. Journal of Computer-Aided Design and Computer Graphics, 2022, 34(2):189-197.
[4] Cao Z, Hidalgo G, Simon T,et al. Openpose: realtime multi-person 2D pose estimation using part affinity fields[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 43(1):172-186.
[5] 程旭, 宋晨, 史金钢, 等.基于深度学习的通用目标检测研究综述[J].电子学报, 2021, 49(7): 1428-1438.
Cheng Xu, Song Chen, Shi Jin-gang, et al. A review of general object detection based on deep learning[J]. Acta Electronica Sinica, 2021, 49(7): 1428-1438.
[6] 邵延华, 张铎, 楚红雨, 等. 基于深度学习的YOLO目标检测综述[J]. 电子与信息学报, 2022, 44(10): 3697-3708.
Shao Yan-hua, Zhang Duo, Chu Hong-yu, et al. A review of YOLO object detection based on deep learning[J]. Journal of Electronics & Information Technology, 2022, 44(10): 3697-3708.
[7] 陈丽, 王世勇, 高思莉, 等. Sentinel-2卫星的多光谱轻量级船舶目标检测算法[J]. 光谱学与光谱分析, 2022, 42(9): 2862-2869.
Chen Li, Wang Shi-yong, Gao Si-li, et al. Multispectral lightweight ship detection algorithm for Sentinel-2 satellite[J]. Spectroscopy and Spectral Analysis, 2022, 42(9): 2862-2869.
[8] 陈科圻, 朱志亮, 邓小明, 等.多尺度目标检测的深度学习研究综述[J]. 软件学报, 2021, 32(4): 1201-1227.
Chen Ke-qi, Zhu Zhi-liang, Deng Xiao-ming, et al. A review of deep learning research on multi-scale object detection[J]. Journal of Software, 2021,32(4):1201-1227.
[9] Law H, Deng J. CornerNet: detecting objects as paired keypoints[J]. International Journal of Computer Vision, 2020, 128(3): 642-656.
[10] 曲优, 李文辉. 基于锚框变换的单阶段旋转目标检测方法[J]. 吉林大学学报: 工学版, 2022, 52(1):162-173.
Qu You, Li Wen-hui. Single-stage rotated object detection method based on anchor box transformation[J]. Journal of Jilin University(Engineering and Technology Edition), 2022,52(1): 162-173.
[11] 朱毅琳, 肖秦琨, 杨梦薇. 基于语义图卷积神经网络的三维人体姿态估计[J]. 计算机仿真, 2023, 40(11): 207-211, 402.
Zhu Yi-lin, Xiao Qin-kun, Yang Meng-wei. 3D human pose estimation based on semantic graph convolutional neural network[J]. Computer Simulation, 2023, 40(11): 207-211, 402.
[1] Hua CAI,Rui-kun ZHU,Qiang FU,Wei-gang WANG,Zhi-yong MA,Jun-xi SUN. Human pose estimation corrector algorithm based on implicit key point interconnection [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(3): 1061-1071.
[2] Hua CAI,Yan-yang ZHENG,Qiang FU,Sheng-yu WANG,Wei-gang WANG,Zhi-yong MA. Three-dimensional object detection algorithm based on multi-scale candidate fusion and optimization [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(2): 709-721.
[3] Yu WANG,Kai ZHAO. Postprocessing of human pose heatmap based on sub⁃pixel location [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(5): 1385-1392.
[4] Ren-xiang CHEN,Chao-chao HU,Xiao-lin HU,Li-xia YANG,Jun ZHANG,Jia-le HE. Driver distracted driving detection based on improved YOLOv5 [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(4): 959-968.
[5] Lian-ming WANG,Xin WU. Method for 3D motion parameter measurement based on pose estimation [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(7): 2099-2108.
[6] LIN Jin-hua, WANG Yan-jie, WANG Lu, YAO Yu. Real-time surface reconstruction based global camera pose optimization [J]. 吉林大学学报(工学版), 2018, 48(3): 909-918.
[7] LI Zhen-hui, WANG Hong-guang, WANG Yue-chao, JIANG Yong, YUE Xiang. Line-grasping control for a power transmission line inspection robot [J]. 吉林大学学报(工学版), 2015, 45(5): 1519-1526.
[8] PAN Hai-yang, LIU Shun-an, YAO Yong-ming. Depth information-basd autonomous aerial refueling [J]. 吉林大学学报(工学版), 2014, 44(6): 1750-1756.
[9] TIAN Wen-biao, RUI Guo-sheng, ZHANG Hai-bo, WANG Lin. Non-uniform information acquisition and reconstruction within compressed sensing framework [J]. 吉林大学学报(工学版), 2014, 44(4): 1209-1214.
[10] SONG Huai-bo, SHI Jian-qiang. Pose estimation of varied human faces based on PCA method [J]. 吉林大学学报(工学版), 2013, 43(增刊1): 43-46.
[11] LIU Chang-Ying, GAO Le, GAO Yin-Han, CHE Ren-Sheng. Measurement of coordinates by single camera vision [J]. 吉林大学学报(工学版), 2010, 40(05): 1278-1282.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!