基于弱监督迁移网络的3D人体关节点识别

doi:10.13229/j.cnki.jdxbgxb.20220280

Abstract

Abstract:

Aiming at the lack of depth information and incomplete spatial structure information of behavior and posture in 2D images， a 3D human joint point recognition method based on weak supervised migration network is proposed. Firstly， an end-to-end 3D human pose estimation framework for real images is proposed. The depth neural network is trained with 2D and 3D mixed label images. In the 2D human pose recognition sub network， the depth regression module is added to improve the 2D human pose recognition sub network to solve the problem of depth ambiguity in 3D human pose recognition； Secondly， in the 3D human pose recognition sub network， 3D geometric constraints are introduced to standardize the human pose recognition. For the case of no real depth label， it can better learn the depth features and effectively solve the problem of human pose recognition with occlusion. In human 3.6m and mpii data sets， the average error of joint point prediction is lower than that of other methods， and has better 3D human posture recognition effect.

Key words: migration network, pose recognition, 3D joint points, geometric constraints, depth regression

CLC Number:

TP391

Zhi-yong SUN,Hong-you LI,Jun-yong YE. 3D human joint point recognition based on weakly supervised migration network[J].Journal of Jilin University(Engineering and Technology Edition), 2024, 54(1): 251-258.

Figures/Tables 7

Fig.1

Fig.2

Fig.3

Table 1

Fig.4

Table 2

Table 3

References 26

1	Insafutdinov E, Pishchulin L, Andres B, et al. DeeperCut: a deeper, stronger, and faster multi-person pose estimation model[J/OL].[2016-12-10].
2	Bulat A, Tzimiropoulos G. Human pose estimation via convolutional part heat map regression[C]∥The 14th European Conference on Computer Vision, Amsterdam, The Netherlands, 2016: 717-732.
3	Chu X, Yang W, Ouyang W, et al. Multi-context attention for human pose estimation[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 1831-1840.
4	Newell A, Yang K, Deng J. Stacked hourglass networks for human pose estimation[C]∥The 14th European Conference Computer Vision, Amsterdam, The Netherlands, 2016: 483-499.
5	Ionescu C, Papava D, Olaru V, et al. Human3.6M: large scale datasets and predictive methods for 3D human sensing in natural environments[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2014, 36(7): 1325-1339.
6	Sigal L, Balan A O, Black M J. Humaneva: synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion[J]. International Journal of Computer Vision, 2010, 87(1/2): 4-27.
7	Zhou X, Sun X, Zhang W, et al. Deep kinematic pose regression[C]∥The 14th European Conference Computer Vision, Amsterdam, The Netherlands, 2016: 186-201.
8	Li S, Chan A B. 3D human pose estimation from monocular images with deep convolutional neural network[C]∥The 12th Asian Conference on Computer Vision, Singapore, 2014: 332-347.
9	Bogo F, Kanazawa A, Lassner C, et al. Keep it SMPL: automatic estimation of 3D human pose and shape from a single image[C]∥The 14th European Conference, Amsterdam, The Netherlands, 2016: 561-578.
10	Chen C H, Ramanan D. 3D human pose estimation= 2d pose estimation+ matching[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA,2017: 7035-7043.
11	Tome D, Russell C, Agapito L. Lifting from the deep: convolutional 3D pose estimation from a single image[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 2500-2509.
12	Wu J, Xue T, Lim J J, et al. Single image 3D interpreter network[C]∥The 14th European Conference, Amsterdam, The Netherlands, 2016: 365-382.
13	Yasin H, Iqbal U, Kruger B, et al. A dual-source approach for 3D pose estimation from a single image[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 4948-4956.
14	Zhou X, Zhu M, Leonardos S, et al. Sparseness meets deepness: 3D human pose estimation from monocular video[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 4966-4975.
15	Wei S E, Ramakrishna V, Kanade T, et al. Convolutional pose machines[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 4724-4732.
16	Akhter I, Black M J. Pose-conditioned joint angle limits for 3D human pose reconstruction[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, USA, 2015: 1446-1455.
17	Ramakrishna V, Kanade T, Sheikh Y. Reconstructing 3D human pose from 2D image landmarks[C]∥The 12th European Conference on Computer Vision, Florence, Italy, 2012: 573-586.
18	Zhou X, Leonardos S, Hu X, et al. 3D shape estimation from 2D landmarks: a convex relaxation approach[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, USA, 2015: 4447-4455.
19	Wei S E, Ramakrishna V, Kanade T, et al. Convolutional pose machines[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 4724-4732.
20	Tome D, Russell C, Agapito L. Lifting from the deep: Convolutional 3D pose estimation from a single image[C]∥Proceedings of the IEEE Conference On Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 2500-2509.
21	Zhou X, Zhu M, Leonardos S, et al. Sparseness meets deepness: 3d human pose estimation from monocular video[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 4966-4975.
22	Zhang Z, Hu L, Deng X, et al. Weakly supervised adversarial learning for 3D human pose estimation from point clouds[J]. IEEE Transactions on Visualization and Computer Graphics, 2020, 26(5): 1851-1859.
23	Hoffman J, Wang D, Yu F, et al. FCNs in the wild: pixel-level adversarial and constraint-based adaptation[J/OL]. [2016-12-10].
24	Zhou X, Zhu M, Pavlakos G, et al. Monocap: monocular human motion capture using a CNN coupled with a geometric prior[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 41(4): 901-914.
25	Mehta D, Rhodin H, Casas D, et al. Monocular 3D human pose estimation using transfer learning and improved CNN supervision[J/OL]. [2016-12-10].
26	Andriluka M, Pishchulin L, Gehler P,et al.Human pose estimation: new benchmark and state of the art analysis[C]∥Computer Vision and Pattern Recognitio,Columbus,USA,2014.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

方法	坐立	抽烟	行走	交谈	吃东西	拍照	打招呼	打电话	购物	等候	遛狗	平均
文献［10］	133.14	106.65	114.05	97.57	89.98	139.17	107.87	107.31	136.09	106.21	87.03	114.18
文献［11］	110.19	84.95	71.36	73.47	76.82	110.67	86.43	86.28	74.79	85.78	86.26	88.39
文献［24］	124.52	107.42	79.36	109.31	87.05	143.32	103.16	116.18	99.78	118.09	114.23	79.9
文献［25］	96.19	70.82	82.03	69.74	60.55	85.42	68.77	76.36	75.04	68.45	54.41	74.14
无约束	74.79	64.34	63.97	61.16	58.12	67.29	71.75	62.54	56.38	68.78	52.22	63.76
有约束	75.20	64.15	63.22	60.70	58.22	65.53	71.41	62.03	55.58	66.05	51.43	63.05

	无约束	有约束
大臂	42.4 mm	37.8 mm
小臂	60.4 mm	50.7 mm
大腿	43.5 mm	43.4 mm
小腿	59.4 mm	47.8 mm
大臂	6.27 px	4.80 px
小臂	10.11 px	6.64 px
大腿	6.89 px	4.93 px
小腿	8.03 px	6.22 px

3D human joint point recognition based on weakly supervised migration network

RICH HTML

PDF (PC)

Abstract

Cite this article

share this article

Figures/Tables 7

References 26

Related Articles 1

Metrics

Comments

Recommended 0