中国科学院机构知识库网格系统: Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations

Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations

文献类型：期刊论文


作者	Shen, Tianyu2 ; Li, Deqi 2; Wang, Fei-Yue1,3 ; Huang, Hua 2
刊名	IEEE TRANSACTIONS ON MULTIMEDIA
出版日期	2023
卷号	25 页码:1439-1451
关键词	Three-dimensional displays Pose estimation Feature extraction Location awareness Cameras Semantics Solid modeling Human depth perceiving multi-person 3d pose estimation multi-scale representation occlusion handling
ISSN号	1520-9210
DOI	10.1109/TMM.2022.3233251
通讯作者	Huang, Hua(huahuang@bnu.edu.cn)
英文摘要	Estimating absolute 3D poses of multiple people from monocular image is challenging due to the presence of occlusions and the scale variation among different persons. Among the existing methods, the top-down paradigms are highly dependent on human detection which is prone to the influence from inter-person occlusions, while the bottom-up paradigms suffer from the difficulties in keypoint feature extraction caused by scale variation and unreliable joint grouping caused by occlusions. To address these challenges, we introduce a novel multi-person 3D pose estimation framework, aided by multi-scale feature representations and human depth perceiving. Firstly, a waterfall-based architecture is incorporated for multi-scale feature representations to achieve a more accurate estimation of occluded joints with a better detection of human shapes. Then the global and local representations are fused for handling the effects of inter-person occlusion and scale variation in depth perceiving and keypoint feature extraction. Finally, with the guidance of the fused multi-scale representations, a depth-aware model is exploited for better 2D joint grouping and 3D pose recovering. Quantitative and qualitative evaluations on benchmark datasets of MuCo-3DHP and MuPoTS-3D prove the effectiveness of our proposed method. Furthermore, we produce an occluded MuPoTS-3D dataset and the experiments on it validate the superiority of our method for overcoming the occlusions.
WOS研究方向	Computer Science ; Telecommunications
语种	英语
WOS记录号	WOS:000987415000004
出版者	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
源URL	[http://ir.ia.ac.cn/handle/173211/53450]
专题	多模态人工智能系统全国重点实验室
通讯作者	Huang, Hua
作者单位	1.Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China 2.Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China 3.Chinese Acad Sci, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
推荐引用方式 GB/T 7714	Shen, Tianyu,Li, Deqi,Wang, Fei-Yue,et al. Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations[J]. IEEE TRANSACTIONS ON MULTIMEDIA,2023,25:1439-1451.
APA	Shen, Tianyu,Li, Deqi,Wang, Fei-Yue,&Huang, Hua.(2023).Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations.IEEE TRANSACTIONS ON MULTIMEDIA,25,1439-1451.
MLA	Shen, Tianyu,et al."Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations".IEEE TRANSACTIONS ON MULTIMEDIA 25(2023):1439-1451.

入库方式： OAI收割

来源：自动化研究所

下载0

Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations

其他版本