HIDE: Hierarchical iterative decoding enhancement for multi-view 3D human parameter regression
文献类型:期刊论文
作者 | Lin WT(林伟涛)1,2; Zhang JG(张吉光)1,2![]() ![]() ![]() ![]() |
刊名 | Computer Animation and Virtual Worlds
![]() |
出版日期 | 2024 |
期号 | 35页码:3 |
英文摘要 | Parametric human modeling are limited to either single-view frameworks or simple multi-view frameworks, failing to fully leverage the advantages of easily trainable single-view networks and the occlusion-resistant capabil ities of multi-view images. The prevalent presence of object occlusion and self-occlusion in real-world scenarios leads to issues of robustness and accuracy in predicting human body parameters. Additionally, many methods overlook the spatial connectivity of human joints in the global estimation of model pose parameters, resulting in cumulative errors in continuous joint parameters.To address these challenges, we propose a flexible and efficient iterative decoding strategy. By extending from single-view images to multi-view video inputs, we achieve local-to-global optimization. We utilize attention mechanisms to cap ture the rotational dependencies between any node in the human body and all its ancestor nodes, thereby enhancing pose decoding capability. We employ a parameter-level iterative fusion of multi-view image data to achieve flexible integration of global pose information, rapidly obtaining appropriate projection features from different viewpoints, ultimately resulting in precise parameter estimation. Through experiments, we validate the effectiveness of the HIDE method on the Human3.6M and 3DPW datasets, demonstrating significantly improved visualization results compared to previous methods. |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/57341] ![]() |
专题 | 模式识别国家重点实验室_三维可视计算 |
通讯作者 | Lin WT(林伟涛); Meng WL(孟维亮) |
作者单位 | 1.State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China 2.School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China |
推荐引用方式 GB/T 7714 | Lin WT,Zhang JG,Meng WL,et al. HIDE: Hierarchical iterative decoding enhancement for multi-view 3D human parameter regression[J]. Computer Animation and Virtual Worlds,2024(35):3. |
APA | Lin WT,Zhang JG,Meng WL,Liu XL,&Zhang XP.(2024).HIDE: Hierarchical iterative decoding enhancement for multi-view 3D human parameter regression.Computer Animation and Virtual Worlds(35),3. |
MLA | Lin WT,et al."HIDE: Hierarchical iterative decoding enhancement for multi-view 3D human parameter regression".Computer Animation and Virtual Worlds .35(2024):3. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。