PyMAF-X: Towards Well-Aligned Full-Body Model Regression From Monocular Images
文献类型:期刊论文
作者 | Zhang, Hongwen2; Tian, Yating3; Zhang, Yuxiang2; Li, Mengcheng2; An, Liang2; Sun, Zhenan1; Liu, Yebin2 |
刊名 | IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE |
出版日期 | 2023-10-01 |
卷号 | 45期号:10页码:12287-12303 |
ISSN号 | 0162-8828 |
关键词 | Expressive human mesh recovery full-body motion capture mesh alignment feedback monocular 3D reconstruction |
DOI | 10.1109/TPAMI.2023.3271691 |
通讯作者 | Liu, Yebin(liuyebin@mail.tsinghua.edu.cn) |
英文摘要 | We present PyMAF-X, a regression-based approach to recovering a parametric full-body model from a single image. This task is very challenging since minor parametric deviation may lead to noticeable misalignment between the estimated mesh and the input image. Moreover, when integrating part-specific estimations into the full-body model, existing solutions tend to either degrade the alignment or produce unnatural wrist poses. To address these issues, we propose a Pyramidal Mesh Alignment Feedback (PyMAF) loop in our regression network for well-aligned human mesh recovery and extend it as PyMAF-X for the recovery of expressive full-body models. The core idea of PyMAF is to leverage a feature pyramid and rectify the predicted parameters explicitly based on the mesh-image alignment status. Specifically, given the currently predicted parameters, mesh-aligned evidence will be extracted from finer-resolution features accordingly and fed back for parameter rectification. To enhance the alignment perception, an auxiliary dense supervision is employed to provide mesh-image correspondence guidance while spatial alignment attention is introduced to enable the awareness of the global contexts for our network. When extending PyMAF for full-body mesh recovery, an adaptive integration strategy is proposed in PyMAF-X to produce natural wrist poses while maintaining the well-aligned performance of the part-specific estimations. The efficacy of our approach is validated on several benchmark datasets for body, hand, face, and full-body mesh recovery, where PyMAF and PyMAF-X effectively improve the mesh-image alignment and achieve new The project page with code and video results can be found at https://www.liuyebin.com/pymaf-x. |
WOS关键词 | HUMAN SHAPE ; POSE ; REPRESENTATION |
资助项目 | National Key Ramp;D Program of China ; National Natural Science Foundation of China[2021ZD0113501] ; National Natural Science Foundation of China[62125107] ; National Natural Science Foundation of China[U1836217] ; China Postdoctoral Science Foundation[62276263] ; [2022M721844] |
WOS研究方向 | Computer Science ; Engineering |
语种 | 英语 |
出版者 | IEEE COMPUTER SOC |
WOS记录号 | WOS:001068816800049 |
资助机构 | National Key Ramp;D Program of China ; National Natural Science Foundation of China ; China Postdoctoral Science Foundation |
源URL | [http://ir.ia.ac.cn/handle/173211/53047] |
专题 | 多模态人工智能系统全国重点实验室 |
通讯作者 | Liu, Yebin |
作者单位 | 1.Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China 2.Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China 3.Nanjing Univ, Dept Comp Sci & Technol, Nanjing 210023, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, Hongwen,Tian, Yating,Zhang, Yuxiang,et al. PyMAF-X: Towards Well-Aligned Full-Body Model Regression From Monocular Images[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2023,45(10):12287-12303. |
APA | Zhang, Hongwen.,Tian, Yating.,Zhang, Yuxiang.,Li, Mengcheng.,An, Liang.,...&Liu, Yebin.(2023).PyMAF-X: Towards Well-Aligned Full-Body Model Regression From Monocular Images.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,45(10),12287-12303. |
MLA | Zhang, Hongwen,et al."PyMAF-X: Towards Well-Aligned Full-Body Model Regression From Monocular Images".IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 45.10(2023):12287-12303. |
入库方式: OAI收割
来源:自动化研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。