Long-Term Head Pose Forecasting Conditioned on the Gaze-Guiding Prior
文献类型:会议论文
作者 | Shentong Mo2; Xin M(辛淼)1![]() |
出版日期 | 2021 |
会议日期 | June 19, 2021 - June 25, 2021 |
会议地点 | Online |
关键词 | Head Pose Forecasting Gaze-Guiding Prior Variational graph autoencoder |
DOI | 10.1109/CVPRW53098.2021.00253 |
英文摘要 | Forecasting head pose future states is a novel task in computer vision. Since future may have many possibilities, and the logical results are much more important than the impractical ones, the forecasting results for most of the scenarios should be not only diverse but also logically realistic. These requirements pose a real challenge to the current methods, which motivates us to seek for better head pose representation and methods to restrict the forecasting reasonably. In this paper, we adopt a spatial-temporal graph to model the interdependencies between the distribution of landmarks and head pose angles. Furthermore, we propose the conditional spatial-temporal variational graph autoen-coder (CST-VGAE), a deep conditional generative model for learning restricted one-to-many mappings conditioned on the spatial-temporal graph input. Specifically, we improve the proposed CST-VGAE for the long-term head pose forecasting task in terms of several aspects. First, we introduce a gaze-guiding prior based on the physiology. Then we apply a temporal self-attention and self-supervised learning mechanism to learn the long-range dependencies on the gaze prior. To better model head poses structurally, we introduce a Gaussian Mixture Model (GMM), instead of a fixed Gaussian in the encoded latent space. Experiments demonstrate the effectiveness of the proposed method for the long-term head pose forecasting task. We achieve superior forecasting performance on the benchmark datasets compared to the existing methods. |
语种 | 英语 |
WOS研究方向 | Computer Science |
WOS记录号 | WOS:000705890202038 |
源URL | [http://ir.ia.ac.cn/handle/173211/51506] ![]() |
专题 | 复杂系统认知与决策实验室 |
通讯作者 | Xin M(辛淼) |
作者单位 | 1.Institute of Automation, Chinese Academy of Sciences 2.Carnegie Mellon University |
推荐引用方式 GB/T 7714 | Shentong Mo,Xin M. Long-Term Head Pose Forecasting Conditioned on the Gaze-Guiding Prior[C]. 见:. Online. June 19, 2021 - June 25, 2021. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。