中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Long-Term Head Pose Forecasting Conditioned on the Gaze-Guiding Prior

文献类型:会议论文

作者Shentong Mo2; Xin M(辛淼)1
出版日期2021
会议日期June 19, 2021 - June 25, 2021
会议地点Online
关键词Head Pose Forecasting Gaze-Guiding Prior Variational graph autoencoder
DOI10.1109/CVPRW53098.2021.00253
英文摘要

Forecasting head pose future states is a novel task in computer vision. Since future may have many possibilities, and the logical results are much more important than the impractical ones, the forecasting results for most of the scenarios should be not only diverse but also logically realistic. These requirements pose a real challenge to the current methods, which motivates us to seek for better head pose representation and methods to restrict the forecasting reasonably. In this paper, we adopt a spatial-temporal graph to model the interdependencies between the distribution of landmarks and head pose angles. Furthermore, we propose the conditional spatial-temporal variational graph autoen-coder (CST-VGAE), a deep conditional generative model for learning restricted one-to-many mappings conditioned on the spatial-temporal graph input. Specifically, we improve the proposed CST-VGAE for the long-term head pose forecasting task in terms of several aspects. First, we introduce a gaze-guiding prior based on the physiology. Then we apply a temporal self-attention and self-supervised learning mechanism to learn the long-range dependencies on the gaze prior. To better model head poses structurally, we introduce a Gaussian Mixture Model (GMM), instead of a fixed Gaussian in the encoded latent space. Experiments demonstrate the effectiveness of the proposed method for the long-term head pose forecasting task. We achieve superior forecasting performance on the benchmark datasets compared to the existing methods.

语种英语
WOS研究方向Computer Science
WOS记录号WOS:000705890202038
源URL[http://ir.ia.ac.cn/handle/173211/51506]  
专题复杂系统认知与决策实验室
通讯作者Xin M(辛淼)
作者单位1.Institute of Automation, Chinese Academy of Sciences
2.Carnegie Mellon University
推荐引用方式
GB/T 7714
Shentong Mo,Xin M. Long-Term Head Pose Forecasting Conditioned on the Gaze-Guiding Prior[C]. 见:. Online. June 19, 2021 - June 25, 2021.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。