中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting

文献类型:期刊论文

作者Shentong Mo1; Xin M(辛淼)2
刊名IEEE Transactions on Multimedia
出版日期2023
卷号Early Access期号:Early Access页码:Early Access
关键词long-term forecasting spatial-temporal graph transformer Bayesian transformer uncertainty estimation
ISSN号1520-9210
DOI10.1109/TMM.2023.3269219
英文摘要

Human pose forecasting that aims to predict the body poses happening in the future is an important task in computer vision. However, long-term pose forecasting is particularly challenging because modeling long-range dependencies across the spatial-temporal level is hard for joint-based representation. Another challenge is uncertainty prediction since the future prediction is not a deterministic process. In this work, we present a novel B ayesian S patial- T emporal G raph Trans former (BSTG-Trans) for predicting accurate, diverse, and uncertain future poses. First, we apply a spatial-temporal graph transformer as an encoder and a temporal-spatial graph transformer as a decoder for modeling the long-range spatial-temporal dependencies across pose joints to generate the long-term future body poses. Furthermore, we propose a Bayesian sampling module for uncertainty quantization of diverse future poses. Finally, a novel uncertainty estimation metric, namely Uncertainty Absolute Error is introduced for measuring both the accuracy and uncertainty of each predicted future pose. We achieve state-of-the-art performance against other baselines on Human3.6M and HumanEva-I in terms of accuracy, diversity, and uncertainty for long-term pose forecasting. Moreover, our comprehensive ablation studies demonstrate the effectiveness and generalization of each module proposed in our BSTG-Trans. Code and models are available at https://github.com/stoneMo/BSTG-Trans .

语种英语
源URL[http://ir.ia.ac.cn/handle/173211/51503]  
专题复杂系统认知与决策实验室
中国科学院自动化研究所
类脑芯片与系统研究
通讯作者Xin M(辛淼)
作者单位1.Carnegie Mellon University
2.Institute of Automation, Chinese Academy of Sciences
推荐引用方式
GB/T 7714
Shentong Mo,Xin M. BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting[J]. IEEE Transactions on Multimedia,2023,Early Access(Early Access):Early Access.
APA Shentong Mo,&Xin M.(2023).BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting.IEEE Transactions on Multimedia,Early Access(Early Access),Early Access.
MLA Shentong Mo,et al."BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting".IEEE Transactions on Multimedia Early Access.Early Access(2023):Early Access.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。