Chinese Academy of Sciences Institutional Repositories Grid
TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition

Document type: Journal article

Authors: Haowei Liu; Yongcheng Liu; Yuxin Chen; Chunfeng Yuan; Bing Li; Weiming Hu
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Publication date: 2023
Pages: 1-12
Abstract

In skeleton-based action recognition, the dominant paradigm has been
to extract motion features with temporal convolution and to model
spatial correlations with graph convolution. However, it is difficult
for temporal convolution to capture long-range dependencies
effectively. Meanwhile, the commonly used multi-branch graph
convolution leads to high complexity. In this paper, we propose
TranSkeleton, a powerful Transformer framework which neatly unifies
the spatial and temporal modeling of skeleton sequences. For temporal
modeling, we propose a novel partition-aggregation temporal
Transformer. It works with hierarchical temporal partition and
aggregation, and can capture both long-range dependencies and subtle
temporal structures effectively. A difference-aware aggregation
approach is designed to reduce information loss during temporal
aggregation. For spatial modeling, we propose a topology-aware spatial
Transformer which utilizes the prior information of human body
topology to facilitate spatial correlation modeling. Extensive
experiments on two challenging benchmark datasets demonstrate that
TranSkeleton notably outperforms state-of-the-art methods.
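
The hierarchical temporal partition and aggregation mentioned in the abstract can be illustrated with a toy sketch. This is not the paper's method: the abstract gives no formulas, so the window size, the `alpha` weight, and the rule "window mean plus average frame-to-frame difference" are all assumptions chosen for illustration, and the attention computed inside each window by a real Transformer is omitted entirely. The function names `partition`, `aggregate`, and `hierarchy` are hypothetical.

```python
from typing import List

Frame = List[float]  # one feature vector per frame (hypothetical layout)


def partition(seq: List[Frame], window: int) -> List[List[Frame]]:
    """Split a frame sequence into consecutive non-overlapping windows."""
    return [seq[i:i + window] for i in range(0, len(seq), window)]


def aggregate(win: List[Frame], alpha: float = 1.0) -> Frame:
    """Collapse one window into a single token.

    Assumption: token = per-channel mean + alpha * average frame-to-frame
    difference. This only stands in for the idea of keeping motion
    information that plain averaging would discard.
    """
    dim = len(win[0])
    mean = [sum(f[c] for f in win) / len(win) for c in range(dim)]
    if len(win) < 2:
        return mean  # a single frame has no temporal difference
    steps = len(win) - 1
    diff = [
        sum(win[t + 1][c] - win[t][c] for t in range(steps)) / steps
        for c in range(dim)
    ]
    return [m + alpha * d for m, d in zip(mean, diff)]


def hierarchy(seq: List[Frame], window: int = 2) -> List[List[Frame]]:
    """Repeat partition and aggregation until one token remains."""
    if window < 2:
        raise ValueError("window must be at least 2 so each level shrinks")
    levels = [seq]
    while len(levels[-1]) > 1:
        levels.append([aggregate(w) for w in partition(levels[-1], window)])
    return levels


if __name__ == "__main__":
    frames = [[0.0], [1.0], [2.0], [3.0]]
    for depth, level in enumerate(hierarchy(frames)):
        print(depth, level)
```

Each level halves the sequence length when `window` is 2, so later levels summarize progressively longer time spans while the difference term keeps a trace of the motion within each window.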

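Source URL: [http://ir.ia.ac.cn/handle/173211/51592]
Collection: Institute of Automation_National Laboratory of Pattern Recognition_Video Content Security Team
Corresponding author: Chunfeng Yuan
Recommended citation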
GB/T 7714
Haowei Liu, Yongcheng Liu, Yuxin Chen, et al. TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2023: 1-12.
APA Haowei Liu, Yongcheng Liu, Yuxin Chen, Chunfeng Yuan, Bing Li, & Weiming Hu. (2023). TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition. IEEE Transactions on Circuits and Systems for Video Technology, 1-12.
MLA Haowei Liu, et al. "TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition". IEEE Transactions on Circuits and Systems for Video Technology (2023): 1-12.

Ingestion method: OAI harvesting

Source: Institute of Automation


Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.