中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Temporal-Spatial Mapping for Action Recognition

文献类型:期刊论文

作者Song, Xiaolin2; Lan, Cuiling1; Zeng, Wenjun1; Xing, Junliang3; Sun, Xiaoyan1; Yang, Jingyu2
刊名IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
出版日期2020-03-01
卷号30期号:3页码:748-759
关键词Two dimensional displays Three-dimensional displays Feature extraction Optical imaging Computational modeling Streaming media Kernel Temporal-spatial mapping (TSM) action recognition deep learning
ISSN号1051-8215
DOI10.1109/TCSVT.2019.2896029
通讯作者Lan, Cuiling(culan@microsoft.com) ; Yang, Jingyu(yjy@tju.edu.cn)
英文摘要Deep learning models have enjoyed great success for image related computer vision tasks such as image classification and object detection. For video related tasks such as human action recognition, however, the advancements are not as significant yet. The main challenge is the lack of effective and efficient models in modeling the rich temporal-spatial information in a video. We introduce a simple yet effective operation, termed temporal-spatial mapping, for capturing the temporal evolution of the frames by jointly analyzing all the frames of a video. We propose a video level 2D feature representation by transforming the convolutional features of all frames to a 2D feature map, referred to as VideoMap. With each row being the vectorized feature representation of a frame, the temporal-spatial features are compactly represented, while the temporal dynamic evolution is also well embedded. Based on the VideoMap representation, we further propose a temporal attention model within a shallow convolutional neural network to efficiently exploit the temporal-spatial dynamics. The experiment results show that the proposed scheme achieves state-of-the-art performance, with 4.2% accuracy gain over the temporal segment network, a competing baseline method, on the challenging human action benchmark dataset HMDB51.
资助项目National Science Foundation of China[61672519] ; National Natural Science Foundation of China[61771339] ; Reserved Peiyang Scholar Program of Tianjin University, Tianjin, China
WOS研究方向Engineering
语种英语
WOS记录号WOS:000519551500010
出版者IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
资助机构National Science Foundation of China ; National Natural Science Foundation of China ; Reserved Peiyang Scholar Program of Tianjin University, Tianjin, China
源URL[http://ir.ia.ac.cn/handle/173211/38619]  
专题智能系统与工程
通讯作者Lan, Cuiling; Yang, Jingyu
作者单位1.Microsoft Res Asia, Beijing 100080, Peoples R China
2.Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
3.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Song, Xiaolin,Lan, Cuiling,Zeng, Wenjun,et al. Temporal-Spatial Mapping for Action Recognition[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY,2020,30(3):748-759.
APA Song, Xiaolin,Lan, Cuiling,Zeng, Wenjun,Xing, Junliang,Sun, Xiaoyan,&Yang, Jingyu.(2020).Temporal-Spatial Mapping for Action Recognition.IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY,30(3),748-759.
MLA Song, Xiaolin,et al."Temporal-Spatial Mapping for Action Recognition".IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 30.3(2020):748-759.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。