Chinese Academy of Sciences Institutional Repositories Grid
Dynamic fusion of convolutional features based on spatial and temporal attention for visual tracking

Document type: Conference paper

Authors: Zhao, Dongcheng (1, 3); Zeng, Yi (1, 2, 3, 4)
Publication date: 2019-09
Conference date: 14-19 July 2019
Conference location: Budapest, Hungary
Keywords: Paraventricular Thalamus; Spatial Attention; Temporal Attention
DOI: 10.1109/IJCNN.2019.8852301
Abstract

Convolutional neural network (CNN) based trackers have been widely employed in visual object tracking because of their powerful representations. Features from different CNN layers encode different information. Deeper layers contain more semantic information, but their resolution is too coarse to localize the target. Shallower layers carry more detailed information but are less robust to appearance variations. In this paper, we propose an algorithm that incorporates Spatial and Temporal attention to take full advantage of Hierarchical Convolutional Features for Tracking (STHCFT). We first learn correlation filters on each convolutional layer. Based on spatial attention inspired by the paraventricular thalamus (PVT) in the brain, we choose the most important layer to build the base response and use the others as auxiliary responses. In addition, we use temporal attention to determine the weights of the auxiliary responses. Finally, the target is located at the maximum value of the fused responses. Extensive experimental results on the OTB-2013 and OTB-2015 benchmarks show that the proposed algorithm performs favorably against several state-of-the-art trackers.
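
The fusion step described in the abstract (one base response, weighted auxiliary responses, target at the maximum of the fused map) can be illustrated with a minimal Python sketch. The abstract does not give the scoring rules, so everything specific here is an assumption made for illustration and not the authors' method: the function names `peak` and `fuse_responses`, the parameter `sigma`, choosing the base layer by highest peak value as a stand-in for spatial attention, and weighting each auxiliary layer by how close its peak lies to the previous target position as a stand-in for temporal attention. The per-layer correlation filter responses are taken as given inputs.

```python
import math


def peak(response):
    """Return (value, (row, col)) of the maximum of a 2-D response map."""
    best_val, best_pos = float("-inf"), (0, 0)
    for r, row in enumerate(response):
        for c, val in enumerate(row):
            if val > best_val:
                best_val, best_pos = val, (r, c)
    return best_val, best_pos


def fuse_responses(responses, prev_pos, sigma=2.0):
    """Fuse per-layer response maps and locate the target.

    Assumptions (hypothetical, not taken from the paper):
    - the base layer is the one whose response has the highest peak;
    - each auxiliary weight decays with the squared distance between that
      layer's peak and the previous target position `prev_pos`.
    All maps are assumed to share the same shape.
    """
    peaks = [peak(resp) for resp in responses]

    # Stand-in for spatial attention: pick the layer with the strongest peak.
    base_idx = max(range(len(responses)), key=lambda i: peaks[i][0])

    rows, cols = len(responses[0]), len(responses[0][0])
    fused = [list(row) for row in responses[base_idx]]  # copy of base response
    weights = {}

    for i, resp in enumerate(responses):
        if i == base_idx:
            continue
        # Stand-in for temporal attention: trust layers whose peak stays
        # close to where the target was in the previous frame.
        _, (pr, pc) = peaks[i]
        dist2 = (pr - prev_pos[0]) ** 2 + (pc - prev_pos[1]) ** 2
        w = math.exp(-dist2 / (2.0 * sigma ** 2))
        weights[i] = w
        for r in range(rows):
            for c in range(cols):
                fused[r][c] += w * resp[r][c]

    # The target is located at the maximum of the fused response.
    _, target_pos = peak(fused)
    return target_pos, base_idx, weights
```

Calling `fuse_responses(responses, prev_pos)` with a list of equally sized 2-D maps returns the estimated target position, the index of the layer used as the base, and the weights given to the auxiliary layers. Any other scoring rule for layer importance or temporal consistency could be substituted without changing the overall structure.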

Language: English
Source URL: http://ir.ia.ac.cn/handle/173211/44566
Collection: Research Center for Brain-Inspired Intelligence / Brain-Inspired Cognitive Computing
Corresponding author: Zeng, Yi
Author affiliations:
1. Chinese Acad Sci, Inst Automat, Res Ctr Brain Inspired Intelligence, Beijing, Peoples R China
2. Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Shanghai, Peoples R China
3. Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
4. Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
Recommended citation
GB/T 7714
Zhao, Dongcheng, Zeng, Yi. Dynamic fusion of convolutional features based on spatial and temporal attention for visual tracking[C]. In:. Budapest, Hungary. 14-19 July 2019.

Ingest method: OAI harvesting

Source: Institute of Automation

