中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Detection and tracking based tubelet generation for video object detection

文献类型:期刊论文

作者Xiao, Jun-Bin2,3; Wang, Bin2,3; Zhang, Yong-Dong3; Yan, Quan-Feng1; Tang, Sheng3
刊名JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION
出版日期2019
卷号58页码:102-111
ISSN号1047-3203
关键词Object detection Tubelet generation Tubelet fusion
DOI10.1016/j.jvcir.2018.11.014
英文摘要Video object detection (VID) is a more challenging task compared with still-image object detection, which not only needs to detect objects accurately per frame but also needs to track objects for a long period of time. In order to detect objects from videos, we propose a Detection And Tracking (DAT) based tubelet generation framework. Under this framework, we first propose a detection-based tubelet generation method which can generate tubelets with more accurate bounding boxes compared with traditional tracking-based methods. On the other hand, the latter can produce a higher recall of bounding boxes than the former in general. To take advantage of their complementary attributes, we further propose a novel tubelet fusion method to combine these multi-modal information (appearance information in independent images and contextual information in videos). Our extensive experiments on the well-known ILSVRC 2016 VID dataset show that our proposed method can achieve state-of-the-art performances. (C) 2018 Elsevier Inc. All rights reserved.
资助项目National Key Research and Development Program of China[2017YFB1002202] ; National Natural Science Foundation of China[61572472] ; National Natural Science Foundation of China[61525206] ; National Natural Science Foundation of China[U1703261] ; National Natural Science Foundation of China[61571424]
WOS研究方向Computer Science
语种英语
出版者ACADEMIC PRESS INC ELSEVIER SCIENCE
WOS记录号WOS:000457668100011
源URL[http://119.78.100.204/handle/2XEOYT63/3456]  
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Tang, Sheng
作者单位1.Hunan Inst Sci & Technol, Coll Comp Sci, Yueyang 414006, Hunan, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
3.Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Xiao, Jun-Bin,Wang, Bin,Zhang, Yong-Dong,et al. Detection and tracking based tubelet generation for video object detection[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION,2019,58:102-111.
APA Xiao, Jun-Bin,Wang, Bin,Zhang, Yong-Dong,Yan, Quan-Feng,&Tang, Sheng.(2019).Detection and tracking based tubelet generation for video object detection.JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION,58,102-111.
MLA Xiao, Jun-Bin,et al."Detection and tracking based tubelet generation for video object detection".JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION 58(2019):102-111.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。