Detection and tracking based tubelet generation for video object detection
文献类型:期刊论文
作者 | Xiao, Jun-Bin2,3; Wang, Bin2,3; Zhang, Yong-Dong3; Yan, Quan-Feng1; Tang, Sheng3 |
刊名 | JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION |
出版日期 | 2019 |
卷号 | 58页码:102-111 |
ISSN号 | 1047-3203 |
关键词 | Object detection Tubelet generation Tubelet fusion |
DOI | 10.1016/j.jvcir.2018.11.014 |
英文摘要 | Video object detection (VID) is a more challenging task compared with still-image object detection, which not only needs to detect objects accurately per frame but also needs to track objects for a long period of time. In order to detect objects from videos, we propose a Detection And Tracking (DAT) based tubelet generation framework. Under this framework, we first propose a detection-based tubelet generation method which can generate tubelets with more accurate bounding boxes compared with traditional tracking-based methods. On the other hand, the latter can produce a higher recall of bounding boxes than the former in general. To take advantage of their complementary attributes, we further propose a novel tubelet fusion method to combine these multi-modal information (appearance information in independent images and contextual information in videos). Our extensive experiments on the well-known ILSVRC 2016 VID dataset show that our proposed method can achieve state-of-the-art performances. (C) 2018 Elsevier Inc. All rights reserved. |
资助项目 | National Key Research and Development Program of China[2017YFB1002202] ; National Natural Science Foundation of China[61572472] ; National Natural Science Foundation of China[61525206] ; National Natural Science Foundation of China[U1703261] ; National Natural Science Foundation of China[61571424] |
WOS研究方向 | Computer Science |
语种 | 英语 |
出版者 | ACADEMIC PRESS INC ELSEVIER SCIENCE |
WOS记录号 | WOS:000457668100011 |
源URL | [http://119.78.100.204/handle/2XEOYT63/3456] |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Tang, Sheng |
作者单位 | 1.Hunan Inst Sci & Technol, Coll Comp Sci, Yueyang 414006, Hunan, Peoples R China 2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China 3.Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China |
推荐引用方式 GB/T 7714 | Xiao, Jun-Bin,Wang, Bin,Zhang, Yong-Dong,et al. Detection and tracking based tubelet generation for video object detection[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION,2019,58:102-111. |
APA | Xiao, Jun-Bin,Wang, Bin,Zhang, Yong-Dong,Yan, Quan-Feng,&Tang, Sheng.(2019).Detection and tracking based tubelet generation for video object detection.JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION,58,102-111. |
MLA | Xiao, Jun-Bin,et al."Detection and tracking based tubelet generation for video object detection".JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION 58(2019):102-111. |
入库方式: OAI收割
来源:计算技术研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。