中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
AutoDet: Pyramid Network Architecture Search for Object Detection

文献类型:期刊论文

作者Li, Zhihang1,2; Xi, Teng3,4; Zhang, Gang3; Liu, Jingtuo3; He, Ran1,2
刊名INTERNATIONAL JOURNAL OF COMPUTER VISION
出版日期2021-01-06
期号4页码:1087-1105
关键词Object detection Neural architecture search Feature pyramids
ISSN号0920-5691
DOI10.1007/s11263-020-01415-x
英文摘要

Feature pyramids have delivered significant improvement in object detection. However, building effective feature pyramids heavily relies on expert knowledge, and also requires strenuous efforts to balance effectiveness and efficiency. Automatic search methods, such as NAS-FPN, automates the design of feature pyramids, but the low search efficiency makes it difficult to apply in a large search space. In this paper, we propose a novel search framework for a feature pyramid network, called AutoDet, which enables to automatic discovery of informative connections between multi-scale features and configure detection architectures with both high efficiency and state-of-the-art performance. In AutoDet, a new search space is specifically designed for feature pyramids in object detectors, which is more general than NAS-FPN. Furthermore, the architecture search process is formulated as a combinatorial optimization problem and solved by a Simulated Annealing-based Network Architecture Search method (SA-NAS). Compared with existing NAS methods, AutoDet ensures a dramatic reduction in search times. For example, our SA-NAS can be up to 30x faster than reinforcement learning-based approaches. Furthermore, AutoDet is compatible with both one-stage and two-stage structures with all kinds of backbone networks. We demonstrate the effectiveness of AutoDet with outperforming single-model results on the COCO dataset. Without pre-training on OpenImages, AutoDet with the ResNet-101 backbone achieves an AP of 39.7 and 47.3 for one-stage and two-stage architectures, respectively, which surpass current state-of-the-art methods.

资助项目Beijing Natural Science Foundation[JQ18017] ; National Natural Science Foundation of China[U20A20223] ; Youth Innovation Promotion Association CAS[Y201929]
WOS研究方向Computer Science
语种英语
WOS记录号WOS:000605541100007
出版者SPRINGER
资助机构Beijing Natural Science Foundation ; National Natural Science Foundation of China ; Youth Innovation Promotion Association CAS
源URL[http://ir.ia.ac.cn/handle/173211/42562]  
专题自动化研究所_智能感知与计算研究中心
通讯作者He, Ran
作者单位1.Chinese Acad Sci, NLPR, CRIPAC, CEBSIT, Beijing, Peoples R China
2.Univ Chinese Acad Sci, Sch Artif Intelligence, Beijing, Peoples R China
3.Baidu Inc, Dept Comp Vis Technol VIS, Beijing, Peoples R China
4.Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
推荐引用方式
GB/T 7714
Li, Zhihang,Xi, Teng,Zhang, Gang,et al. AutoDet: Pyramid Network Architecture Search for Object Detection[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION,2021(4):1087-1105.
APA Li, Zhihang,Xi, Teng,Zhang, Gang,Liu, Jingtuo,&He, Ran.(2021).AutoDet: Pyramid Network Architecture Search for Object Detection.INTERNATIONAL JOURNAL OF COMPUTER VISION(4),1087-1105.
MLA Li, Zhihang,et al."AutoDet: Pyramid Network Architecture Search for Object Detection".INTERNATIONAL JOURNAL OF COMPUTER VISION .4(2021):1087-1105.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。