Adaptive feature fusion with attention mechanism for multi-scale target detection
文献类型:期刊论文
作者 | Ju MR(鞠默然)1,2,3,4,6![]() ![]() ![]() |
刊名 | Neural Computing and Applications
![]() |
出版日期 | 2021 |
卷号 | 33期号:7页码: 2769-2781 |
关键词 | Deep learning Target detection Adaptive feature fusion Attention mechanism |
ISSN号 | 0941-0643 |
产权排序 | 1 |
英文摘要 | To detect the targets of different sizes, multi-scale output is used by target detectors such as YOLO V3 and DSSD. To improve the detection performance, YOLO V3 and DSSD perform feature fusion by combining two adjacent scales. However, the feature fusion only between the adjacent scales is not sufficient. It hasn’t made advantage of the features at other scales. What is more, as a common operation for feature fusion, concatenating can’t provide a mechanism to learn the importance and correlation of the features at different scales. In this paper, we propose adaptive feature fusion with attention mechanism (AFFAM) for multi-scale target detection. AFFAM utilizes pathway layer and subpixel convolution layer to resize the feature maps, which is helpful to learn better and complex feature mapping. In addition, AFFAM utilizes global attention mechanism and spatial position attention mechanism, respectively, to learn the correlation of the channel features and the importance of the spatial features at different scales adaptively. Finally, we combine AFFAM with YOLO V3 to build an efficient multi-scale target detector. The comparative experiments are conducted on PASCAL VOC dataset, KITTI dataset and Smart UVM dataset. Compared with the state-of-the-art target detectors, YOLO V3 with AFFAM achieved 84.34% mean average precision (mAP) at 19.9 FPS on PASCAL VOC dataset, 87.2% mAP at 21 FPS on KITTI dataset and 99.22% mAP at 20.6 FPS on Smart UVM dataset which outperforms other advanced target detectors. |
WOS关键词 | RECOGNITION |
WOS研究方向 | Computer Science |
语种 | 英语 |
WOS记录号 | WOS:000547247700004 |
源URL | [http://ir.sia.cn/handle/173321/27369] ![]() |
专题 | 沈阳自动化研究所_光电信息技术研究室 |
通讯作者 | Luo HB(罗海波) |
作者单位 | 1.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 2.Key Laboratory of Opt-Electronic Information Processing, Chinese Academy of Sciences, Shenyang 3.The Key Laboratory of Image Understanding and Computer Vision, Shenyang Liaoning 110016, China 4.University of Chinese Academy of Sciences, Beijing 100049, China 5.McGill University, Montreal, QC H3A 0G4, Canada 6.Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, Liaoning 110016, China |
推荐引用方式 GB/T 7714 | Ju MR,Luo JN,Wang ZB,et al. Adaptive feature fusion with attention mechanism for multi-scale target detection[J]. Neural Computing and Applications,2021,33(7): 2769-2781. |
APA | Ju MR,Luo JN,Wang ZB,&Luo HB.(2021).Adaptive feature fusion with attention mechanism for multi-scale target detection.Neural Computing and Applications,33(7), 2769-2781. |
MLA | Ju MR,et al."Adaptive feature fusion with attention mechanism for multi-scale target detection".Neural Computing and Applications 33.7(2021): 2769-2781. |
入库方式: OAI收割
来源:沈阳自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。