Efficient Attention Pyramid Network for Semantic Segmentation
文献类型:期刊论文
作者 | Yang QR(杨琦瑞)1,2,3; Ku T(库涛)1,2![]() ![]() |
刊名 | IEEE Access
![]() |
出版日期 | 2021 |
卷号 | 9页码:18867-18875 |
关键词 | Semantic segmentation attention mechanism spatial pyramid PASCAL VOC 2012 Cityscapes |
ISSN号 | 2169-3536 |
产权排序 | 1 |
英文摘要 | Semantic segmentation is a task that covers most of the perception needs of intelligent vehicles in an unified way. Recent studies witnessed that attention mechanisms achieve impressive performance in computer vision task. Current attention mechanisms based segmentation methods differ with each other in position and form of the attention mechanism, and perform differently in practice. This paper firstly introduces the effectiveness of multi-scale context features and attention mechanisms in segmentation tasks. We find that multi-scale and channel attention can play a vital role in constructing effective context features. Based on this analysis, this paper proposes an efficient attention pyramid network (EAPNet) for semantic segmentation. Specifically, to efficient handle the problem of segmenting objects at multiple scales, we design efficient channel attention pyramid (ECAP) which employ atrous convolution with channel attention in cascade or in parallel to capture multi-scale context by using multiple atrous rates. Furthermore, we propose a residual attention fusion block (RAFB), whose purpose is to simultaneously focus on meaningful low-level feature maps and spatial location information. At the same time, we will explore different channel attention modules and spatial attention modules, and describe their impact on network performance. We empirically evaluate our EAPNet on two semantic segmentation datasets, including PASCAL VOC 2012 and Cityscapes datasets. Experimental results show that without MS COCO pre-training and any post-processing, EAPNet achieved 81.7% mIoU on the PASCAL VOC 2012 validation set. With deeplabv3+ as the benchmark, EAPNet improve the model performance of more than 1.50% mIoU. |
资助项目 | National Key Research and Development Program of China[2019YFB17050003] ; National Key Research and Development Program of China[2018YFB1308801] ; National Key Research and Development Program of China[2017YFB0306401] ; Consulting Research Project of the Chinese Academy of Engineering[2019-XZ-7] |
WOS研究方向 | Computer Science ; Engineering ; Telecommunications |
语种 | 英语 |
WOS记录号 | WOS:000619303100001 |
资助机构 | National Key Research and Development Program of China under Grant 2019YFB17050003, Grant 2018YFB1308801, and Grant 2017YFB0306401 ; Consulting Research Project of the Chinese Academy of Engineering under Grant 2019-XZ-7. |
源URL | [http://ir.sia.cn/handle/173321/28337] ![]() |
专题 | 沈阳自动化研究所_数字工厂研究室 |
通讯作者 | Yang QR(杨琦瑞) |
作者单位 | 1.Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China 2.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China 3.School of Computer and Control, University of Chinese Academy of Sciences, Beijing 100049, China |
推荐引用方式 GB/T 7714 | Yang QR,Ku T,Hu KY. Efficient Attention Pyramid Network for Semantic Segmentation[J]. IEEE Access,2021,9:18867-18875. |
APA | Yang QR,Ku T,&Hu KY.(2021).Efficient Attention Pyramid Network for Semantic Segmentation.IEEE Access,9,18867-18875. |
MLA | Yang QR,et al."Efficient Attention Pyramid Network for Semantic Segmentation".IEEE Access 9(2021):18867-18875. |
入库方式: OAI收割
来源:沈阳自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。