Pyramid ALKNet for Semantic Parsing of Building Facade Image
文献类型:期刊论文
作者 | Ma, Wenguang1; Ma, Wei1; Xu, Shibiao2![]() |
刊名 | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS
![]() |
出版日期 | 2021-06-01 |
卷号 | 18期号:6页码:1009-1013 |
关键词 | Buildings Semantics Kernel Measurement Image segmentation Shape Task analysis Facade parsing large kernel man-made structure nonlocal context |
ISSN号 | 1545-598X |
DOI | 10.1109/LGRS.2020.2993451 |
通讯作者 | Ma, Wei(mawei@bjut.edu.cn) |
英文摘要 | The semantic parsing of building facade images is a fundamental yet challenging task in urban scene understanding. Existing works sought to tackle this task by using facade grammars or convolutional neural networks (CNNs). The former can hardly generate parsing results coherent with real images while the latter often fails to capture relationships among facade elements. In this letter, we propose a pyramid atrous large kernel (ALK) network (ALKNet) for the semantic segmentation of facade images. The pyramid ALKNet captures long-range dependencies among building elements by using ALK modules in multiscale feature maps. It makes full use of the regular structures of facades to aggregate useful nonlocal context information and thereby is capable of dealing with challenging image regions caused by occlusions, ambiguities, and so on. Experiments on both rectified and unrectified facade data sets show that ALKNet has better performances than those of state-of-the-art methods. |
WOS关键词 | RECONSTRUCTION |
资助项目 | National Natural Science Foundation of China[61771026] ; National Natural Science Foundation of China[61971418] ; National Natural Science Foundation of China[61671451] ; Open Project Program of the National Laboratory of Pattern Recognition (NLPR) |
WOS研究方向 | Geochemistry & Geophysics ; Engineering ; Remote Sensing ; Imaging Science & Photographic Technology |
语种 | 英语 |
WOS记录号 | WOS:000652799700015 |
出版者 | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC |
资助机构 | National Natural Science Foundation of China ; Open Project Program of the National Laboratory of Pattern Recognition (NLPR) |
源URL | [http://ir.ia.ac.cn/handle/173211/45295] ![]() |
专题 | 模式识别国家重点实验室_三维可视计算 |
通讯作者 | Ma, Wei |
作者单位 | 1.Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China 2.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China 3.Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Machine Percept MOE, Beijing 100871, Peoples R China |
推荐引用方式 GB/T 7714 | Ma, Wenguang,Ma, Wei,Xu, Shibiao,et al. Pyramid ALKNet for Semantic Parsing of Building Facade Image[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,2021,18(6):1009-1013. |
APA | Ma, Wenguang,Ma, Wei,Xu, Shibiao,&Zha, Hongbin.(2021).Pyramid ALKNet for Semantic Parsing of Building Facade Image.IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,18(6),1009-1013. |
MLA | Ma, Wenguang,et al."Pyramid ALKNet for Semantic Parsing of Building Facade Image".IEEE GEOSCIENCE AND REMOTE SENSING LETTERS 18.6(2021):1009-1013. |
入库方式: OAI收割
来源:自动化研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。