中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Pyramid ALKNet for Semantic Parsing of Building Facade Image

文献类型:期刊论文

作者Ma, Wenguang1; Ma, Wei1; Xu, Shibiao2; Zha, Hongbin3
刊名IEEE GEOSCIENCE AND REMOTE SENSING LETTERS
出版日期2021-06-01
卷号18期号:6页码:1009-1013
关键词Buildings Semantics Kernel Measurement Image segmentation Shape Task analysis Facade parsing large kernel man-made structure nonlocal context
ISSN号1545-598X
DOI10.1109/LGRS.2020.2993451
通讯作者Ma, Wei(mawei@bjut.edu.cn)
英文摘要The semantic parsing of building facade images is a fundamental yet challenging task in urban scene understanding. Existing works sought to tackle this task by using facade grammars or convolutional neural networks (CNNs). The former can hardly generate parsing results coherent with real images while the latter often fails to capture relationships among facade elements. In this letter, we propose a pyramid atrous large kernel (ALK) network (ALKNet) for the semantic segmentation of facade images. The pyramid ALKNet captures long-range dependencies among building elements by using ALK modules in multiscale feature maps. It makes full use of the regular structures of facades to aggregate useful nonlocal context information and thereby is capable of dealing with challenging image regions caused by occlusions, ambiguities, and so on. Experiments on both rectified and unrectified facade data sets show that ALKNet has better performances than those of state-of-the-art methods.
WOS关键词RECONSTRUCTION
资助项目National Natural Science Foundation of China[61771026] ; National Natural Science Foundation of China[61971418] ; National Natural Science Foundation of China[61671451] ; Open Project Program of the National Laboratory of Pattern Recognition (NLPR)
WOS研究方向Geochemistry & Geophysics ; Engineering ; Remote Sensing ; Imaging Science & Photographic Technology
语种英语
WOS记录号WOS:000652799700015
出版者IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
资助机构National Natural Science Foundation of China ; Open Project Program of the National Laboratory of Pattern Recognition (NLPR)
源URL[http://ir.ia.ac.cn/handle/173211/45295]  
专题模式识别国家重点实验室_三维可视计算
通讯作者Ma, Wei
作者单位1.Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
2.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
3.Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Machine Percept MOE, Beijing 100871, Peoples R China
推荐引用方式
GB/T 7714
Ma, Wenguang,Ma, Wei,Xu, Shibiao,et al. Pyramid ALKNet for Semantic Parsing of Building Facade Image[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,2021,18(6):1009-1013.
APA Ma, Wenguang,Ma, Wei,Xu, Shibiao,&Zha, Hongbin.(2021).Pyramid ALKNet for Semantic Parsing of Building Facade Image.IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,18(6),1009-1013.
MLA Ma, Wenguang,et al."Pyramid ALKNet for Semantic Parsing of Building Facade Image".IEEE GEOSCIENCE AND REMOTE SENSING LETTERS 18.6(2021):1009-1013.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。