中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Kinematic skeleton graph augmented network for human parsing

文献类型:期刊论文

作者Liu, Jinde1,3; Zhang, Zhang1,3; Shan, Caifeng2,4; Tan, Tieniu1,3
刊名NEUROCOMPUTING
出版日期2020-11-06
卷号413页码:457-470
关键词Image segmentation Human parsing Deeplab V3+ Kinematic skeleton graph Human parsing dataset
ISSN号0925-2312
DOI10.1016/j.neucom.2020.07.002
通讯作者Zhang, Zhang(zzhang@nlpr.ia.ac.cn)
英文摘要Human parsing, which is a task of labeling pixels in human images into different fine-grained semantic parts, has achieved significant progress during the past decade. However, there are still several challenges in human parsing, due to occlusions, varying poses and similar appearance between the left/right parts. To tackle these problems, a Human Kinematic Skeleton Graph Layer (HKSGL) is proposed to augment regular neural networks with human kinematic skeleton information. The HKSGL has two major components: kinematic skeleton graph and interconnected modular neural layer. The kinematic skeleton graph is a user pre-defined skeleton graph, which models the interconnections between different semantic parts. Then the skeleton graph is passed to the interconnected modular neural layer which is composed of a set of modular blocks corresponding to different semantic parts. The HKSGL is a lightweight, low costs layer which can be easily attached to any existing neural networks. To demonstrate the power of the HKSGL, a new dataset on human parsing in occlusions is also collected, termed the RAP-Occ. Extensive experiments have been performed on four datasets on human parsing, including the LIP, the CIHP, the ATR and the RAP-Occ. And two popular baselines, i.e., the Deeplab V3+ and the CE2P, are agumented by the proposed HKSGL. Competitive performance of the augmented models has been achieved in comparison with state-of-the-art methods. (C) 2020 Elsevier B.V. All rights reserved.
WOS关键词SEGMENTATION
资助项目National Key Research and Development Program of China[2016YFB1001002] ; Shandong Provincial Key Research and Development Program (Major Scientific and Technological Innovation Project)[2019JZZY010119]
WOS研究方向Computer Science
语种英语
WOS记录号WOS:000579803700037
出版者ELSEVIER
资助机构National Key Research and Development Program of China ; Shandong Provincial Key Research and Development Program (Major Scientific and Technological Innovation Project)
源URL[http://ir.ia.ac.cn/handle/173211/42125]  
专题自动化研究所_智能感知与计算研究中心
通讯作者Zhang, Zhang
作者单位1.Chinese Acad Sci, Ctr Res Intelligent Percept & Comp, Inst Automat, Beijing, Peoples R China
2.Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao, Peoples R China
3.Univ Chinese Acad Sci, Beijing, Peoples R China
4.CAS CAS AIR, Artificial Intelligence Res, Beijing, Peoples R China
推荐引用方式
GB/T 7714
Liu, Jinde,Zhang, Zhang,Shan, Caifeng,et al. Kinematic skeleton graph augmented network for human parsing[J]. NEUROCOMPUTING,2020,413:457-470.
APA Liu, Jinde,Zhang, Zhang,Shan, Caifeng,&Tan, Tieniu.(2020).Kinematic skeleton graph augmented network for human parsing.NEUROCOMPUTING,413,457-470.
MLA Liu, Jinde,et al."Kinematic skeleton graph augmented network for human parsing".NEUROCOMPUTING 413(2020):457-470.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。