中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Adaptive Dilated Convolution For Human Pose Estimation

文献类型:会议论文

作者Zhengxiong Luo1,2,3,4; Zhicheng Wang3; Yan Huang1,4; Liang Wang1,4; Tieniu Tan4; Erin Zhou3
出版日期2022-08
会议日期2022-8
会议地点Montréal Québec
英文摘要

Most existing human pose estimation (HPE) methods exploit multi-scale information by fusing feature maps of four different spatial sizes, i.e. 1/4, 1/8, 1/16, and 1/32 of the input image. There are two drawbacks of this strategy: 1) feature maps of different spatial sizes may be not well spatially aligned, which potentially hurts the accuracy of keypoint location; 2) these scales are fixed and inflexible, which may restrict the generalization ability over various human sizes. To- wards these issues, we propose an adaptive dilated convolution (ADC). It can generate and fuse multi-scale features of the same spatial sizes by setting different dilation rates for different channels. Specifically, it uses a regression module to adaptively generate dilation rates for different channels. This also enables ADC to adjust the fused scales according to the sizes of test persons, and thus helps ADC to have better generalization ability. ADC can be end-to-end trained and easily plugged into existing methods. Extensive experiments show that ADC can bring consistent improvements to various HPE methods. The source codes will be released for further research.

语种英语
源URL[http://ir.ia.ac.cn/handle/173211/52066]  
专题自动化研究所_智能感知与计算研究中心
通讯作者Yan Huang
作者单位1.Institute of Automation, Chinese Academy of Sciences (CASIA)
2.School of Artificial Intelligence, University of Chinese Academy of Sciences (UCAS)
3.Megvii Inc
4.Center for Research on Intelligent Perception and Computing (CRIPAC) National Laboratory of Pattern Recognition (NLPR)
推荐引用方式
GB/T 7714
Zhengxiong Luo,Zhicheng Wang,Yan Huang,et al. Adaptive Dilated Convolution For Human Pose Estimation[C]. 见:. Montréal Québec. 2022-8.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。