中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Robust person head detection based on multi-scale representation fusion of deep convolution neural network

文献类型:会议论文

作者Wang Yingying1,2; Yin Yingjie1; Wu Wenqi1,2; Sun Siyang1,2; Wang Xingang1
出版日期2017
会议日期2017.12.5-2017.12.8
会议地点澳门
关键词Person Head Detection Deep Convolution Neural Network Multi-scale Representations Multi-task Learning
页码296-301
英文摘要
Person head detection is still a challenge due to the large variability in heads’ sizes and orientations, lighting conditions and strong occlusions. Small heads require local information contained in low level layers instead of semantic features of upper layers. But most of these fine details are lost in the early convolutional layers of the deep convolution neural networks (DCNN). In order to improve the overall detection accuracy, it is important to utilize local information from lower layers into the detection framework. In this letter, we use multi-scale representation fusion of DCNN as a way to incorporate lower layers with upper layers for detection. Our proposed model is based on the recent object detection network Single Shot MultiBox Detector (SSD). VGG16 is used as the base network. Batch normalization (BN) layers are used in our proposed multi-task learning method to accelerate training process and improve the robustness. Compared to state-of-the-art methods, our proposed detector achieves superior person head detection performance on the HollywoodHeads dataset (81.0 AP) and Casablance dataset (78.5 AP).
源URL[http://ir.ia.ac.cn/handle/173211/19753]  
专题精密感知与控制研究中心_精密感知与控制
作者单位1.自动化研究所精密感知与控制研究中心
2.中国科学院大学
推荐引用方式
GB/T 7714
Wang Yingying,Yin Yingjie,Wu Wenqi,et al. Robust person head detection based on multi-scale representation fusion of deep convolution neural network[C]. 见:. 澳门. 2017.12.5-2017.12.8.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。