中国科学院机构知识库网格系统: Gesture recognition based on deep deformable 3D convolutional neural networks

Gesture recognition based on deep deformable 3D convolutional neural networks

文献类型：期刊论文


作者	Zhang, Yifan1,4,5 ; Shi, Lei1,4,5 ; Wu, Yi 3; Cheng, Ke1,4,5 ; Cheng, Jian1,2,4,5 ; Lu, Hanqing1,4,5
刊名	PATTERN RECOGNITION
出版日期	2020-11-01
期号	107 页码:12
关键词	Gesture recognition Spatiotemporal deformable convolution Spatiotemporal convolutional neural network
ISSN号	0031-3203
DOI	10.1016/j.patcog.2020.107416
英文摘要	Dynamic gesture recognition, which plays an essential role in human-computer interaction, has been widely investigated but not yet fully addressed. The challenge mainly lies in three folders: 1) to model both of the spatial appearance and the temporal evolution simultaneously; 2) to address the interference from the varied and complex background; 3) the requirement of real-time processing. In this paper, we address the above challenges by proposing a novel deep deformable 3D convolutional neural network for end-to-end learning, which not only gains impressive accuracy in challenging datasets but also can meet the requirement of the real-time processing. We propose three types of very deep 3D CNNs for gesture recognition, which can directly model the spatiotemporal information with their inherent hierarchical structure. To eliminate the background interference, a light-weight spatiotemporal deformable convolutional module is specially designed to augment the spatiotemporal sampling locations of the 3D convolution by learning additional offsets according to the preceding feature map. It can not only diversify the shape of the convolution kernel to better fit the appearance of the hands and arms, but also help the models pay more attention to the discriminative frames in the video sequence. The proposed method is evaluated on three challenging datasets, EgoGesture, Jester and Chalearn-IsoGD, and achieves the state-of-the-art performance on all of them. Our model ranked first on Jester's official leader-board until the submission time. The code and the trained models are released for better communication and future works(1). (C) 2020 Elsevier Ltd. All rights reserved.
WOS关键词	DATASET ; FUSION ; TIME
资助项目	NSFC[61876182] ; NSFC[61872364] ; NSFC[61876086] ; Jiangsu Frontier Technology Basic Research Project[BK20192004]
WOS研究方向	Computer Science ; Engineering
语种	英语
WOS记录号	WOS:000552866000006
出版者	ELSEVIER SCI LTD
资助机构	NSFC ; Jiangsu Frontier Technology Basic Research Project
源URL	[http://ir.ia.ac.cn/handle/173211/40289]
专题	自动化研究所_模式识别国家重点实验室_图像与视频分析团队类脑芯片与系统研究
通讯作者	Zhang, Yifan
作者单位	1.Chinese Acad Sci, Inst Automat, NLPR, Beijing, Peoples R China 2.CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing, Peoples R China 3.Wormpex AI Res, Bellevue, WA USA 4.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China 5.Chinese Acad Sci, Inst Automat, AIRIA, Beijing, Peoples R China
推荐引用方式 GB/T 7714	Zhang, Yifan,Shi, Lei,Wu, Yi,et al. Gesture recognition based on deep deformable 3D convolutional neural networks[J]. PATTERN RECOGNITION,2020(107):12.
APA	Zhang, Yifan,Shi, Lei,Wu, Yi,Cheng, Ke,Cheng, Jian,&Lu, Hanqing.(2020).Gesture recognition based on deep deformable 3D convolutional neural networks.PATTERN RECOGNITION(107),12.
MLA	Zhang, Yifan,et al."Gesture recognition based on deep deformable 3D convolutional neural networks".PATTERN RECOGNITION .107(2020):12.

入库方式： OAI收割

来源：自动化研究所

下载0

Gesture recognition based on deep deformable 3D convolutional neural networks

其他版本