A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition
文献类型:期刊论文
作者 | Liang, Xingcan1,2; Xu, Linsen5![]() |
刊名 | VISUAL COMPUTER
![]() |
出版日期 | 2022-02-13 |
关键词 | Facial expression recognition CNNs Transformers Feature fusion Robust on occlusions and head-pose variations |
ISSN号 | 0178-2789 |
DOI | 10.1007/s00371-022-02413-5 |
通讯作者 | Xu, Linsen(lsxu@hhu.edu.cn) |
英文摘要 | Facial expression recognition (FER) has attracted much more attention due to its broad range of applications. Occlusions and head-pose variations are two major obstacles for automatic FER. In this paper, we propose a convolution-transformer dual branch network (CT-DBN) that takes advantage of local and global facial information to tackle the real-word occlusions and head-pose variant robust FER. The CT-DBN contains two branches. Taking into account local modeling ability of CNN, the first branch utilizes CNN to capture local edge information. Inspired by transformers' successful application in natural language processing, we employ transformer to the second branch to be responsible for obtaining better global representation. Then, a local-global feature fusion module is proposed to adaptively integrate both features to hybrid features and model the relationship between them. With the help of feature fusion module, our network not only integrates local and global features in an adaptive weighting manner but can also learn the corresponding distinguishable features autonomously. Experimental results under inner-database and cross-database evaluation on four leading facial expression databases illustrate that our proposed CT-DBN outperforms other state-of-the-art methods and achieves robust performance under in-the-wild condition. |
资助项目 | National Key R&D Program of China[2017YFB1303200] ; Jiangsu Special Project for Frontier Leading Base Technology[BK20192004] ; Key Support Project of Dean Fund of Hefei Institutes of Physical Science, CAS[YZJJZX202017] ; Strategic High-tech Innovation Fund of Chinese Academy of Sciences[GQRC-19-15] |
WOS研究方向 | Computer Science |
语种 | 英语 |
WOS记录号 | WOS:000754455000001 |
出版者 | SPRINGER |
资助机构 | National Key R&D Program of China ; Jiangsu Special Project for Frontier Leading Base Technology ; Key Support Project of Dean Fund of Hefei Institutes of Physical Science, CAS ; Strategic High-tech Innovation Fund of Chinese Academy of Sciences |
源URL | [http://ir.hfcas.ac.cn:8080/handle/334002/127808] ![]() |
专题 | 中国科学院合肥物质科学研究院 |
通讯作者 | Xu, Linsen |
作者单位 | 1.Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei 230031, Peoples R China 2.Univ Sci & Technol China, Hefei 230026, Peoples R China 3.Changzhou Univ, Sch Microelect & Control Engn, Changzhou 213164, Peoples R China 4.Anhui Jianzhu Univ, Sch Elect & Informat Engn, Hefei 230009, Peoples R China 5.Hohai Univ, Coll Mech & Elect Engn, Changzhou 213022, Peoples R China |
推荐引用方式 GB/T 7714 | Liang, Xingcan,Xu, Linsen,Zhang, Wenxiang,et al. A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition[J]. VISUAL COMPUTER,2022. |
APA | Liang, Xingcan,Xu, Linsen,Zhang, Wenxiang,Zhang, Yan,Liu, Jinfu,&Liu, Zhipeng.(2022).A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition.VISUAL COMPUTER. |
MLA | Liang, Xingcan,et al."A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition".VISUAL COMPUTER (2022). |
入库方式: OAI收割
来源:合肥物质科学研究院
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。