中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Semi-supervised Ladder Networks for Speech Emotion Recognition

文献类型:期刊论文

作者Jianhua Tao1,2,3; Jian Huang1,2; Ya Li1; Zheng Lian1,2; Mingyue Niu1,2; Tao, Jianhua; Huang, Jian; Li, Ya; Lian, Zheng; Niu, Mingyue
刊名International Journal of Automation and Computing
出版日期2019-03
卷号16期号:4页码:437-448
关键词Speech emotion recognition the ladder network semi-supervised learning autoencoder regularization
英文摘要

As a major component of speech signal processing, speech emotion recognition has become increasingly essential to understanding human communication. Benefitting from deep learning, many researchers have proposed various unsupervised models to extract effective emotional features and supervised models to train emotion recognition systems. In this paper, we utilize semi-supervised ladder networks for speech emotion recognition. The model is trained by minimizing the supervised loss and auxiliary unsupervised cost function. The addition of the unsupervised auxiliary task provides powerful discriminative representations of the input features, and is also regarded as the regularization of the emotional supervised task. We also compare the ladder network with other classical autoencoder structures. The experiments were conducted on the interactive emotional dyadic motion capture (IEMOCAP) database, and the results reveal that the proposed methods achieve superior performance with a small number of labelled data and achieves better performance than other methods.

源URL[http://ir.ia.ac.cn/handle/173211/39297]  
专题模式识别国家重点实验室_智能交互
自动化研究所_学术期刊_International Journal of Automation and Computing
通讯作者Jianhua Tao; Tao, Jianhua
作者单位1.National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
2.School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
3.CAS Center for Excellence in Brain Science and Intelligence Technology, Beijing, China
推荐引用方式
GB/T 7714
Jianhua Tao,Jian Huang,Ya Li,et al. Semi-supervised Ladder Networks for Speech Emotion Recognition[J]. International Journal of Automation and Computing,2019,16(4):437-448.
APA Jianhua Tao.,Jian Huang.,Ya Li.,Zheng Lian.,Mingyue Niu.,...&Niu, Mingyue.(2019).Semi-supervised Ladder Networks for Speech Emotion Recognition.International Journal of Automation and Computing,16(4),437-448.
MLA Jianhua Tao,et al."Semi-supervised Ladder Networks for Speech Emotion Recognition".International Journal of Automation and Computing 16.4(2019):437-448.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。