中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Statistic model based dynamic channel compensation for telephony speech recognition

文献类型:期刊论文

作者Zhang, HY; Han, ZB; Xu, B
刊名CHINESE JOURNAL OF ELECTRONICS
出版日期2004-10-01
卷号13期号:4页码:665-670
关键词Automatic speech recognition (ASR) telephone channel compensation statistic model maximum likelihood estimation maximum a posteriori estimation
英文摘要The degradation of speech recognition performance in real-life environments and through transmission channels is a main embarrassment for many speech-based applications around the world, especially when non-stationary noise and changing channel exist. Previous works have shown that the main reason for this performance degradation is the variational mismatch caused by different telephone channels between the testing and training sets. In this paper, we propose a statistic model based implementation to dynamically compensate this mismatch. Firstly, we focus on a Maximum-likelihood (ML) estimation algorithm for telephone channels. In experiments on Mandarin Large vocabulary continuous speech recognition (LVCSR) over telephone lines, the Character error rate (CER) decreases more than 20%. The average delay is about 300similar to400ms. Secondly, we will extend it by introducing a phone-conditioned prior statistic model for the channels and applying Maximum a posteriori (MAP) estimation technique. Compared to the ML based method, the MAP based algorithm follows with the variations within channels more effectively. Average delay of the algorithm is decreased to 200ms. An additional 7similar to8% CER relative reduction is observed in LVCSR.
WOS标题词Science & Technology ; Technology
类目[WOS]Engineering, Electrical & Electronic
研究领域[WOS]Engineering
收录类别SCI
语种英语
WOS记录号WOS:000224787200024
公开日期2015-12-24
源URL[http://ir.ia.ac.cn/handle/173211/8927]  
专题自动化研究所_09年以前成果
作者单位1.Chinese Acad Sci, Inst Automat, Hightech Innovat Ctr, Beijing 100080, Peoples R China
2.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100080, Peoples R China
推荐引用方式
GB/T 7714
Zhang, HY,Han, ZB,Xu, B. Statistic model based dynamic channel compensation for telephony speech recognition[J]. CHINESE JOURNAL OF ELECTRONICS,2004,13(4):665-670.
APA Zhang, HY,Han, ZB,&Xu, B.(2004).Statistic model based dynamic channel compensation for telephony speech recognition.CHINESE JOURNAL OF ELECTRONICS,13(4),665-670.
MLA Zhang, HY,et al."Statistic model based dynamic channel compensation for telephony speech recognition".CHINESE JOURNAL OF ELECTRONICS 13.4(2004):665-670.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。