Statistic model based dynamic channel compensation for telephony speech recognition
文献类型:期刊论文
作者 | Zhang, HY; Han, ZB; Xu, B![]() |
刊名 | CHINESE JOURNAL OF ELECTRONICS
![]() |
出版日期 | 2004-10-01 |
卷号 | 13期号:4页码:665-670 |
关键词 | Automatic speech recognition (ASR) telephone channel compensation statistic model maximum likelihood estimation maximum a posteriori estimation |
英文摘要 | The degradation of speech recognition performance in real-life environments and through transmission channels is a main embarrassment for many speech-based applications around the world, especially when non-stationary noise and changing channel exist. Previous works have shown that the main reason for this performance degradation is the variational mismatch caused by different telephone channels between the testing and training sets. In this paper, we propose a statistic model based implementation to dynamically compensate this mismatch. Firstly, we focus on a Maximum-likelihood (ML) estimation algorithm for telephone channels. In experiments on Mandarin Large vocabulary continuous speech recognition (LVCSR) over telephone lines, the Character error rate (CER) decreases more than 20%. The average delay is about 300similar to400ms. Secondly, we will extend it by introducing a phone-conditioned prior statistic model for the channels and applying Maximum a posteriori (MAP) estimation technique. Compared to the ML based method, the MAP based algorithm follows with the variations within channels more effectively. Average delay of the algorithm is decreased to 200ms. An additional 7similar to8% CER relative reduction is observed in LVCSR. |
WOS标题词 | Science & Technology ; Technology |
类目[WOS] | Engineering, Electrical & Electronic |
研究领域[WOS] | Engineering |
收录类别 | SCI |
语种 | 英语 |
WOS记录号 | WOS:000224787200024 |
公开日期 | 2015-12-24 |
源URL | [http://ir.ia.ac.cn/handle/173211/8927] ![]() |
专题 | 自动化研究所_09年以前成果 |
作者单位 | 1.Chinese Acad Sci, Inst Automat, Hightech Innovat Ctr, Beijing 100080, Peoples R China 2.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100080, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, HY,Han, ZB,Xu, B. Statistic model based dynamic channel compensation for telephony speech recognition[J]. CHINESE JOURNAL OF ELECTRONICS,2004,13(4):665-670. |
APA | Zhang, HY,Han, ZB,&Xu, B.(2004).Statistic model based dynamic channel compensation for telephony speech recognition.CHINESE JOURNAL OF ELECTRONICS,13(4),665-670. |
MLA | Zhang, HY,et al."Statistic model based dynamic channel compensation for telephony speech recognition".CHINESE JOURNAL OF ELECTRONICS 13.4(2004):665-670. |
入库方式: OAI收割
来源:自动化研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。