Chinese Academy of Sciences Institutional Repositories Grid
Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech

Document type: Journal article

Authors: Li, Peng; Guan, Yong; Xu, Bo; Liu, Wenju
Journal: IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Publication date: 2006-11-01
Volume: 14; Issue: 6; Pages: 2014-2023
Keywords: computational auditory scene analysis (CASA); grouping; monaural speech separation; objective quality assessment of speech (OQAS); segmentation
Abstract: Monaural speech separation is a very challenging problem in speech signal processing. It has been studied extensively, and many separation systems based on computational auditory scene analysis (CASA) have been proposed in the last two decades. Although the research on CASA has tended to introduce high-level knowledge into separation processes using primitive data-driven methods, the knowledge on speech quality still has not been combined with it. This makes the performance evaluation of CASA mainly focused on the signal-to-noise ratio (SNR) improvement. Actually, the quality of the separated speech is not directly related to its SNR. In order to solve this problem, we propose a new method which combines CASA with objective quality assessment of speech (OQAS). In the grouping process of CASA, we use OQAS as the guide to instruct the CASA system. With this combination, the performance of the speech separation can be improved not only in SNR, but also in mean opinion score (MOS). Our system is systematically evaluated and compared with previous systems, and it yields substantially better performance, especially for the subjective perceptual quality of separated speech.
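The abstract refers to SNR improvement as the usual evaluation measure for CASA systems. As a rough illustration only, the sketch below computes SNR against a clean reference and the gain of a separated signal over the unprocessed mixture. It assumes the common textbook definition (reference energy over residual energy, in dB); the function names and toy signals are hypothetical and are not taken from the paper, whose exact evaluation protocol is not given in this record.

```python
import math


def snr_db(reference, estimate):
    """SNR in dB of `estimate` measured against a clean `reference`.

    Assumed definition: 10 * log10(sum(ref^2) / sum((ref - est)^2)).
    """
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")
    signal_energy = sum(r * r for r in reference)
    noise_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    if signal_energy == 0.0:
        raise ValueError("reference signal has zero energy")
    if noise_energy == 0.0:
        return math.inf  # estimate matches the reference exactly
    return 10.0 * math.log10(signal_energy / noise_energy)


def snr_improvement_db(reference, mixture, separated):
    """SNR gain (dB) of the separated signal over the unprocessed mixture."""
    return snr_db(reference, separated) - snr_db(reference, mixture)


# Toy signals (illustrative values, not data from the paper).
reference = [1.0, -1.0, 1.0, -1.0]
mixture = [1.5, -0.5, 1.5, -0.5]    # reference plus a constant offset of 0.5
separated = [1.1, -0.9, 1.1, -0.9]  # residual offset reduced to 0.1

gain = snr_improvement_db(reference, mixture, separated)
print(f"SNR improvement: {gain:.2f} dB")
```

A measure of this kind depends only on residual energy, which is why, as the abstract notes, it need not track perceived quality such as MOS.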
WOS headings: Science & Technology ; Technology
WOS categories: Acoustics ; Engineering, Electrical & Electronic
WOS research areas: Acoustics ; Engineering
WOS keywords: PITCH ; NOISE ; MODEL
Indexed in: SCI
Language: English
WOS accession number: WOS:000241567200014
Source URL: http://ir.ia.ac.cn/handle/173211/9330
Collection: Institute of Automation_Pre-2009 Achievements
Author affiliations:
1. Chinese Acad Sci, Hightech Innovat Ctr, Inst Automat, Beijing 100080, Peoples R China
2. Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100080, Peoples R China
Recommended citation
GB/T 7714: Li, Peng; Guan, Yong; Xu, Bo; et al. Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14(6): 2014-2023.
APA: Li, Peng; Guan, Yong; Xu, Bo; & Liu, Wenju. (2006). Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 14(6), 2014-2023.
MLA: Li, Peng, et al. "Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech". IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 14.6 (2006): 2014-2023.

Ingestion method: OAI harvesting

Source: Institute of Automation


Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.