Chinese Academy of Sciences Institutional Repositories Grid
Remote speaker recognition based on the enhanced LDV-captured speech

Document type: Journal article

Authors: Zhang, Heyong (1); Yan, Chunhui (1); Wu, Shisong (1); Han, Xiyu (1); Lv, Tao (1); Peng, Shuping (2)
Journal: APPLIED ACOUSTICS
Publication date: 2019
Volume: 143; Pages: 165-170
Keywords: Laser Doppler Vibrometer; Remote acoustic detection; Speaker recognition
ISSN: 0003-682X
DOI: 10.1016/j.apacoust.2018.08.007
Corresponding author: Lv, Tao (18767120269@163.com)
Abstract: Speaker recognition is a popular biometric identification technology that determines a speaker's identity from the speaker's voice. However, almost all speaker verification systems perform poorly when the system and the speaker are far apart. To address the challenges of remote speaker recognition, a Laser Doppler Vibrometer (LDV) is used to recognize remote speakers. In this paper, three LDV speech corpora, each consisting of 50 speakers, are collected from the vibrations of a plastic bag, a mineral water bottle and a computer screen, using the LDV developed by us. The distance from the LDV sensor to the vibration targets is approximately 50 m. To improve the quality of the LDV-captured speech, speech enhancement based on the optimally modified log-spectral amplitude (OM-LSA) algorithm is applied. A GMM-UBM model is then built on the enhanced LDV-captured speech to recognize remote speakers. The experimental results show that the average EER using LDV-captured speech is 16.9590%. These results show great promise for using LDV in long-range speaker recognition applications. (C) 2018 Published by Elsevier Ltd.
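Funding project: National Natural Science Foundation of China [61205143]
WOS research area: Acoustics
Language: English
WOS accession number: WOS:000449138700017
Publisher: ELSEVIER SCI LTD
Funding organization: National Natural Science Foundation of China
Source URL: [http://ir.ciomp.ac.cn/handle/181722/60381]
Collection: Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences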
Author affiliations:
1. Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, State Key Lab Laser Interact Matter, Changchun 130033, Jilin, Peoples R China
2. Zhejiang Univ Technol, Hangzhou 310014, Zhejiang, Peoples R China
Recommended citation:
GB/T 7714: Zhang, Heyong, Yan, Chunhui, Wu, Shisong, et al. Remote speaker recognition based on the enhanced LDV-captured speech[J]. APPLIED ACOUSTICS, 2019, 143: 165-170.
APA: Zhang, Heyong, Yan, Chunhui, Wu, Shisong, Han, Xiyu, Lv, Tao, & Peng, Shuping. (2019). Remote speaker recognition based on the enhanced LDV-captured speech. APPLIED ACOUSTICS, 143, 165-170.
MLA: Zhang, Heyong, et al. "Remote speaker recognition based on the enhanced LDV-captured speech". APPLIED ACOUSTICS 143 (2019): 165-170.

Ingest method: OAI harvesting

Source: Changchun Institute of Optics, Fine Mechanics and Physics


Unless otherwise stated, all content in this system is protected by copyright, and all rights are reserved.