Chinese Academy of Sciences Institutional Repositories Grid
Deep Neural Network-based Generalized Sidelobe Canceller for Dual-channel Far-field Speech Recognition

Document type: Journal article

Author: Li GJ (李冠君)
Journal: Neural Networks
Publication date: 2021
Volume: 141; Pages: 225-237
Keywords: Deep neural network; Generalized sidelobe canceller; Dual-channel; Far-field speech recognition
Document subtype: Journal
Abstract

The traditional generalized sidelobe canceller (GSC) is a common speech enhancement front end for improving the noise robustness of automatic speech recognition (ASR) systems in far-field conditions. However, the traditional GSC is optimized with signal-level criteria, so it does not guarantee optimal ASR performance. To address this issue, we propose a novel dual-channel deep neural network (DNN)-based GSC structure, called nnGSC, which is optimized with the objective of maximizing ASR performance. Our key idea is to make each module of the traditional GSC fully learnable and to optimize the GSC jointly with the acoustic model. We use the coefficients of the traditional GSC to initialize nnGSC, so that both traditional signal processing knowledge and large amounts of data guide the network learning. In addition, nnGSC can automatically track the target direction-of-arrival (DOA) frame by frame without additional localization algorithms. In the experiments, nnGSC achieves a relative character error rate (CER) improvement of 23.7% over the microphone observation, 13.5% over the oracle direction-based super-directive beamformer, 12.2% over the oracle direction-based traditional GSC, and 5.9% over the oracle mask-based minimum variance distortionless response (MVDR) beamformer. Moreover, training with multi-geometry data improves the robustness of nnGSC against array geometry mismatches.
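
For readers unfamiliar with the traditional GSC that the abstract takes as its starting point, the following is a minimal sketch of a two-microphone, single-frequency-bin GSC: a fixed beamformer, a blocking stage, and one adaptive weight. It is an illustration only and is not the authors' nnGSC. The function names, the batch least-squares weight (in place of a frame-by-frame adaptive update), the known target phase, and the simulated signals are all assumptions made for this example.

```python
import cmath
import math
import random


def gsc_bin(x1, x2, phase):
    """Narrowband two-microphone GSC for one frequency bin (illustrative).

    x1, x2: complex STFT coefficients per frame at the two microphones.
    phase:  assumed inter-microphone phase delay of the target, i.e. the
            target appears as s at mic 1 and s * exp(-1j * phase) at mic 2.
    Returns (fixed beamformer output, GSC output) as lists of complex values.
    """
    align = cmath.exp(1j * phase)  # undoes the target's delay at mic 2
    # Fixed beamformer: average the phase-aligned channels (target gain 1).
    fixed = [0.5 * (a + align * b) for a, b in zip(x1, x2)]
    # Blocking stage: the difference of aligned channels cancels the target,
    # leaving a noise-only reference.
    blocked = [a - align * b for a, b in zip(x1, x2)]
    # Noise canceller: one complex weight, fitted by batch least squares.
    num = sum(f * u.conjugate() for f, u in zip(fixed, blocked))
    den = sum(abs(u) ** 2 for u in blocked)
    weight = num / den if den > 1e-12 else 0.0
    out = [f - weight * u for f, u in zip(fixed, blocked)]
    return fixed, out


def simulate(n_frames=2000, target_phase=0.3, seed=0):
    """Hypothetical test data: one target and one directional interferer."""
    rng = random.Random(seed)
    noise_phase = target_phase + math.pi / 2  # interferer from another direction
    s = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n_frames)]
    v = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n_frames)]
    x1 = [a + b for a, b in zip(s, v)]
    x2 = [a * cmath.exp(-1j * target_phase) + b * cmath.exp(-1j * noise_phase)
          for a, b in zip(s, v)]
    return s, x1, x2


def error_power(estimate, reference):
    """Mean squared error between an estimate and the clean target."""
    return sum(abs(e - r) ** 2 for e, r in zip(estimate, reference)) / len(reference)


if __name__ == "__main__":
    s, x1, x2 = simulate()
    fixed, out = gsc_bin(x1, x2, phase=0.3)
    print("fixed beamformer error:", error_power(fixed, s))
    print("GSC output error:      ", error_power(out, s))
```

In this toy setting the interferer is a single directional source, so one adaptive weight is enough to remove most of it from the fixed beamformer output. The sketch assumes the target phase is known in advance, whereas the abstract states that nnGSC tracks the target DOA without a separate localization step.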

Source URL: http://ir.ia.ac.cn/handle/173211/44846
Collection: National Laboratory of Pattern Recognition_Intelligent Interaction
Author affiliation: Institute of Automation, Chinese Academy of Sciences
Recommended citation
GB/T 7714: Li GJ. Deep Neural Network-based Generalized Sidelobe Canceller for Dual-channel Far-field Speech Recognition[J]. Neural Networks, 2021, 141: 225-237.
APA: Li GJ. (2021). Deep Neural Network-based Generalized Sidelobe Canceller for Dual-channel Far-field Speech Recognition. Neural Networks, 141, 225-237.
MLA: Li GJ. "Deep Neural Network-based Generalized Sidelobe Canceller for Dual-channel Far-field Speech Recognition." Neural Networks 141 (2021): 225-237.

Ingestion method: OAI harvesting

Source: Institute of Automation


Unless otherwise noted, all content in this system is protected by copyright, and all rights are reserved.