中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Improving speech transcription by exploiting user feedback and word repetition

文献类型:期刊论文

作者Wang, Xiangdong1,2; Yang, Ying3; Liu, Hong1,2; Qian, Yueliang1,2
刊名MULTIMEDIA TOOLS AND APPLICATIONS
出版日期2017-10-01
卷号76期号:19页码:20359-20376
关键词Speech transcription Error correction User feedback Repeated word
ISSN号1380-7501
DOI10.1007/s11042-017-4714-x
英文摘要Speech Transcription is important for video/audio retrieval and many other applications. In automatic speech transcription, recognition errors are inevitable, which makes user feedback such as manual error correction necessary. In this paper, an approach is proposed to improve the accuracy of speech transcription by exploiting user feedback and word repetition. The method aims at learning from user feedback and recognition results of preceding utterances and then correcting errors when repeated words are falsely recognized in following utterances. An interaction scheme for user feedback is proposed, which facilitate error correction by candidate lists and provide a new kind of feedback referred to as word indication to extend error correction from repeated words to repeated phrases. For template extraction and matching, the representation of word template and recognition results based on syllable confusion network (SCN) is proposed. During the transcription, templates of multi-syllable words/phrases based on SCN are extracted from user feedback and the N-best lattice, and then matched in SCN corresponding to recognition results of subsequent utterances to yield a new candidate list when repeated words are detected. Experimental results show that considerate error reduction is achieved in the newly-generated candidate lists.
WOS研究方向Computer Science ; Engineering
语种英语
WOS记录号WOS:000409180500058
出版者SPRINGER
源URL[http://119.78.100.204/handle/2XEOYT63/6620]  
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Wang, Xiangdong
作者单位1.Chinese Acad Sci, Beijing Key Lab Mobile Comp & Pervas Device, Inst Comp Technol, Beijing 100190, Peoples R China
2.Chinese Acad Sci, Inst Comp Technol, Res Ctr Ubiquitous Comp Syst, Beijing 100190, Peoples R China
3.China Agr Univ, Beijing 100083, Peoples R China
推荐引用方式
GB/T 7714
Wang, Xiangdong,Yang, Ying,Liu, Hong,et al. Improving speech transcription by exploiting user feedback and word repetition[J]. MULTIMEDIA TOOLS AND APPLICATIONS,2017,76(19):20359-20376.
APA Wang, Xiangdong,Yang, Ying,Liu, Hong,&Qian, Yueliang.(2017).Improving speech transcription by exploiting user feedback and word repetition.MULTIMEDIA TOOLS AND APPLICATIONS,76(19),20359-20376.
MLA Wang, Xiangdong,et al."Improving speech transcription by exploiting user feedback and word repetition".MULTIMEDIA TOOLS AND APPLICATIONS 76.19(2017):20359-20376.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。