中国科学院机构知识库网格系统: 连续语音识别及声调识别的研究

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

连续语音识别及声调识别的研究

文献类型：学位论文


作者	李明
学位类别	博士
答辩日期	1999
授予单位	中国科学院声学研究所
授予地点	中国科学院声学研究所
关键词	连续语音识别声调识别建模方法基于最大后验概率上下文相关
中文摘要	首先我们先概要介绍作者在硕士生期间所做的主要研究工作，然后再就每一章的内容分别加以说明。作者在连续密度HMM方面尝试了一种新的建模方式，即多组加权系数的高斯混合模型，实验表明该方法减少了约14%的错误。在汉语连续语音声调识别方面，作者在聚类映射识别法的基础之上，提出了基于最大后验概率的上下文相关识别方法，也取得了一定的改进效果。此外，作者还建立了两个识别系统。其中连呼数字串识别系统的数字正确率可达98%，汉语连续语音识别系统用863语音库进行测试，PLU(phoneme-like-unit)的首选正确率也在72%以上。第一章回顾了语音识别技术发展的过程，介绍了在发展过程中具有代表性和具有里程碑意义的成果。此外，本章还分析了语音识别中存在的各种困难。第二章总结了目前语音识别技术的整体框架，并对一些主要技术做了必要的论述和分析。本章的另一部分则介绍了语音识别中语音信号分析的一些常用方法。第三章首先概要地论述了语音识别的主流技术——隐马尔克夫模型。接着分别详细说明了在实现离散HMM和连续HMM时遇到的一些具体问题。然后介绍了我们采用连续密度HMM的方法建立的连呼数字串识别系统和汉语连续语音识别系统。在这一章的最后，还介绍了我们在连续HMM建模方法上的探索。第四章介绍了我们在汉语连续语音不认人声调识别方面的研究工作。首先我们介绍了基音的检测方法，然后研究了传统的声调识别方法，最后我们论述了两种较为新颖的声调识别方法——聚类映射识别法和基于最大后验概率的上下文相关识别方法。最后一章简要回顾作者在硕士生期间所做的主要工作。
英文摘要	First we will introduce the main study of the author during the last three years and then describe the content of every chapter respectively. As to the continuous density HMM, the author puts forward a new modeling method-multi-weight coefficient gaussion mixture model. The experiment showed that it reduces about 14% error. On the respect of tone recognition of Chinese continuous speech, the author presents a MAP-based context dependent tone recognition method. Beside these, two recognition systems have been built up. The digit string recognition system reaches 98% accuracy of digit. The Chinese continuous speech recognition system achieves above 72% correct rate of the top one PLU (phoneme-like-unit). In the chapter 1 the author gives a summarization of history of speech recognition. In this chapter, we also list all kinds of difficulty in speech recognition. In the chapter 2 the author reviews the whole framework of speech recognition and some of the key advances in several areas. The other part of this chapter introduces the common method of speech signal analysis. In the chapter 3 the paper concisely introduces the basic theory of Hidden Markov Model. Then the paper discusses the practical issues while implementing the discrete HMM and continuous density HMM. In the following the paper introduces a digit string recognition system and a Chinese continuous speech recognition system. In the end of the chapter, the author presents a new modeling method of HMM. The study on Chinese continuous speaker independent tone recognition is described in details in the chapter 4. The paper reviews the conventional method of tone recognition and then introduces the Cluster-Mapping tone recognition method and the MAP-based Context Dependent tone recognition method. In the last chapter, the author looks back on the main study during the last three years.
语种	中文
公开日期	2011-05-07
页码	62
源URL	[http://159.226.59.140/handle/311008/696]
专题	声学研究所_声学所博硕士学位论文_1981-2009博硕士学位论文
推荐引用方式 GB/T 7714	李明. 连续语音识别及声调识别的研究[D]. 中国科学院声学研究所. 中国科学院声学研究所. 1999.

入库方式： OAI收割

来源：声学研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。