Chinese Academy of Sciences Institutional Repositories Grid
Information geometry theory of high-dimension space and application for speaker independent continuous digit speech recognition

Document type: Journal article

Authors: Wang Shoujue; Cao Wenming; Pan Xiaoxia
Journal: Chinese journal of electronics
Publication date: 2006-10-01
Volume: 15; Issue: 4a; Pages: 768-784
Keywords: High-dimension space; High-dimension space covering theory; Continuous speech of speaker-independent
ISSN: 1022-4653
Corresponding author: Wang Shoujue
Abstract: Drawing on descriptive geometry and notions from set theory, this paper redefines basic elements of space such as curves and surfaces, and presents some fundamental notions of point covering based on the high-dimension space (HDS) point covering theory. It then maps part of the speech signals to points in HDS in order to analyze the distribution of these speech points in HDS, the various geometric covering objects for the speech points, and the relationships among them. The paper also proposes a new algorithm for speaker-independent continuous digit speech recognition, based on the HDS point dynamic searching theory, that requires neither end-point detection nor segmentation. First, a covering area in feature space for continuous speech is established from the different digit syllables in real continuous digit speech. During recognition, the point covering dynamic searching theory in HDS is applied, and satisfactory recognition results are obtained. Finally, the method is compared with an HMM (hidden Markov model)-based method. The trend in the comparison suggests that as the number of samples increases, the difference in recognition rate between the two methods slowly decreases, and as the number of samples becomes very large, both recognition rates gradually approach 100%. In the reported results, the recognition rate of the HDS point covering method is higher than that of the HMM-based method. The authors attribute this to the fact that point covering describes the morphological distribution of speech in HDS, whereas the HMM-based method relies only on a probability distribution, which they argue is less accurate than point covering.
WOS keywords: TONE RECOGNITION; NEURAL-NETWORKS; LANGUAGE; CHINESE
WOS research area: Engineering
WOS category: Engineering, Electrical & Electronic
Language: English
WOS accession number: WOS:000241920200003
Publisher: TECHNOLOGY EXCHANGE LIMITED HONG KONG
URI: http://www.irgrid.ac.cn/handle/1471x/2426862
Collection: Institute of Semiconductors
Author affiliations:
1. Chinese Acad Sci, Inst Semicond, Beijing 100083, Peoples R China
2. Zhejiang Univ Technol, Informat Coll, Inst Intelligent Informat Syst, Hangzhou 310032, Peoples R China
3. Shenzhen Univ, Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China
Recommended citation:
GB/T 7714: Wang Shoujue, Cao Wenming, Pan Xiaoxia. Information geometry theory of high-dimension space and application for speaker independent continuous digit speech recognition[J]. Chinese journal of electronics, 2006, 15(4a): 768-784.
APA: Wang Shoujue, Cao Wenming, & Pan Xiaoxia. (2006). Information geometry theory of high-dimension space and application for speaker independent continuous digit speech recognition. Chinese journal of electronics, 15(4a), 768-784.
MLA: Wang Shoujue, et al. "Information geometry theory of high-dimension space and application for speaker independent continuous digit speech recognition." Chinese journal of electronics 15.4a (2006): 768-784.

Ingest method: harvested via iSwitch

Source: Institute of Semiconductors

