中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
A Study on Bag of Gaussian Model with Application to Voice Conversion

文献类型:会议论文

作者Yu Qiao; Tong Tong; Nobuaki Minematsu
出版日期2011
会议名称12th Annual Conference of the International-Speech-Communication-Association 2011
会议地点Florence, ITALY
英文摘要The GMM based mapping techniques proved to be an efficient method to find nonlinear regression function between two spaces, and found success in voiceconversion. In these methods, a linear transformation is estimated for each Guassian component, and the final conversion function is a weighted summation ofall linear transformations. These linear transformations fit well for the samples near to the center of at least one Guassian component, but may not deal wellwith the samples far from the centers of all Gaussian distributions. To overcome this problem, this paper proposes Bag of Gaussian Model (BGM). BGM modelconsists of two types of Gaussian distributions, namely basic and complex distributions. Compared with classical GMM, BGM is adaptive for samples. That is for a sample, BGM can select a set of Guassian distributions which fit the sample best. We develop a data-driven method to construct BGM model and show how to estimate regression function with BGM. We carry out experiment on voice conversion tasks. The experimental results exhibit the usefulness of BGM based methods.
收录类别EI
语种英语
源URL[http://ir.siat.ac.cn:8080/handle/172644/3263]  
专题深圳先进技术研究院_集成所
作者单位2011
推荐引用方式
GB/T 7714
Yu Qiao,Tong Tong,Nobuaki Minematsu. A Study on Bag of Gaussian Model with Application to Voice Conversion[C]. 见:12th Annual Conference of the International-Speech-Communication-Association 2011. Florence, ITALY.

入库方式: OAI收割

来源:深圳先进技术研究院

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。