A Study on Bag of Gaussian Model with Application to Voice Conversion
文献类型:会议论文
作者 | Yu Qiao; Tong Tong; Nobuaki Minematsu |
出版日期 | 2011 |
会议名称 | 12th Annual Conference of the International-Speech-Communication-Association 2011 |
会议地点 | Florence, ITALY |
英文摘要 | The GMM based mapping techniques proved to be an efficient method to find nonlinear regression function between two spaces, and found success in voiceconversion. In these methods, a linear transformation is estimated for each Guassian component, and the final conversion function is a weighted summation ofall linear transformations. These linear transformations fit well for the samples near to the center of at least one Guassian component, but may not deal wellwith the samples far from the centers of all Gaussian distributions. To overcome this problem, this paper proposes Bag of Gaussian Model (BGM). BGM modelconsists of two types of Gaussian distributions, namely basic and complex distributions. Compared with classical GMM, BGM is adaptive for samples. That is for a sample, BGM can select a set of Guassian distributions which fit the sample best. We develop a data-driven method to construct BGM model and show how to estimate regression function with BGM. We carry out experiment on voice conversion tasks. The experimental results exhibit the usefulness of BGM based methods. |
收录类别 | EI |
语种 | 英语 |
源URL | [http://ir.siat.ac.cn:8080/handle/172644/3263] ![]() |
专题 | 深圳先进技术研究院_集成所 |
作者单位 | 2011 |
推荐引用方式 GB/T 7714 | Yu Qiao,Tong Tong,Nobuaki Minematsu. A Study on Bag of Gaussian Model with Application to Voice Conversion[C]. 见:12th Annual Conference of the International-Speech-Communication-Association 2011. Florence, ITALY. |
入库方式: OAI收割
来源:深圳先进技术研究院
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。