A study on invariance of f-divergence and its application to speech recognition
文献类型:期刊论文
作者 | Yu Qiao; Nobuaki Minematsu |
刊名 | IEEE TRANSACTIONS ON SIGNAL PROCESSING
![]() |
出版日期 | 2010 |
卷号 | 58期号:7页码:3884-3890 |
英文摘要 | Identifying features invariant to certain transformations is a fundamental problem in the fields of signal processing and pattern recognition. This correspondence explores a family of measures called f-divergences that are invariant to invertible transformations, and studies their application to speech recognition. We provide novel proofs for the sufficiency and necessity of the invariance of f-divergence. Several techniques to calculate or approximate f-divergences in general cases and for special distributions such as Gaussian and Gaussian mixture are reviewed. We show how to construct an invariant structural representation from sequence data through maximum likelihood decomposition, and prove the invariance of this decomposition. We demonstrate an application of this invariant representation to recognizing connected Japanese vowel utterances. In addition, we propose several techniques to improve the recognition performance. The experimental results show that the invariant structure achieves better performance than hidden Markov models, a widely used technique for acoustic modeling of speech sounds |
收录类别 | SCI |
原文出处 | http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5440958 |
语种 | 英语 |
源URL | [http://ir.siat.ac.cn:8080/handle/172644/2677] ![]() |
专题 | 深圳先进技术研究院_集成所 |
作者单位 | IEEE TRANSACTIONS ON SIGNAL PROCESSING |
推荐引用方式 GB/T 7714 | Yu Qiao,Nobuaki Minematsu. A study on invariance of f-divergence and its application to speech recognition[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING,2010,58(7):3884-3890. |
APA | Yu Qiao,&Nobuaki Minematsu.(2010).A study on invariance of f-divergence and its application to speech recognition.IEEE TRANSACTIONS ON SIGNAL PROCESSING,58(7),3884-3890. |
MLA | Yu Qiao,et al."A study on invariance of f-divergence and its application to speech recognition".IEEE TRANSACTIONS ON SIGNAL PROCESSING 58.7(2010):3884-3890. |
入库方式: OAI收割
来源:深圳先进技术研究院
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。