مرکز منطقه ای اطلاع رساني علوم و فناوري - Subspace Gaussian Mixture Models for speech recognition

DocumentCode :

3636219

Title :

Subspace Gaussian Mixture Models for speech recognition

Author :

Daniel Povey;Lukśš Burget;Mohit Agarwal;Pinar Akyazi;Kai Feng;Arnab Ghoshal;Ondřej Glembek;Nagendra Kumar Goel;Martin Karafiát;Ariya Rastrow;Richard C. Rose;Petr Schwarz;Samuel Thomas

Author_Institution :

Microsoft Research, Redmond, WA, USA

fYear :

2010

fDate :

3/1/2010 12:00:00 AM

Firstpage :

4330

Lastpage :

4333

Abstract :

We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the total parameter space. We call this a Subspace Gaussian Mixture Model (SGMM). Globally shared parameters define the subspace. This style of acoustic model allows for a much more compact representation and gives better results than a conventional modeling approach, particularly with smaller amounts of training data.

Keywords :

"Speech recognition","Hidden Markov models","Training data","Software tools","Acoustic testing","Software testing","Equations","Costs","Natural languages","Loudspeakers"

Publisher :

ieee

Conference_Titel :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

2379-190X

Type :

conf

DOI :

10.1109/ICASSP.2010.5495662

Filename :

5495662

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3636219