Title :
Subspace Gaussian Mixture Models for speech recognition
Author :
Daniel Povey;Lukśš Burget;Mohit Agarwal;Pinar Akyazi;Kai Feng;Arnab Ghoshal;Ondřej Glembek;Nagendra Kumar Goel;Martin Karafiát;Ariya Rastrow;Richard C. Rose;Petr Schwarz;Samuel Thomas
Author_Institution :
Microsoft Research, Redmond, WA, USA
fDate :
3/1/2010 12:00:00 AM
Abstract :
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the total parameter space. We call this a Subspace Gaussian Mixture Model (SGMM). Globally shared parameters define the subspace. This style of acoustic model allows for a much more compact representation and gives better results than a conventional modeling approach, particularly with smaller amounts of training data.
Keywords :
"Speech recognition","Hidden Markov models","Training data","Software tools","Acoustic testing","Software testing","Equations","Costs","Natural languages","Loudspeakers"
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
2379-190X
DOI :
10.1109/ICASSP.2010.5495662