DocumentCode :
3636219
Title :
Subspace Gaussian Mixture Models for speech recognition
Author :
Daniel Povey;Lukśš Burget;Mohit Agarwal;Pinar Akyazi;Kai Feng;Arnab Ghoshal;Ondřej Glembek;Nagendra Kumar Goel;Martin Karafiát;Ariya Rastrow;Richard C. Rose;Petr Schwarz;Samuel Thomas
Author_Institution :
Microsoft Research, Redmond, WA, USA
fYear :
2010
fDate :
3/1/2010 12:00:00 AM
Firstpage :
4330
Lastpage :
4333
Abstract :
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the total parameter space. We call this a Subspace Gaussian Mixture Model (SGMM). Globally shared parameters define the subspace. This style of acoustic model allows for a much more compact representation and gives better results than a conventional modeling approach, particularly with smaller amounts of training data.
Keywords :
"Speech recognition","Hidden Markov models","Training data","Software tools","Acoustic testing","Software testing","Equations","Costs","Natural languages","Loudspeakers"
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
2379-190X
Type :
conf
DOI :
10.1109/ICASSP.2010.5495662
Filename :
5495662
Link To Document :
بازگشت