• DocumentCode
    3636219
  • Title

    Subspace Gaussian Mixture Models for speech recognition

  • Author

    Daniel Povey;Lukśš Burget;Mohit Agarwal;Pinar Akyazi;Kai Feng;Arnab Ghoshal;Ondřej Glembek;Nagendra Kumar Goel;Martin Karafiát;Ariya Rastrow;Richard C. Rose;Petr Schwarz;Samuel Thomas

  • Author_Institution
    Microsoft Research, Redmond, WA, USA
  • fYear
    2010
  • fDate
    3/1/2010 12:00:00 AM
  • Firstpage
    4330
  • Lastpage
    4333
  • Abstract
    We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the total parameter space. We call this a Subspace Gaussian Mixture Model (SGMM). Globally shared parameters define the subspace. This style of acoustic model allows for a much more compact representation and gives better results than a conventional modeling approach, particularly with smaller amounts of training data.
  • Keywords
    "Speech recognition","Hidden Markov models","Training data","Software tools","Acoustic testing","Software testing","Equations","Costs","Natural languages","Loudspeakers"
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    2379-190X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495662
  • Filename
    5495662