• DocumentCode
    3163255
  • Title

    Modeling gender dependency in the Subspace GMM framework

  • Author

    Vu, Ngoc Thang ; Schultz, Tanja ; Povey, Daniel

  • Author_Institution
    Karlsruhe Inst. of Technol., Karlsruhe, Germany
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4345
  • Lastpage
    4348
  • Abstract
    The Subspace GMM acoustic model has both globally shared parameters and parameters specific to acoustic states, and this makes it possible to do various kinds of tying. In the past we have investigated sharing the global parameters among systems with distinct acoustic states; this can be useful in a multilingual setting. In the current paper we investigate the reverse idea: to have different global parameters for different acoustic conditions (gender, in this case) while sharing the acoustic-state-specific parameters. We experiment with modeling gender dependency in this way, and show Word Error Rate improvements on a range of tasks and comparable results to the Vocal Tract Length Normalization (VTLN)-like technique Exponential Transform (ET).
  • Keywords
    Gaussian processes; speech recognition; transforms; ET; Gaussian mixture model; VTLN; acoustic-state-specific parameter; exponential transform; gender dependency modeling; global shared parameter; multilingual setting; speech recognition; subspace GMM acoustic model; vocal tract length normalization; word error rate improvement; Acoustics; Adaptation models; Decoding; Hidden Markov models; Speech; Speech recognition; Training; Subspace Gaussian Mixture Models; gender dependency modeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288881
  • Filename
    6288881