• DocumentCode
    3424670
  • Title

    Modeling inter-speaker variability in speech recognition

  • Author

    Cloarec, Gwenael ; Jouvet, Denis

  • Author_Institution
    Div. R&D, France Telecom, Lannion
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4529
  • Lastpage
    4532
  • Abstract
    This paper details a method for taking into account variability influence in HMM-based speech recognition. The set of Gaussian components of the mixtures represents the entire acoustic space covered for all possible variability values. For each utterance to be recognized, the corresponding variability value is estimated and used to weight and/or constrain dynamically the acoustic space for each pdf. To do that, the weight coefficients of the Gaussian mixtures are set dependent on the variability value. As an example, the variability considered is the inter-speaker variability, and is handled through speaker classes. Taking into account for each utterance the four speaker classes that best match with the utterance signal leads to a significant word error rate reduction on a continuous speech recognition task, as compared to standard speaker-independent modeling.
  • Keywords
    Gaussian processes; error statistics; hidden Markov models; speech recognition; Gaussian components; Gaussian mixtures; HMM-based speech recognition; continuous speech recognition task; inter-speaker variability modeling; speaker-independent modeling; utterance signal; word error rate reduction; Acoustic testing; Bayesian methods; Decoding; Error analysis; Hidden Markov models; Loudspeakers; Performance evaluation; Research and development; Speech recognition; Telecommunications; Speech recognition; acoustic modeling; dynamic Bayesian network; inter-speaker variability; speaker class;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518663
  • Filename
    4518663