• DocumentCode
    3347254
  • Title

    Improving environmental robustness in large vocabulary speech recognition

  • Author

    Woodland, P.C. ; Gales, M.J.F. ; Pye, D.

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    65
  • Abstract
    This paper describes techniques to improve the robustness of the HTK large vocabulary speech recognition system to non-ideal acoustic environments. The primary methods are single-pass retraining using stereo training data; parallel model combination which combines HMMs trained on clean data with estimates of convolutional and additive noise; and maximum likelihood linear regression which estimates a set of linear transformations of the model parameters to the current conditions. Experiments are reported on both the 1994 ARPA CSR S5 (alternate microphones) and S10 (additive noise) spoken tasks and the 1995 ARPA CSR H3 task (multiple unknown microphones). The HTK system yielded the lowest error rates in both the H3-P0 and HS-C0 tests
  • Keywords
    hidden Markov models; maximum likelihood estimation; noise; noise pollution; speech recognition; 1994 ARPA CSR S5; 1995 ARPA CSR H3 task; H3-P0 tests; HMM; HS-C0 tests; HTK system; S10 spoken tasks; additive noise; alternate microphones; clean data; convolutional noise; environmental robustness; error rates; experiments; large vocabulary speech recognition; linear transformations; maximum likelihood linear regression; nonideal acoustic environments; parallel model combination; parameter estimation; single pass retraining; stereo training data; Additive noise; Error analysis; Hidden Markov models; Maximum likelihood linear regression; Microphones; Noise robustness; Speech recognition; System testing; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.540291
  • Filename
    540291