• DocumentCode
    417167
  • Title

    Enrollment in low-resource speech recognition systems

  • Author

    Deligne, Sabine ; Dharanipragada, Satya

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We consider the problem of enrollment for low-resource speech recognition systems designed for noisy environments. Noise robustness concerns, memory and computational constraints, along with the use of compact acoustic models for fast Gaussian computation, make adaptation especially challenging. We derive a maximum a posteriori (MAP) algorithm especially designed for the fast off-line adaptation of these compact acoustic models. It requires less computation and memory than standard feature-space maximum likelihood linear regression (FMLLR) which is another technique well suited for compact acoustic models. In our experiments of speaker enrollment for speech recognition in the car, we present a computationally efficient procedure to simulate noisy conditions with the adaptation data. In these experiments, MAP compares favorably with FMLLR in terms of recognition accuracy. Besides, combining FMLLR and MAP significantly outperforms each technique individually, thus providing an efficient alternative for systems with larger resources.
  • Keywords
    acoustic noise; acoustic signal processing; maximum likelihood estimation; random noise; speech recognition; Gaussian computation; MAP algorithm; compact acoustic models; enrollment; feature-space MLLR; feature-space maximum likelihood linear regression; low-resource speech recognition systems; maximum a posteriori algorithm; Acoustic noise; Algorithm design and analysis; Computational modeling; Gaussian noise; Loudspeakers; Maximum likelihood linear regression; Memory management; Noise robustness; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325992
  • Filename
    1325992