• DocumentCode
    2926449
  • Title

    Environmental robustness in automatic speech recognition

  • Author

    Acero, Alejandro ; Stern, Richard

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    1990
  • fDate
    3-6 Apr 1990
  • Firstpage
    849
  • Abstract
    Initial efforts to make Sphinx, a continuous-speech speaker-independent recognition system, robust to changes in the environment are reported. To deal with differences in noise level and spectral tilt between close-talking and desk-top microphones, two novel methods based on additive corrections in the cepstral domain are proposed. In the first algorithm, the additive correction depends on the instantaneous SNR of the signal. In the second technique, expectation-maximization techniques are used to best match the cepstral vectors of the input utterances to the ensemble of codebook entries representing a standard acoustical ambience. Use of the algorithms dramatically improves recognition accuracy when the system is tested on a microphone other than the one on which it was trained
  • Keywords
    interference suppression; microphones; speech recognition; Sphinx; additive correction; cepstral domain; close-talking microphones; continuous-speech speaker-independent recognition system; desk-top microphones; expectation-maximization techniques; instantaneous SNR; noise level; recognition accuracy; spectral tilt; Acoustic testing; Additive noise; Automatic speech recognition; Cepstral analysis; Code standards; Microphones; Noise level; Noise robustness; Signal to noise ratio; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
  • Conference_Location
    Albuquerque, NM
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1990.115971
  • Filename
    115971