Environmental robustness in automatic speech recognition

Author

Acero, Alejandro ; Stern, Richard

Author_Institution

Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear

1990

fDate

3-6 Apr 1990

Firstpage

849

Abstract

Initial efforts to make Sphinx, a continuous-speech speaker-independent recognition system, robust to changes in the environment are reported. To deal with differences in noise level and spectral tilt between close-talking and desk-top microphones, two novel methods based on additive corrections in the cepstral domain are proposed. In the first algorithm, the additive correction depends on the instantaneous SNR of the signal. In the second technique, expectation-maximization techniques are used to best match the cepstral vectors of the input utterances to the ensemble of codebook entries representing a standard acoustical ambience. Use of the algorithms dramatically improves recognition accuracy when the system is tested on a microphone other than the one on which it was trained

Keywords

interference suppression; microphones; speech recognition; Sphinx; additive correction; cepstral domain; close-talking microphones; continuous-speech speaker-independent recognition system; desk-top microphones; expectation-maximization techniques; instantaneous SNR; noise level; recognition accuracy; spectral tilt; Acoustic testing; Additive noise; Automatic speech recognition; Cepstral analysis; Code standards; Microphones; Noise level; Noise robustness; Signal to noise ratio; System testing;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on

Conference_Location

Albuquerque, NM

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.1990.115971

Filename

115971