DocumentCode
2926449
Title
Environmental robustness in automatic speech recognition
Author
Acero, Alejandro ; Stern, Richard
Author_Institution
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
1990
fDate
3-6 Apr 1990
Firstpage
849
Abstract
Initial efforts to make Sphinx, a continuous-speech speaker-independent recognition system, robust to changes in the environment are reported. To deal with differences in noise level and spectral tilt between close-talking and desk-top microphones, two novel methods based on additive corrections in the cepstral domain are proposed. In the first algorithm, the additive correction depends on the instantaneous SNR of the signal. In the second technique, expectation-maximization techniques are used to best match the cepstral vectors of the input utterances to the ensemble of codebook entries representing a standard acoustical ambience. Use of the algorithms dramatically improves recognition accuracy when the system is tested on a microphone other than the one on which it was trained
Keywords
interference suppression; microphones; speech recognition; Sphinx; additive correction; cepstral domain; close-talking microphones; continuous-speech speaker-independent recognition system; desk-top microphones; expectation-maximization techniques; instantaneous SNR; noise level; recognition accuracy; spectral tilt; Acoustic testing; Additive noise; Automatic speech recognition; Cepstral analysis; Code standards; Microphones; Noise level; Noise robustness; Signal to noise ratio; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location
Albuquerque, NM
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1990.115971
Filename
115971
Link To Document