DocumentCode :
353542
Title :
Comparing acoustic features for robust ASR in fixed and cellular network applications
Author :
de Wet, Febe ; Cranen, Bert ; De Veth, Johan ; Boves, L.
Author_Institution :
Dept. of Language & Speech, Nijmegen Univ., Netherlands
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1415
Abstract :
Within the context of automatic speech recognition (ASR) applications for telephony, we investigate the acoustic preprocessing issues that are at stake in going from the fixed line to the cellular network. Because the spectral representation used in enhanced full rate GSM is linear prediction, we investigate the relative advantages and drawbacks of conventional mel-frequency cepstral coefficient (MFCC) parameters derived from a non-parametric fast Fourier transform (FFT) and MFCC parameters derived from a linear predictive coding (LPC) spectral estimate. Robust formant parameters, also derived from an LPC description of the spectrum, are studied as an alternative to MFCCs. Within the framework of connected digit recognition based on hidden Markov models, ASR performance was measured for clean conditions, as well as for three different additive noise conditions. In addition, the performance of a conventional recognition procedure was compared with the performance of an ASR system based on our acoustic backing-off implementation of missing feature theory (MFT)
Keywords :
acoustic noise; cellular radio; cepstral analysis; fast Fourier transforms; hidden Markov models; linear predictive coding; speech coding; speech recognition; LPC spectral estimate; MFCC parameter; acoustic backing-off implementation; acoustic features; acoustic preprocessing; additive noise conditions; automatic speech recognition; cellular network applications; connected digit recognition; enhanced full rate GSM; fixed network applications; hidden Markov models; linear predictive coding spectral estimate; mel-frequency cepstral coefficient parameters; missing feature theory; nonparametric fast Fourier transform; robust ASR; robust formant parameters; spectral representation; Acoustic applications; Automatic speech recognition; Cepstral analysis; Fast Fourier transforms; GSM; Land mobile radio cellular systems; Linear predictive coding; Mel frequency cepstral coefficient; Robustness; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.861856
Filename :
861856
Link To Document :
بازگشت