DocumentCode :
3433467
Title :
Multi-band speech recognition in noisy environments
Author :
Okawa, Shigeki ; Bocchieri, Enrico ; Potamianos, Alexaridros
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
641
Abstract :
This paper presents a new approach for multi-band based automatic speech recognition (ASR). Previous work by Bourlard et al. (see Proc. Int. Conf. on Spoken Language Processing, Philadelphia, p.426-9, 1996) and Hermansky et al. (see Proc. Int. Conf. on Spoken Language Processing, Philadelphia, p.1579-82, 1996) suggests that multi-band ASR gives a more accurate recognition, especially in noisy acoustic environments, by combining the likelihoods of different frequency bands. Here we evaluate this likelihood recombination (LC) approach to multi-band ASR, and propose an alternative method, namely feature recombination (FC). In the FC system, after different acoustic analyzers are applied to each sub-band individually, a vector is composed by combining the sub-band features. The speech classifier then calculates the likelihood from the single vector. Thus, band-limited noise affects only a few of the feature components, as in the multi-band LC system, but, at the same time, all feature components are jointly modeled, as in conventional ASR. The experimental results show that the FC system can yield better performance than both the conventional ASR and the LC strategy for noisy speech
Keywords :
acoustic analysis; acoustic noise; acoustic signal processing; feature extraction; pattern classification; speech processing; speech recognition; acoustic analyzers; band-limited noise; experimental results; feature recombination; frequency bands; likelihood recombination; multi-band automatic speech recognition; noisy acoustic environments; noisy speech; performance; speech classifier; sub-band features; Acoustic noise; Additive noise; Automatic speech recognition; Filter bank; Frequency; Noise robustness; Psychoacoustic models; Speech enhancement; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675346
Filename :
675346
Link To Document :
بازگشت