DocumentCode
3433467
Title
Multi-band speech recognition in noisy environments
Author
Okawa, Shigeki ; Bocchieri, Enrico ; Potamianos, Alexaridros
Volume
2
fYear
1998
fDate
12-15 May 1998
Firstpage
641
Abstract
This paper presents a new approach for multi-band based automatic speech recognition (ASR). Previous work by Bourlard et al. (see Proc. Int. Conf. on Spoken Language Processing, Philadelphia, p.426-9, 1996) and Hermansky et al. (see Proc. Int. Conf. on Spoken Language Processing, Philadelphia, p.1579-82, 1996) suggests that multi-band ASR gives a more accurate recognition, especially in noisy acoustic environments, by combining the likelihoods of different frequency bands. Here we evaluate this likelihood recombination (LC) approach to multi-band ASR, and propose an alternative method, namely feature recombination (FC). In the FC system, after different acoustic analyzers are applied to each sub-band individually, a vector is composed by combining the sub-band features. The speech classifier then calculates the likelihood from the single vector. Thus, band-limited noise affects only a few of the feature components, as in the multi-band LC system, but, at the same time, all feature components are jointly modeled, as in conventional ASR. The experimental results show that the FC system can yield better performance than both the conventional ASR and the LC strategy for noisy speech
Keywords
acoustic analysis; acoustic noise; acoustic signal processing; feature extraction; pattern classification; speech processing; speech recognition; acoustic analyzers; band-limited noise; experimental results; feature recombination; frequency bands; likelihood recombination; multi-band automatic speech recognition; noisy acoustic environments; noisy speech; performance; speech classifier; sub-band features; Acoustic noise; Additive noise; Automatic speech recognition; Filter bank; Frequency; Noise robustness; Psychoacoustic models; Speech enhancement; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675346
Filename
675346
Link To Document