Title :
Subband-based speech recognition
Author :
Bourlard, Hervé ; Dupont, Stéphane
Author_Institution :
Faculte Polytech. de Mons, Belgium
Abstract :
In the framework of hidden Markov models (HMM) or hybrid HMM/artificial neural network (ANN) systems, we present a new approach towards automatic speech recognition (ASR). The general idea is to divide up the full frequency band (represented in terms of critical bands) into several subbands, compute phone probabilities for each subband on the basis of subband acoustic features, perform dynamic programming independently for each band, and merge the subband recognizers (recombining the respective, possibly weighted, scores) at some segmental level corresponding to temporal anchor points. The results presented in this paper confirm some preliminary tests reported earlier. On both isolated word and continuous speech tasks, it is indeed shown that even using quite simple recombination strategies, this subband ASR approach can yield at least comparable performance on clean speech while providing better robustness in the case of narrowband noise
Keywords :
dynamic programming; hidden Markov models; neural nets; speech recognition; ASR; automatic speech recognition; clean speech; continuous speech tasks; critical bands; dynamic programming; frequency band; hidden Markov models; hybrid HMM/artificial neural network; isolated word tasks; narrowband noise; phone probabilities; recombination strategies; robustness; subband acoustic features; subband-based speech recognition; temporal anchor points; Acoustic testing; Artificial neural networks; Automatic speech recognition; Dynamic programming; Frequency conversion; Hidden Markov models; Narrowband; Noise robustness; Speech enhancement; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596172