DocumentCode :
2647116
Title :
Robust speech recognition using a noise rejection approach
Author :
Khan, Emdad ; Levinson, Robert
Author_Institution :
Core Technol. Group, Nat. Semicond., CA, USA
fYear :
1998
fDate :
21-23 May 1998
Firstpage :
326
Lastpage :
335
Abstract :
In this paper, we explore some new approaches to improve speech recognition accuracy in a noisy environment. The key approaches taken are: (a) use no additional data (i.e. use only speakers data, no data for noise) for training and (b) no adaptation phase for noise. Instead of making adaptation in the recognition, preprocessing or both stages, we make a noise tolerant (rejection) speech recognition system where the system tries to reject noise automatically because of its inherent structure. We call our approach a noise rejection-based approach. Noise rejection is achieved by using multiple views and dynamic features of the input sequences. Multiple views exploit more information from the available data that is used for training multiple HMMs (hidden Markov models). This makes the training process simpler, faster and avoids the need to use a noise database, which is often difficult to obtain. The dynamic features (added to the HMM using vector emission probabilities) add more information about the input speech during training. Since the values of dynamic features of noise are usually much smaller than that of the speech signal, it helps reject the noise during recognition. Also, multiple views of the input sequence are applied to multiple HMMs during recognition and the outcome of the multiple HMMs are combined using maximum evidence criterion. Our tests show very encouraging results. We also incorporate higher level decision making to more judiciously combine the outcomes of the multiple HMMs to further improve the accuracy. For this, we use meta reasoning to identify the problem complexity and accordingly allocate resources
Keywords :
hidden Markov models; noise; speech recognition; hidden Markov models; high-level decision-making; input sequences; maximum evidence criterion; meta reasoning; multiple HMM training; noise rejection approach; noise tolerant speech recognition system; problem complexity; resource allocation; robust speech recognition; vector emission probabilities; Automatic speech recognition; Decision making; Hidden Markov models; Noise robustness; Phase noise; Spatial databases; Speech enhancement; Speech recognition; Testing; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligence and Systems, 1998. Proceedings., IEEE International Joint Symposia on
Conference_Location :
Rockville, MD
Print_ISBN :
0-8186-8548-4
Type :
conf
DOI :
10.1109/IJSIS.1998.685469
Filename :
685469
Link To Document :
بازگشت