Title :
An integrated approach to robust speaker identification and speech recognition
Author :
Kwan, C. ; Yin, J. ; Ayhan, B. ; Chu, S. ; Liu, X. ; Puckett, K. ; Zhao, Y. ; Ho, K.C. ; Kruger, M. ; Sityar, I.
Author_Institution :
Signal Process. Inc., Rockville, MD
Abstract :
Conventional speaker identification and speech recognition algorithms cannot cope with noisy or multi-speaker environments. For example, IBM ViaVoice has low recognition rates when dictation is done in a noisy environment. To achieve high performance in speaker identification and speech recognition, we propose an integrated approach that takes every facet of the process into account. Here we summarize preliminary results from applying this integrated approach to robust speaker identification and speech recognition. A real-time stand-alone software prototype has been developed to evaluate the effectiveness of the approach.
Keywords :
speaker recognition; real-time stand-alone software prototype; robust speaker identification; speech recognition; Cepstral analysis; Hidden Markov models; Mel frequency cepstral coefficient; Microphones; Robustness; Signal processing; Signal to noise ratio; Speech enhancement; Speech recognition; Working environment noise
Conference_Title :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4634016