Title :
Robust speech recognition with speaker localization by a microphone array
Author :
Yamada, Takeshi ; Nakamura, Satoshi ; Shikano, Kiyohiro
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
Abstract :
This paper proposes robust speech recognition with speaker localization by an arrayed microphone (SLAM) to realize a hands-free speech interface in noisy environments. In order to localize a speaker direction accurately in low SNR conditions, a speaker localization algorithm based on extracting pitch harmonics is introduced. To evaluate the performance of the proposed system, speech recognition experiments are carried out both in computer simulation and real environments. These results show that the proposed system attains much higher speech recognition performance than that of a single microphone not only in computer simulation but also in real environments
Keywords :
acoustic noise; harmonics; microphones; natural language interfaces; performance evaluation; speech recognition; computer simulation; hands-free speech interface; low SNR conditions; microphone array; noisy environments; performance; pitch harmonics extraction; robust speech recognition; speaker direction; speaker localization; Acoustic noise; Computer simulation; Delay; Microphone arrays; Robustness; Signal processing algorithms; Simultaneous localization and mapping; Speech enhancement; Speech recognition; Working environment noise;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607855