Title :
Spatio-temporal processing for distant speech recognition
Author :
Low, Siow Yong ; Togneri, Roberto ; Nordholm, Sven
Author_Institution :
Western Australian Telecommun. Res. Inst., Crawley, WA, Australia
Abstract :
A new subband based front-end processor for speech recognition is presented. It integrates both spatial and temporal signal processing methods to enhance noisy signals as a means to reduce the mismatch problem in speech recognition. The approach makes use of the popular blind signal separation (BSS) to spatially separate the target signal from the interference. Due to the multipath/reverberant environment, BSS has its fundamental limitation in the separation quality. To overcome that, an adaptive noise canceller (ANC) is employed to perform further interference reduction. Experimental results show that even in an adverse environment, the proposed structure improves the word recognition rate (WRR) by 70% for the connected digit recognition task.
Keywords :
blind source separation; error statistics; interference suppression; reverberation; spatiotemporal phenomena; speech enhancement; speech recognition; ANC; BSS; WRR; adaptive noise canceller; blind signal separation; connected digit recognition task; distant speech recognition; interference reduction; mismatch problem; multipath/reverberant environment; noisy signal enhancement; separation quality; spatio-temporal processing; subband based front-end processor; word recognition rate; Array signal processing; Australia; Blind source separation; Filter bank; Interference; Microphone arrays; Noise cancellation; Signal processing; Speech recognition; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326157