DocumentCode :
417301
Title :
Spatio-temporal processing for distant speech recognition
Author :
Low, Siow Yong ; Togneri, Roberto ; Nordholm, Sven
Author_Institution :
Western Australian Telecommun. Res. Inst., Crawley, WA, Australia
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
A new subband based front-end processor for speech recognition is presented. It integrates both spatial and temporal signal processing methods to enhance noisy signals as a means to reduce the mismatch problem in speech recognition. The approach makes use of the popular blind signal separation (BSS) to spatially separate the target signal from the interference. Due to the multipath/reverberant environment, BSS has its fundamental limitation in the separation quality. To overcome that, an adaptive noise canceller (ANC) is employed to perform further interference reduction. Experimental results show that even in an adverse environment, the proposed structure improves the word recognition rate (WRR) by 70% for the connected digit recognition task.
Keywords :
blind source separation; error statistics; interference suppression; reverberation; spatiotemporal phenomena; speech enhancement; speech recognition; ANC; BSS; WRR; adaptive noise canceller; blind signal separation; connected digit recognition task; distant speech recognition; interference reduction; mismatch problem; multipath/reverberant environment; noisy signal enhancement; separation quality; spatio-temporal processing; subband based front-end processor; word recognition rate; Array signal processing; Australia; Blind source separation; Filter bank; Interference; Microphone arrays; Noise cancellation; Signal processing; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326157
Filename :
1326157
Link To Document :
بازگشت