DocumentCode
417301
Title
Spatio-temporal processing for distant speech recognition
Author
Low, Siow Yong ; Togneri, Roberto ; Nordholm, Sven
Author_Institution
Western Australian Telecommun. Res. Inst., Crawley, WA, Australia
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
A new subband-based front-end processor for speech recognition is presented. It integrates spatial and temporal signal processing methods to enhance noisy signals and thereby reduce the mismatch problem in speech recognition. The approach uses blind signal separation (BSS) to spatially separate the target signal from the interference. In a multipath/reverberant environment, however, the separation quality of BSS is fundamentally limited. To overcome this, an adaptive noise canceller (ANC) is employed to further reduce the interference. Experimental results show that, even in an adverse environment, the proposed structure improves the word recognition rate (WRR) by 70% for the connected digit recognition task.
Keywords
blind source separation; error statistics; interference suppression; reverberation; spatiotemporal phenomena; speech enhancement; speech recognition; ANC; BSS; WRR; adaptive noise canceller; blind signal separation; connected digit recognition task; distant speech recognition; interference reduction; mismatch problem; multipath/reverberant environment; noisy signal enhancement; separation quality; spatio-temporal processing; subband based front-end processor; word recognition rate; Array signal processing; Australia; Blind source separation; Filter bank; Interference; Microphone arrays; Noise cancellation; Signal processing; Speech recognition; Working environment noise
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326157
Filename
1326157