Title :
Speech Recognition in Multisource Reverberant Environments with Binaural Inputs
Author :
Roman, Nicoleta ; Srinivasan, Soundararajan ; Wang, DeLiang
Author_Institution :
Dept. of Math., Stat. & Comput. Sci., Ohio State Univ., Lima, OH
Abstract :
We present a binaural solution to robust speech recognition in multi-source reverberant environments. We employ the notion of an ideal time-frequency binary mask, which selects the target if it is stronger than the interference in a local time-frequency (T-F) unit. Our system estimates this ideal binary mask at the output of a target cancellation module implemented using adaptive filtering. This mask is used in conjunction with a missing-data algorithm to decode the target utterance. A systematic evaluation in terms of automatic speech recognition (ASR) performance shows substantial improvements over the baseline performance and better results over related two-microphone approaches
Keywords :
adaptive filters; decoding; filtering theory; speech coding; speech recognition; time-frequency analysis; adaptive filtering; automatic speech recognition; binaural inputs; missing-data algorithm; multisource reverberant environments; target cancellation module; target utterance decoding; time-frequency binary mask; Acoustic noise; Array signal processing; Automatic speech recognition; Computer science; Independent component analysis; Interference; Microphones; Robustness; Speech recognition; Target recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660019