Title :
On the impact of signal preprocessing for robust distant speech recognition in adverse acoustic environments
Author :
Reindl, Klaus ; Zheng, Yuanhang ; Meier, Stefan ; Schwarz, Andreas ; Kellermann, Walter
Author_Institution :
Multimedia Commun. & Signal Process., Univ. of Erlangen-Nuremberg, Erlangen, Germany
Abstract :
In this contribution, a two-channel acoustic front-end for robust automatic speech recognition (ASR) in adverse acoustic environments is analyzed. The source signal extraction scheme combines a blocking matrix based on semi-blind source separation, which provides a continuously updated reference of all undesired components separated from the desired signal and its reflections, and a single-channel Wiener postfilter. The postfilter is directly derived from the obtained noise and interference reference signal and hence, generalizes well-known postfilter realizations. The proposed front-end and its integration into an ASR system are analyzed and evaluated with respect to keyword accuracy under reverberant conditions with unpredictable and nonstationary interferences, and for different target source distances. Evaluating a simplified front-end based on free-field assumptions, an ideal front-end, where knowledge about the true undesired components is assumed, and comparing the proposed scheme with the competitive approach of solely using multistyle training, demonstrates the importance of an adequate signal preprocessing for robust distant speech recognition.
Keywords :
acoustic signal processing; integration; matrix algebra; speech recognition; ASR system; adverse acoustic environments; blocking matrix; integration method; interference reference signal; multistyle training; postfilter realizations; robust automatic speech recognition; robust distant speech recognition; semi-blind source separation; signal preprocessing; single-channel Wiener postfilter; source signal extraction scheme; two-channel acoustic front-end; Acoustics; Microphones; Noise measurement; Signal to noise ratio; Speech; Speech recognition; Blind source extraction; robust distant speech recognition; speech enhancement;
Conference_Titel :
Signal Processing, Communication and Computing (ICSPCC), 2012 IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4673-2192-1
DOI :
10.1109/ICSPCC.2012.6335732