Title :
Multichannel feature enhancement in distributed microphone arrays for robust distant speech recognition in smart rooms
Author :
Mirsamadi, Seyedmahdad ; Hansen, John H. L.
Author_Institution :
Center for Robust Speech Systems, University of Texas at Dallas, Richardson, TX, USA
Abstract :
Room reverberation and environmental noise present challenges for integration of speech recognition technology in smart room applications. We present a multichannel enhancement framework for distributed microphone arrays to mitigate the effects of both additive noise and reverberation on distant-talking microphones. The proposed approach uses techniques of nonnegative matrix and tensor factorization to achieve both noise suppression (through sparse representation of speech spectra) and dereverberation (through decomposition of magnitude spectra into convolutive components). Results of ASR experiments on the DIRHA-GRID corpus confirm that the proposed approach can achieve relative improvements of up to +20% in recognition accuracy in highly reverberant and noisy conditions using clean-trained models.
Keywords :
acoustic signal processing; matrix decomposition; microphone arrays; noise abatement; reverberation; speech enhancement; speech recognition; tensors; ASR experiments; DIRHA-GRID corpus; additive noise mitigation; automatic speech recognition; distant speech recognition; distant-talking microphones; distributed microphone arrays; multichannel feature enhancement; noise suppression; nonnegative matrix techniques; reverberation mitigation; smart rooms; tensor factorization techniques; Dictionaries; Microphones; Noise; Reverberation; Spectrogram; Speech; Speech recognition; distributed microphone array; nonnegative matrix/tensor factorization
Conference_Title :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078626