Title :
Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition
Author :
Leutnant, Volker ; Krueger, A. ; Haeb-Umbach, Reinhold
Author_Institution :
Dept. of Commun. Eng., Univ. of Paderborn, Paderborn, Germany
Abstract :
In this contribution we extend a previously proposed Bayesian approach for the enhancement of reverberant logarithmic mel power spectral coefficients for robust automatic speech recognition to the additional compensation of background noise. A recently proposed observation model is employed whose time-variant observation error statistics are obtained as a side product of the inference of the a posteriori probability density function of the clean speech feature vectors. Further a reduction of the computational effort and the memory requirements are achieved by using a recursive formulation of the observation model. The performance of the proposed algorithms is first experimentally studied on a connected digits recognition task with artificially created noisy reverberant data. It is shown that the use of the time-variant observation error model leads to a significant error rate reduction at low signal-to-noise ratios compared to a time-invariant model. Further experiments were conducted on a 5000 word task recorded in a reverberant and noisy environment. A significant word error rate reduction was obtained demonstrating the effectiveness of the approach on real-world data.
Keywords :
Bayes methods; compensation; error statistics; reverberation; speech recognition; Bayesian feature enhancement; background noise; clean speech feature vectors; compensation; connected digits recognition task; error statistics; memory requirements; noisy reverberant data; posteriori probability density function; recursive formulation; reverberant logarithmic mel power spectral coefficients; robust automatic speech recognition; signal-to-noise ratios; time-variant observation; word error rate reduction; Robust automatic speech recognition; model-based Bayesian feature enhancement; observation model for reverberant and noisy speech; recursive observation model;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2013.2258013