Title :
Speech Enhancement Based on Minimum Mean-Square Error Estimation and Supergaussian Priors
Author_Institution :
Inst. of Commun. Acoust., Ruhr-Univ. Bochum, Germany
Abstract :
This paper presents a class of minimum mean-square error (MMSE) estimators for enhancing short-time spectral coefficients of a noisy speech signal. In contrast to most of the presently used methods, we do not assume that the spectral coefficients of the noise or of the clean speech signal obey a (complex) Gaussian probability density. We derive analytical solutions to the problem of estimating discrete Fourier transform (DFT) coefficients in the MMSE sense when the prior probability density function of the clean speech DFT coefficients can be modeled by a complex Laplace or by a complex bilateral Gamma density. The probability density function of the noise DFT coefficients may be modeled either by a complex Gaussian or by a complex Laplacian density. Compared to algorithms based on the Gaussian assumption, such as the Wiener filter or the Ephraim and Malah (1984) MMSE short-time spectral amplitude estimator, the estimators based on these supergaussian densities deliver an improved signal-to-noise ratio.
Keywords :
Gaussian processes; Laplace transforms; discrete Fourier transforms; least mean squares methods; speech enhancement; Gaussian probability density; bilateral Gamma density; complex Laplace; discrete Fourier transform; minimum mean square error estimation; noisy speech signal; short time spectral coefficients; speech enhancement; Amplitude estimation; Discrete Fourier transforms; Estimation error; Gaussian noise; Laplace equations; Probability density function; Signal to noise ratio; Speech analysis; Speech enhancement; Wiener filter; Minimum mean-square error (MMSE) spectral estimation; minimum statistics noise power estimation; noise reduction; supergaussian (leptokurtic) densities;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
DOI :
10.1109/TSA.2005.851927