• DocumentCode
    1135381
  • Title

    Speech Enhancement Based on Minimum Mean-Square Error Estimation and Supergaussian Priors

  • Author

    Martin, Rainer

  • Author_Institution
    Inst. of Commun. Acoust., Ruhr-Univ. Bochum, Germany
  • Volume
    13
  • Issue
    5
  • fYear
    2005
  • Firstpage
    845
  • Lastpage
    856
  • Abstract
    This paper presents a class of minimum mean-square error (MMSE) estimators for enhancing short-time spectral coefficients of a noisy speech signal. In contrast to most of the presently used methods, we do not assume that the spectral coefficients of the noise or of the clean speech signal obey a (complex) Gaussian probability density. We derive analytical solutions to the problem of estimating discrete Fourier transform (DFT) coefficients in the MMSE sense when the prior probability density function of the clean speech DFT coefficients can be modeled by a complex Laplace or by a complex bilateral Gamma density. The probability density function of the noise DFT coefficients may be modeled either by a complex Gaussian or by a complex Laplacian density. Compared to algorithms based on the Gaussian assumption, such as the Wiener filter or the Ephraim and Malah (1984) MMSE short-time spectral amplitude estimator, the estimators based on these supergaussian densities deliver an improved signal-to-noise ratio.
  • Keywords
    Gaussian processes; Laplace transforms; discrete Fourier transforms; least mean squares methods; speech enhancement; Gaussian probability density; bilateral Gamma density; complex Laplace; discrete Fourier transform; minimum mean square error estimation; noisy speech signal; short time spectral coefficients; speech enhancement; Amplitude estimation; Discrete Fourier transforms; Estimation error; Gaussian noise; Laplace equations; Probability density function; Signal to noise ratio; Speech analysis; Speech enhancement; Wiener filter; Minimum mean-square error (MMSE) spectral estimation; minimum statistics noise power estimation; noise reduction; supergaussian (leptokurtic) densities;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2005.851927
  • Filename
    1495468