Speech enhancement based on Rayleigh mixture modeling of speech spectral amplitude distributions

Author

Erkelens, J.S. ; Jensen, J. ; Heusdens, R.

Author_Institution

Dept. of Mediamatics, Delft Univ. of Technol., Delft, Netherlands

fYear

2007

fDate

3-7 Sept. 2007

Firstpage

65

Lastpage

69

Abstract

DFT-based speech enhancement algorithms typically rely on a statistical model of the spectral amplitudes of the noise-free speech signal. It has been shown in the literature recently that the speech spectral amplitude distributions, conditional on estimated a priori SNR, may differ significantly from the traditional Gaussian model and are better described by super-Gaussian probability density functions. We show that these conditional distributions can be accurately approximated by a mixture of Rayleigh distributions. The MMSE amplitude estimators based on Rayleigh Mixture Models perform at least as well as the estimators based on super-Gaussian models. Furthermore, the proposed Rayleigh Mixture Models allow for derivation of closed-form estimators minimizing other perceptually relevant distortion measures, which may be difficult for other models.

Keywords

Gaussian processes; discrete Fourier transforms; least mean squares methods; mixture models; probability; spectral analysis; speech enhancement; DFT-based speech enhancement algorithm; MMSE amplitude estimator; Rayleigh distribution; Rayleigh mixture modeling; a priori SNR; closed-form estimator; discrete Fourier transform; minimum mean square error; noise-free speech signal; signal-noise ratio; speech spectral amplitude distribution; statistical model; super-Gaussian model; super-Gaussian probability density function; Discrete Fourier transforms; Estimation; Histograms; Signal to noise ratio; Speech; Speech enhancement;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing Conference, 2007 15th European

Conference_Location

Poznan

Print_ISBN

978-839-2134-04-6

Type

conf

Filename

7098765