DocumentCode
705829
Title
Speech enhancement based on Rayleigh mixture modeling of speech spectral amplitude distributions
Author
Erkelens, J.S. ; Jensen, J. ; Heusdens, R.
Author_Institution
Dept. of Mediamatics, Delft Univ. of Technol., Delft, Netherlands
fYear
2007
fDate
3-7 Sept. 2007
Firstpage
65
Lastpage
69
Abstract
DFT-based speech enhancement algorithms typically rely on a statistical model of the spectral amplitudes of the noise-free speech signal. It has been shown in the literature recently that the speech spectral amplitude distributions, conditional on estimated a priori SNR, may differ significantly from the traditional Gaussian model and are better described by super-Gaussian probability density functions. We show that these conditional distributions can be accurately approximated by a mixture of Rayleigh distributions. The MMSE amplitude estimators based on Rayleigh Mixture Models perform at least as well as the estimators based on super-Gaussian models. Furthermore, the proposed Rayleigh Mixture Models allow for derivation of closed-form estimators minimizing other perceptually relevant distortion measures, which may be difficult for other models.
Keywords
Gaussian processes; discrete Fourier transforms; least mean squares methods; mixture models; probability; spectral analysis; speech enhancement; DFT-based speech enhancement algorithm; MMSE amplitude estimator; Rayleigh distribution; Rayleigh mixture modeling; a priori SNR; closed-form estimator; discrete Fourier transform; minimum mean square error; noise-free speech signal; signal-noise ratio; speech spectral amplitude distribution; statistical model; super-Gaussian model; super-Gaussian probability density function; Discrete Fourier transforms; Estimation; Histograms; Signal to noise ratio; Speech; Speech enhancement;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2007 15th European
Conference_Location
Poznan
Print_ISBN
978-839-2134-04-6
Type
conf
Filename
7098765
Link To Document