Title :
An MMSE Estimator for Speech Enhancement Under a Combined Stochastic–Deterministic Speech Model
Author :
Hendriks, Richard C. ; Heusdens, Richard ; Jensen, Jesper
Author_Institution :
Dept. of Mediamatics, Delft Univ. of Technol.
Abstract :
Although many discrete Fourier transform (DFT) domain-based speech enhancement methods rely on stochastic models to derive clean speech estimators, like the Gaussian and Laplace distribution, certain speech sounds clearly show a more deterministic character. In this paper, we study the use of a deterministic model in combination with the well-known stochastic models for speech enhancement. We derive a minimum mean-square error (MMSE) estimator under a combined stochastic-deterministic speech model with speech presence uncertainty and show that for different distributions of the DFT coefficients the combined stochastic-deterministic speech model leads to improved performance of approximately 0.8 dB segmental signal-to-noise ratio (SNR) over the use of a stochastic model alone. Evaluation with perceptual evaluation of speech quality (PESQ) shows performance improvements of approximately 0.15 on an MOS scale
Keywords :
discrete Fourier transforms; least mean squares methods; speech enhancement; stochastic processes; DFT; Laplace distribution; MMSE estimator; SNR; combined stochastic-deterministic speech model; discrete Fourier transform; minimum mean-square error; perceptual evaluation of speech quality; signal-to-noise ratio; speech enhancement; Acoustic noise; Degradation; Discrete Fourier transforms; Materials science and technology; Signal processing algorithms; Speech enhancement; Speech processing; Stochastic processes; Stochastic resonance; Uncertainty; Deterministic speech model; minimum mean-square error (MMSE); speech enhancement;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2006.881666