DocumentCode :
1118123
Title :
MAP Estimators for Speech Enhancement Under Normal and Rayleigh Inverse Gaussian Distributions
Author :
Hendriks, Richard C. ; Martin, Rainer
Author_Institution :
Dept. of Mediamatics, Delft Univ. of Technol.
Volume :
15
Issue :
3
fYear :
2007
fDate :
3/1/2007 12:00:00 AM
Firstpage :
918
Lastpage :
927
Abstract :
This paper presents a new class of estimators for speech enhancement in the discrete Fourier transform (DFT) domain, where we consider a multidimensional normal inverse Gaussian (MNIG) distribution for the speech DFT coefficients. The MNIG distribution can model a wide range of processes, from heavy-tailed to less heavy-tailed processes. Under the MNIG distribution complex DFT and amplitude estimators are derived. In contrast to other estimators, the suppression characteristics of the MNIG-based estimators can be adapted online to the underlying distribution of the speech DFT coefficients. Compared to noise suppression algorithms based on preselected super-Gaussian distributions, the MNIG-based complex DFT and amplitude estimators lead to a performance improvement in terms of segmental signal-to-noise ratio (SNR) in the order of 0.3 to 0.6 dB and 0.2 to 0.6 dB, respectively
Keywords :
Gaussian distribution; discrete Fourier transforms; maximum likelihood estimation; signal denoising; speech enhancement; 0.2 to 0.6 dB; 0.3 to 0.6 dB; MAP estimators; Rayleigh inverse Gaussian distributions; amplitude estimation; discrete Fourier transform; multidimensional normal inverse Gaussian distributions; noise suppression algorithms; segmental signal-to-noise ratio; speech DFT coefficients; speech enhancement; super-Gaussian distributions; Amplitude estimation; Discrete Fourier transforms; Frequency estimation; Gaussian distribution; Multidimensional systems; Noise level; Shape; Signal to noise ratio; Speech enhancement; Speech processing; Maximum a posteriori (MAP) estimation; multidimensional normal inverse Gaussian (MNIG) distribution; speech enhancement;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2006.889753
Filename :
4100682
Link To Document :
بازگشت