DocumentCode :
1070953
Title :
Online Noise Estimation Using Stochastic-Gain HMM for Speech Enhancement
Author :
Zhao, David Y. ; Kleijn, W. Bastiaan ; Ypma, Alexander ; de Vries, Bert
Author_Institution :
Skype Sweden AB, Stockholm
Volume :
16
Issue :
4
fYear :
2008
fDate :
5/1/2008 12:00:00 AM
Firstpage :
835
Lastpage :
846
Abstract :
We propose a noise estimation algorithm for single-channel noise suppression in dynamic noisy environments. A stochastic-gain hidden Markov model (SG-HMM) is used to model the statistics of nonstationary noise with time-varying energy. The noise model is adaptive and the model parameters are estimated online from noisy observations using a recursive estimation algorithm. The parameter estimation is derived for the maximum-likelihood criterion and the algorithm is based on the recursive expectation maximization (EM) framework. The proposed method facilitates continuous adaptation to changes of both noise spectral shapes and noise energy levels, e.g., due to movement of the noise source. Using the estimated noise model, we also develop an estimator of the noise power spectral density (PSD) based on recursive averaging of estimated noise sample spectra. We demonstrate that the proposed scheme achieves more accurate estimates of the noise model and noise PSD, and as part of a speech enhancement system facilitates a lower level of residual noise.
Keywords :
expectation-maximisation algorithm; hidden Markov models; interference suppression; recursive estimation; spectral analysis; speech enhancement; dynamic noisy environment; maximum-likelihood criterion; noise energy level; noise spectral shape; nonstationary noise; online noise estimation; parameter estimation; power spectral density; recursive estimation; recursive expectation maximization; single-channel noise suppression; speech enhancement; stochastic-gain hidden Markov model; Gain modeling; noise estimation; noise model adaptation; noise suppression; stochastic-gain hidden Markov model (SG-HMM);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.916055
Filename :
4453864
Link To Document :
بازگشت