Title :
Time-domain approach using multiple Kalman filters and EM algorithm to speech enhancement with nonstationary noise
Author :
Lee, Ki Yong ; Jung, Souhwan
Author_Institution :
Sch. of Electron. Eng., Soongsil Univ., Seoul, South Korea
fDate :
5/1/2000 12:00:00 AM
Abstract :
A time-domain approach for enhancing speech signals degraded by statistically independent additive nonstationary noise with no a priori information is developed. The autoregressive (AR)-hidden filter model (HFM) with gain contour is proposed for modeling the statistical characteristics of the clean speech signal. Given the HFM parameter set of the speech, speech enhancement becomes a set of problems of joint signal estimation for clean speech and system identification for the gain contour and time-varying parameter of noise. Then, the expectation-maximization (EM) algorithm is applied to signal estimation and system identification. In the E-step, the signal estimation becomes a weighted sum of conditional mean estimator using multiple Kalman filters with Markovian switching coefficient, where the weights equal to a posteriori probabilities of the specific state sequence history given the noisy speech. The probability is computed by the Viterbi algorithm (VA). In M-step, the gain contour and noise parameters are recursively updated by an adaptive algorithm modified from the gradient-based algorithm. The proposed method does not require framing of speech signal in, the train and enhancement procedure. The proposed method is tested against the noisy speech signals degraded by nonstationary noise at various input signal-to-noise ratios. An approximate improvement of 4.5-6.0 dB in signal-to-noise ratio (SNR) is achieved at the input SNR 10 and 15 dB
Keywords :
Kalman filters; Markov processes; adaptive signal processing; autoregressive processes; gradient methods; noise; optimisation; parameter estimation; probability; speech enhancement; time-domain analysis; AR-hidden filter model; EM algorithm; HFM parameter set; Markovian switching coefficient; Viterbi algorithm; a posteriori probabilities; adaptive algorithm; autoregressive hidden filter model; clean speech signal; conditional mean estimator; expectation-maximization algorithm; gain contour; gradient-based algorithm; input SNR; input signal-to-noise ratios; joint signal estimation; multiple Kalman filters; noise parameters; noisy speech; signal estimation; signal-to-noise ratio; speech enhancement; state sequence history; statistical characteristics; statistically independent additive nonstationary noise; system identification; time-domain approach; time-varying parameter; weighted sum; Additive noise; Degradation; Filters; History; Signal to noise ratio; Speech enhancement; State estimation; System identification; Time domain analysis; Time varying systems;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on