DocumentCode :
33680
Title :
Nonnegative HMM for Babble Noise Derived From Speech HMM: Application to Speech Enhancement
Author :
Mohammadiha, Nasser ; Leijon, Arne
Author_Institution :
Sound & Image Process. Lab., KTH R. Inst. of Technol., Stockholm, Sweden
Volume :
21
Issue :
5
fYear :
2013
fDate :
May-13
Firstpage :
998
Lastpage :
1011
Abstract :
Deriving a good model for multitalker babble noise can facilitate different speech processing algorithms, e.g., noise reduction, to reduce the so-called cocktail party difficulty. In the available systems, the fact that the babble waveform is generated as a sum of N different speech waveforms is not exploited explicitly. In this paper, first we develop a gamma hidden Markov model for power spectra of the speech signal, and then formulate it as a sparse nonnegative matrix factorization (NMF). Second, the sparse NMF is extended by relaxing the sparsity constraint, and a novel model for babble noise (gamma nonnegative HMM) is proposed in which the babble basis matrix is the same as the speech basis matrix, and only the activation factors (weights) of the basis vectors are different for the two signals over time. Finally, a noise reduction algorithm is proposed using the derived speech and babble models. All of the stationary model parameters are estimated using the expectation-maximization (EM) algorithm, whereas the time-varying parameters, i.e., the gain parameters of speech and babble signals, are estimated using a recursive EM algorithm. The objective and subjective listening evaluations show that the proposed babble model and the final noise reduction algorithm significantly outperform the conventional methods.
Keywords :
expectation-maximisation algorithm; hidden Markov models; matrix decomposition; signal denoising; speech enhancement; babble basis matrix; babble waveform generation; cocktail party difficulty reduction; expectation-maximization algorithm; gamma hidden Markov model; gamma nonnegative HMM; multitalker babble noise; noise reduction algorithm; power spectra; recursive EM algorithm; sparse NMF; sparse nonnegative matrix factorization; speech HMM; speech basis matrix; speech enhancement; speech processing; stationary model parameter; time-varying parameter; Hidden Markov models; Noise; Noise reduction; Speech; Speech enhancement; Vectors; Babble noise; hidden Markov model; nonnegative matrix factorization; speech enhancement;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2013.2243435
Filename :
6423259
Link To Document :
بازگشت