DocumentCode :
155629
Title :
Regularized NMF-based speech enhancement with spectral components modeled by gaussian mixtures
Author :
Hanwook Chung ; Plourde, Eric ; Champagne, Benoit
Author_Institution :
Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
fYear :
2014
fDate :
21-24 Sept. 2014
Firstpage :
1
Lastpage :
6
Abstract :
In this paper, we introduce a single channel speech enhancement algorithm based on regularized non-negative matrix factorization (NMF). In our proposed formulation, the log-likelihood function (LLF) of the magnitude spectral components, based on Gaussian mixture models (GMM) for both the speech and background noise signals, is included as a regularization term in the NMF cost function. By using this spectral type of regularization, we can incorporate the statistical properties of the signals during the estimation of both the basis and excitation martices in NMF model. Furthermore, borrowing from the expectation-maximization (EM) algorithm and to reduce the computational complexity of the NMF update, the LLF is replaced by its expected value. Experimental results of perceptual evaluation of speech quality (PESQ), source-to-distortion ratio (SDR) and source-to-interference ratio (SIR) show that the proposed speech enhancement algorithm provides better performance than the compared benchmark algorithms.
Keywords :
Gaussian processes; computational complexity; expectation-maximisation algorithm; interference (signal); matrix decomposition; mixture models; spectral analysis; speech enhancement; statistical analysis; EM algorithm; GMM; Gaussian mixture models; Gaussian mixtures; LLF; NMF cost function; NMF update; PESQ; SDR; SIR; background noise signal; computational complexity; excitation martices; expectation-maximization algorithm; log-likelihood function; magnitude spectral component; perceptual evaluation of speech quality; regularization term; regularized NMF-based speech enhancement; regularized nonnegative matrix factorization; single channel speech enhancement algorithm; source-to-distortion ratio; source-to-interference ratio; spectral components; spectral type; speech noise; statistical property; Abstracts; Signal to noise ratio; Gaussian mixture model; Regularized non-negative matrix factorization; expectation-maximization; single channel speech enhancement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2014 IEEE International Workshop on
Conference_Location :
Reims
Type :
conf
DOI :
10.1109/MLSP.2014.6958870
Filename :
6958870
Link To Document :
بازگشت