Title :
Single-channel speech separation by including spectral structure information within non-negative matrix factorization
Author :
Yuxiao Feng ; Ritz, Christian
Author_Institution :
ICT Res. Inst. & Sch. of Electr. Comput. & Telecommun. Eng., Univ. of Wollongong, Wollongong, NSW, Australia
Abstract :
This paper proposes a novel extension on Non-negative Matrix Factorization (NMF) scheme for the separation of single channel speech mixtures, where we impose a post-sparse model on the original weight matrix derived from a previously proposed coherence-constrained NMF model. The approach considers both the modeling ability of NMF basis functions for each source as well as the ability of these basis functions to achieve accurate separation performance. Compared with latest associated NMF models for source separation, the results of our model indicate promising advantages, in terms of both objective source separation measures and Perceptual Evaluation of Speech Quality (PESQ) evaluations.
Keywords :
blind source separation; matrix decomposition; spectral analysis; speech processing; NMF basis function; NMF scheme; PESQ evaluation; coherence-constrained NMF model; modeling ability; nonnegative matrix factorization; objective source separation measure; perceptual evaluation of speech quality; post-sparse model; separation performance; single channel speech mixture; single-channel speech separation; spectral structure information; weight matrix; Australia; Computers; Decision support systems; Electronic mail; Indexes; Telecommunications; Training; Non-negative matrix factorization; sparsity; spectral mask; speech separation;
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
Conference_Location :
Chengdu
DOI :
10.1109/ChinaSIP.2015.7230478