DocumentCode :
2069784
Title :
Model-based non-negative matrix factorization for single-channel speech separation
Author :
Zheng, Nengheng ; Lee, Tan ; Mak, Chun-Man
Author_Institution :
Coll. of Inf. Engineergin, Shenzhen Univ., Shenzhen, China
fYear :
2011
fDate :
14-16 Sept. 2011
Firstpage :
1
Lastpage :
4
Abstract :
A model-based non-negative matrix factorization (NMF) algorithm is formulated for single-channel speech source separation. With linguistic priors of the speech sources, the state-aligned spectral envelopes of these sources are inferred from acoustic models. Being initialized with the computed spectral envelopes, NMF processing is implemented to estimate the spectral envelope trajectory of each source. Subsequently the source speech is generated by reshaping the mixture spectra according to the estimated spectral envelopes, followed by the reduction of interfering harmonic components. An iterative process is developed for more reliable time alignment and hence better performance of separation. Experimental results show that two speech sources with equal intensity can be successfully separated by the proposed algorithm.
Keywords :
matrix decomposition; source separation; speech processing; acoustic models; model based nonnegative matrix factorization; single channel speech separation; spectral envelope estimation; spectral envelope trajectory; Acoustics; Hidden Markov models; Source separation; Speech; Speech enhancement; Trajectory; non-negative matrix separation; spectral envelope; speech source separation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, Communications and Computing (ICSPCC), 2011 IEEE International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4577-0893-0
Type :
conf
DOI :
10.1109/ICSPCC.2011.6061795
Filename :
6061795
Link To Document :
بازگشت