DocumentCode :
2017731
Title :
Spectral trajectory estimation using nonnegative matrix factorization for model-based monaural speech separation
Author :
Mak, Chun-Man ; Lee, Tan ; Lee, S.W.
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
fYear :
2010
fDate :
Nov. 29 2010-Dec. 3 2010
Firstpage :
23
Lastpage :
28
Abstract :
This paper presents a study on model-based speech separation for monaural speech mixture. With prior knowledge about of the text content of the speech sources, we estimate the spectral envelope trajectory of each target source and use them to filter the mixture signal so that the target signal is enhanced and the interfering signal is suppressed. Accurate trajectory estimation is therefore crucial for successful separation. We proposed to use the nonnegative matrix factorization in the trajectory estimation process which improves the accuracy of the estimated trajectories considerably. Performance evaluation is carried out using mixtures of two equally-loud Cantonese speech sources. The proposed method is found to have significant improvement over previously proposed speech separation methods.
Keywords :
matrix decomposition; performance evaluation; signal processing; speech processing; Cantonese speech source; interfering signal; mixture signal; model based monaural speech separation; nonnegative matrix factorization; performance evaluation; spectral envelope trajectory; spectral trajectory estimation; speech source; text content; Estimation; Hidden Markov models; Mel frequency cepstral coefficient; Signal to noise ratio; Speech; Trajectory; Speech separation; component; nonnegative matrix factorization; spectral envelope trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
Type :
conf
DOI :
10.1109/ISCSLP.2010.5684883
Filename :
5684883
Link To Document :
بازگشت