DocumentCode
2017731
Title
Spectral trajectory estimation using nonnegative matrix factorization for model-based monaural speech separation
Author
Mak, Chun-Man ; Lee, Tan ; Lee, S.W.
Author_Institution
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
fYear
2010
fDate
Nov. 29 2010-Dec. 3 2010
Firstpage
23
Lastpage
28
Abstract
This paper presents a study on model-based speech separation for monaural speech mixture. With prior knowledge about of the text content of the speech sources, we estimate the spectral envelope trajectory of each target source and use them to filter the mixture signal so that the target signal is enhanced and the interfering signal is suppressed. Accurate trajectory estimation is therefore crucial for successful separation. We proposed to use the nonnegative matrix factorization in the trajectory estimation process which improves the accuracy of the estimated trajectories considerably. Performance evaluation is carried out using mixtures of two equally-loud Cantonese speech sources. The proposed method is found to have significant improvement over previously proposed speech separation methods.
Keywords
matrix decomposition; performance evaluation; signal processing; speech processing; Cantonese speech source; interfering signal; mixture signal; model based monaural speech separation; nonnegative matrix factorization; performance evaluation; spectral envelope trajectory; spectral trajectory estimation; speech source; text content; Estimation; Hidden Markov models; Mel frequency cepstral coefficient; Signal to noise ratio; Speech; Trajectory; Speech separation; component; nonnegative matrix factorization; spectral envelope trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location
Tainan
Print_ISBN
978-1-4244-6244-5
Type
conf
DOI
10.1109/ISCSLP.2010.5684883
Filename
5684883
Link To Document