DocumentCode :
1520617
Title :
Single-Channel Source Separation Using Complex Matrix Factorization
Author :
King, B.J. ; Atlas, Les
Author_Institution :
Univ. of Washington, Seattle, WA, USA
Volume :
19
Issue :
8
fYear :
2011
Firstpage :
2591
Lastpage :
2597
Abstract :
Nonnegative matrix factorization is gaining popularity in speech and audio processing applications. Performing nonnegative matrix factorization on a complex-valued short-time Fourier transform, however, makes assumptions on the signal, such as additivity in the magnitude domain, potentially degrading the results. One application where these assumptions can cause a problem is in single-channel source separation of overlapping speech. In this paper, we present how this problem can be solved by incorporating phase estimation via complex matrix factorization. Another challenge in source separation is how to select reconstruction bases for optimal separation. In this paper, we compare the most common method with a new, simpler method of finding bases that does not share many of the challenges of the current, established method. The paper will conclude by comparing nonnegative with complex matrix factorization as well as the previous and new methods for finding bases on the task of automatic speech recognition of single-channel two-talker overlapping speech.
Keywords :
Fourier transforms; matrix decomposition; source separation; speech recognition; audio processing; automatic speech recognition; complex matrix factorization; complex valued short time Fourier transform; magnitude domain; nonnegative matrix factorization; optimal separation; phase estimation; single channel source separation; single channel two-talker overlapping speech processing; Automatic speech recognition; Phase estimation; Source separation; Speech processing; Complex matrix factorization (CMF); nonnegative matrix factorization (NMF); source separation; speech processing;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2011.2156786
Filename :
5771055
Link To Document :
بازگشت