Title :
Single-Channel Source Separation Using Complex Matrix Factorization
Author :
King, B.J. ; Atlas, Les
Author_Institution :
Univ. of Washington, Seattle, WA, USA
Abstract :
Nonnegative matrix factorization is gaining popularity in speech and audio processing applications. Performing nonnegative matrix factorization on a complex-valued short-time Fourier transform, however, makes assumptions on the signal, such as additivity in the magnitude domain, potentially degrading the results. One application where these assumptions can cause a problem is in single-channel source separation of overlapping speech. In this paper, we present how this problem can be solved by incorporating phase estimation via complex matrix factorization. Another challenge in source separation is how to select reconstruction bases for optimal separation. In this paper, we compare the most common method with a new, simpler method of finding bases that does not share many of the challenges of the current, established method. The paper will conclude by comparing nonnegative with complex matrix factorization as well as the previous and new methods for finding bases on the task of automatic speech recognition of single-channel two-talker overlapping speech.
Keywords :
Fourier transforms; matrix decomposition; source separation; speech recognition; audio processing; automatic speech recognition; complex matrix factorization; complex valued short time Fourier transform; magnitude domain; nonnegative matrix factorization; optimal separation; phase estimation; single channel source separation; single channel two-talker overlapping speech processing; Automatic speech recognition; Phase estimation; Source separation; Speech processing; Complex matrix factorization (CMF); nonnegative matrix factorization (NMF); source separation; speech processing;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2011.2156786