Title :
NMF-based Target Source Separation Using Deep Neural Network
Author :
Tae Gyoon Kang ; Kisoo Kwon ; Jong Won Shin ; Nam Soo Kim
Author_Institution :
Dept. of Electr. & Comput. Eng., Seoul Nat. Univ., Seoul, South Korea
Abstract :
Non-negative matrix factorization (NMF) is one of the most well-known techniques that are applied to separate a desired source from mixture data. In the NMF framework, a collection of data is factorized into a basis matrix and an encoding matrix. The basis matrix for mixture data is usually constructed by augmenting the basis matrices for independent sources. However, target source separation with the concatenated basis matrix turns out to be problematic if there exists some overlap between the subspaces that the bases for the individual sources span. In this letter, we propose a novel approach to improve encoding vector estimation for target signal extraction. Estimating encoding vectors from the mixture data is viewed as a regression problem and a deep neural network (DNN) is used to learn the mapping between the mixture data and the corresponding encoding vectors. To demonstrate the performance of the proposed algorithm, experiments were conducted in the speech enhancement task. The experimental results show that the proposed algorithm outperforms the conventional encoding vector estimation scheme.
Keywords :
encoding; estimation theory; matrix decomposition; mixture models; neural nets; source separation; speech enhancement; vectors; DNN; NMF-based target source separation; basis matrix; concatenated basis matrix; data factorization; deep neural network; encoding matrix; encoding vector estimation; encoding vectors estimation; mixture data; nonnegative matrix factorization; speech enhancement task; target signal extraction; target source separation; Encoding; Noise; Signal processing algorithms; Source separation; Speech; Speech enhancement; Vectors; Deep neural network; dictionary learning; non-negative matrix factorization; speech enhancement; target source separation;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2014.2354456