Title :
Deep NMF for speech separation
Author :
Le Roux, Jonathan ; Hershey, John R. ; Weninger, Felix
Author_Institution :
Mitsubishi Electr. Res. Labs., Cambridge, MA, USA
Abstract :
Non-negative matrix factorization (NMF) has been widely used for challenging single-channel audio source separation tasks. However, inference in NMF-based models relies on iterative inference methods, typically formulated as multiplicative updates. We propose “deep NMF”, a novel non-negative deep network architecture which results from unfolding the NMF iterations and untying its parameters. This architecture can be discriminatively trained for optimal separation performance. To optimize its non-negative parameters, we show how a new form of back-propagation, based on multiplicative updates, can be used to preserve non-negativity, without the need for constrained optimization. We show on a challenging speech separation task that deep NMF improves in terms of accuracy upon NMF and is competitive with conventional sigmoid deep neural networks, while requiring a tenth of the number of parameters.
Keywords :
audio signal processing; backpropagation; inference mechanisms; iterative methods; matrix decomposition; source separation; speech processing; NMF iterations; back-propagation; deep NMF; deep nonnegative matrix factorization; iterative inference methods; multiplicative updates; nonnegative deep network architecture; nonnegative parameter optimization; nonnegativity preservation; single-channel audio source separation tasks; speech separation task; Context; Mathematical model; Neural networks; Noise; Speech; Topology; Training; Deep Neural Network; Deep unfolding; Non-negative Back-propagation; Non-negative Matrix Factorization;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7177933