DocumentCode
730060
Title
Deep NMF for speech separation
Author
Le Roux, Jonathan ; Hershey, John R. ; Weninger, Felix
Author_Institution
Mitsubishi Electr. Res. Labs., Cambridge, MA, USA
fYear
2015
fDate
19-24 April 2015
Firstpage
66
Lastpage
70
Abstract
Non-negative matrix factorization (NMF) has been widely used for challenging single-channel audio source separation tasks. However, inference in NMF-based models relies on iterative inference methods, typically formulated as multiplicative updates. We propose “deep NMF”, a novel non-negative deep network architecture which results from unfolding the NMF iterations and untying its parameters. This architecture can be discriminatively trained for optimal separation performance. To optimize its non-negative parameters, we show how a new form of back-propagation, based on multiplicative updates, can be used to preserve non-negativity, without the need for constrained optimization. We show on a challenging speech separation task that deep NMF improves in terms of accuracy upon NMF and is competitive with conventional sigmoid deep neural networks, while requiring a tenth of the number of parameters.
Keywords
audio signal processing; backpropagation; inference mechanisms; iterative methods; matrix decomposition; source separation; speech processing; NMF iterations; back-propagation; deep NMF; deep nonnegative matrix factorization; iterative inference methods; multiplicative updates; nonnegative deep network architecture; nonnegative parameter optimization; nonnegativity preservation; single-channel audio source separation tasks; speech separation task; Context; Mathematical model; Neural networks; Noise; Speech; Topology; Training; Deep Neural Network; Deep unfolding; Non-negative Back-propagation; Non-negative Matrix Factorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7177933
Filename
7177933
Link To Document