Lp-norm non-negative matrix factorization and its application to singing voice enhancement

Author

Nakamuray, Tomohiko ; Kameoka, Hirokazu

Author_Institution

Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan

fYear

2015

fDate

19-24 April 2015

Firstpage

2115

Lastpage

2119

Abstract

Measures of sparsity are useful in many aspects of audio signal processing including speech enhancement, audio coding and singing voice enhancement, and the well-known method for these applications is non-negative matrix factorization (NMF), which decomposes a non-negative data matrix into two non-negative matrices. Although previous studies on NMF have focused on the sparsity of the two matrices, the sparsity of reconstruction errors between a data matrix and the two matrices is also important, since designing the sparsity is equivalent to assuming the nature of the errors. We propose a new NMF technique, which we called L_p-norm NMF, that minimizes the L_p norm of the reconstruction errors, and derive a computationally efficient algorithm for L_p-norm NMF according to an auxiliary function principle. This algorithm can be generalized for the factorization of a real-valued matrix into the product of two real-valued matrices. We apply the algorithm to singing voice enhancement and show that adequately selecting p improves the enhancement.

Keywords

audio coding; matrix decomposition; signal reconstruction; sparse matrices; speech enhancement; L_p-norm nonnegative matrix factorization; NMF; audio coding; audio signal processing; auxiliary function principle; nonnegative data matrix decomposition; signal reconstruction error sparsity; singing voice enhancement; speech enhancement; Artificial neural networks; Harmonic analysis; Rhythm; Robustness; Speech; Speech enhancement; Lp norm; Non-negative matrix factorization; auxiliary function principle;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location

South Brisbane, QLD

Type

conf

DOI

10.1109/ICASSP.2015.7178344

Filename

7178344