DocumentCode
730345
Title
Lp -norm non-negative matrix factorization and its application to singing voice enhancement
Author
Nakamuray, Tomohiko ; Kameoka, Hirokazu
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear
2015
fDate
19-24 April 2015
Firstpage
2115
Lastpage
2119
Abstract
Measures of sparsity are useful in many aspects of audio signal processing including speech enhancement, audio coding and singing voice enhancement, and the well-known method for these applications is non-negative matrix factorization (NMF), which decomposes a non-negative data matrix into two non-negative matrices. Although previous studies on NMF have focused on the sparsity of the two matrices, the sparsity of reconstruction errors between a data matrix and the two matrices is also important, since designing the sparsity is equivalent to assuming the nature of the errors. We propose a new NMF technique, which we called Lp-norm NMF, that minimizes the Lp norm of the reconstruction errors, and derive a computationally efficient algorithm for Lp-norm NMF according to an auxiliary function principle. This algorithm can be generalized for the factorization of a real-valued matrix into the product of two real-valued matrices. We apply the algorithm to singing voice enhancement and show that adequately selecting p improves the enhancement.
Keywords
audio coding; matrix decomposition; signal reconstruction; sparse matrices; speech enhancement; Lp-norm nonnegative matrix factorization; NMF; audio coding; audio signal processing; auxiliary function principle; nonnegative data matrix decomposition; signal reconstruction error sparsity; singing voice enhancement; speech enhancement; Artificial neural networks; Harmonic analysis; Rhythm; Robustness; Speech; Speech enhancement; Lp norm; Non-negative matrix factorization; auxiliary function principle;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178344
Filename
7178344
Link To Document