• DocumentCode
    730345
  • Title

    Lp-norm non-negative matrix factorization and its application to singing voice enhancement

  • Author

    Nakamuray, Tomohiko ; Kameoka, Hirokazu

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    2115
  • Lastpage
    2119
  • Abstract
    Measures of sparsity are useful in many aspects of audio signal processing including speech enhancement, audio coding and singing voice enhancement, and the well-known method for these applications is non-negative matrix factorization (NMF), which decomposes a non-negative data matrix into two non-negative matrices. Although previous studies on NMF have focused on the sparsity of the two matrices, the sparsity of reconstruction errors between a data matrix and the two matrices is also important, since designing the sparsity is equivalent to assuming the nature of the errors. We propose a new NMF technique, which we called Lp-norm NMF, that minimizes the Lp norm of the reconstruction errors, and derive a computationally efficient algorithm for Lp-norm NMF according to an auxiliary function principle. This algorithm can be generalized for the factorization of a real-valued matrix into the product of two real-valued matrices. We apply the algorithm to singing voice enhancement and show that adequately selecting p improves the enhancement.
  • Keywords
    audio coding; matrix decomposition; signal reconstruction; sparse matrices; speech enhancement; Lp-norm nonnegative matrix factorization; NMF; audio coding; audio signal processing; auxiliary function principle; nonnegative data matrix decomposition; signal reconstruction error sparsity; singing voice enhancement; speech enhancement; Artificial neural networks; Harmonic analysis; Rhythm; Robustness; Speech; Speech enhancement; Lp norm; Non-negative matrix factorization; auxiliary function principle;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178344
  • Filename
    7178344