• DocumentCode
    594695
  • Title

    Perceptually weighted Non-negative Matrix Factorization for blind single-channel music source separation

  • Author

    Kirbiz, Serap ; Gunsel, B.

  • Author_Institution
    Dept. of Electron. & Commun. Eng., Istanbul Tech. Univ., Istanbul, Turkey
  • fYear
    2012
  • fDate
    11-15 Nov. 2012
  • Firstpage
    226
  • Lastpage
    229
  • Abstract
    We propose a blind single-channel musical source separation method that improves perceptual quality of the separated sources. It uses the advantages of subspace learning based on Non-negative Matrix Factor 2-D Deconvolution (NMF2D). To improve the perceptual quality of separation, we propose a weighted divergence type cost function for the optimization that adopts the auditory model defined in ITU-R BS.1387 into the source separation. It is shown that the proposed perceptually weighted NMF2D scheme efficiently clusters the bases of subspace representation corresponding to notes generated by single instruments. Source separation performance has been reported on musical mixtures resulting an improvement in perceptual quality measures.
  • Keywords
    deconvolution; learning (artificial intelligence); matrix decomposition; music; source separation; 2D deconvolution; NMF2D; blind single-channel music source separation; non-negative matrix factorization; perceptual quality; subspace learning; Computational complexity; Humans; Instruments; Source separation; Spectrogram; Time domain analysis; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2012 21st International Conference on
  • Conference_Location
    Tsukuba
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4673-2216-4
  • Type

    conf

  • Filename
    6460113