• DocumentCode
    36300
  • Title
    Separation of Singing Voice Using Nonnegative Matrix Partial Co-Factorization for Singer Identification
  • Author
    Ying Hu ; Guizhong Liu
  • Author_Institution
    Sch. of Electron. & Inf. Eng., Xian Jiaotong Univ., Xian, China
  • Volume
    23
  • Issue
    4
  • fYear
    2015
  • fDate
    Apr-15
  • Firstpage
    643
  • Lastpage
    653
  • Abstract
    To improve singer identification performance, we propose a system that separates the singing voice from the music accompaniment in monaural recordings. The system consists of two key stages. The first stage uses nonnegative matrix partial co-factorization (NMPCF), a joint matrix decomposition that integrates prior knowledge of singing voice and pure accompaniment, to separate the mixture signal into a singing voice portion and an accompaniment portion. In the second stage, the pitches of the singing voice are estimated from the separated singing voice produced by the first stage, and the harmonic components of the singing voice are then identified. Within a frame, the identified harmonic components are regarded as reliable and the other frequency components as unreliable, so the spectrum is incomplete. From those harmonic components, the complete spectra of the singing voice are reconstructed by a missing feature method, spectrum reconstruction, yielding a refined signal with a cleaner singing voice. Experimental results demonstrate that, from the point of view of source separation, the singing voice refinement further improves ΔSNR compared with singing voice separation using NMPCF alone, whereas from the point of view of singer identification, the singing voice separated by NMPCF is more appropriate than the refined singing voice.
  • Keywords
    matrix decomposition; music; signal reconstruction; source separation; speaker recognition; NMPCF; accompaniment portion; frequency components; harmonic components; joint matrix decomposition; missing feature method; mixture signal separation; monaural recordings; music accompaniment; nonnegative matrix partial co-factorization; refined signal; singer identification; singing voice portion; singing voice refinement; singing voice separation; source separation; spectrum reconstruction; Feature extraction; Harmonic analysis; IEEE transactions; Instruments; Matrix decomposition; Source separation; Spectrogram; Nonnegative matrix partial co-factorization (NMPCF); singer identification; singing voice separation; spectrum reconstruction;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    IEEE
  • ISSN
    2329-9290
  • Type
    jour
  • DOI
    10.1109/TASLP.2015.2396681
  • Filename
    7021947
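
The first stage described in the abstract (a joint decomposition in which the mixture shares its basis vectors with prior recordings of singing voice and of pure accompaniment) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function name `nmpcf_separate`, the Euclidean cost, the single weight `lam` on the prior terms, the multiplicative updates, and the soft-mask reconstruction. The inputs are assumed to be nonnegative magnitude spectrograms with the same number of frequency bins.

```python
import numpy as np

def nmpcf_separate(X, Y_voice, Y_acc, k_voice=8, k_acc=8, lam=1.0,
                   n_iter=200, seed=0):
    """Hypothetical partial co-factorization sketch (not the paper's exact method).

    Model:  X       ~ U_v V_v + U_a V_a   (mixture)
            Y_voice ~ U_v W_v             (voice-only prior, shares U_v)
            Y_acc   ~ U_a W_a             (accompaniment-only prior, shares U_a)
    """
    rng = np.random.default_rng(seed)
    eps = 1e-12
    n_freq, n_frames = X.shape
    # Random strictly positive initialisation of bases and activations.
    U_v = rng.random((n_freq, k_voice)) + eps
    U_a = rng.random((n_freq, k_acc)) + eps
    V_v = rng.random((k_voice, n_frames)) + eps
    V_a = rng.random((k_acc, n_frames)) + eps
    W_v = rng.random((k_voice, Y_voice.shape[1])) + eps
    W_a = rng.random((k_acc, Y_acc.shape[1])) + eps

    losses = []
    for _ in range(n_iter):
        # Update all activations simultaneously with the bases held fixed.
        X_hat = U_v @ V_v + U_a @ V_a
        V_v_new = V_v * (U_v.T @ X) / (U_v.T @ X_hat + eps)
        V_a_new = V_a * (U_a.T @ X) / (U_a.T @ X_hat + eps)
        V_v, V_a = V_v_new, V_a_new
        W_v = W_v * (U_v.T @ Y_voice) / (U_v.T @ U_v @ W_v + eps)
        W_a = W_a * (U_a.T @ Y_acc) / (U_a.T @ U_a @ W_a + eps)

        # Update both shared bases simultaneously with activations fixed.
        X_hat = U_v @ V_v + U_a @ V_a
        U_v_new = U_v * (X @ V_v.T + lam * Y_voice @ W_v.T) / (
            X_hat @ V_v.T + lam * U_v @ (W_v @ W_v.T) + eps)
        U_a_new = U_a * (X @ V_a.T + lam * Y_acc @ W_a.T) / (
            X_hat @ V_a.T + lam * U_a @ (W_a @ W_a.T) + eps)
        U_v, U_a = U_v_new, U_a_new

        # Track the joint Euclidean cost for inspection.
        X_hat = U_v @ V_v + U_a @ V_a
        loss = 0.5 * np.sum((X - X_hat) ** 2)
        loss += 0.5 * lam * np.sum((Y_voice - U_v @ W_v) ** 2)
        loss += 0.5 * lam * np.sum((Y_acc - U_a @ W_a) ** 2)
        losses.append(loss)

    # Soft-mask reconstruction: split X in proportion to each part's model.
    X_hat = U_v @ V_v + U_a @ V_a
    voice = X * (U_v @ V_v) / (X_hat + eps)
    accompaniment = X * (U_a @ V_a) / (X_hat + eps)
    return voice, accompaniment, losses


# Usage on synthetic nonnegative data standing in for magnitude spectrograms.
rng = np.random.default_rng(42)
mixture = rng.random((16, 40))
voice_prior = rng.random((16, 30))
acc_prior = rng.random((16, 30))
voice_est, acc_est, history = nmpcf_separate(mixture, voice_prior, acc_prior,
                                             n_iter=100)
```

The estimated magnitudes would still have to be combined with the mixture phase and inverted to the time domain before any pitch estimation or spectrum reconstruction of the kind the second stage describes; that step is outside this sketch.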