• DocumentCode
    730156
  • Title

    Vocal activity informed singing voice separation with the iKala dataset

  • Author

    Tak-Shing Chan ; Tzu-Chun Yeh ; Zhe-Cheng Fan ; Hung-Wei Chen ; Li Su ; Yi-Hsuan Yang ; Jang, Roger

  • Author_Institution
    Res. Center for Inf. Technol. Innovation, Taipei, Taiwan
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    718
  • Lastpage
    722
  • Abstract
    A new algorithm is proposed for robust principal component analysis with predefined sparsity patterns. The algorithm is then applied to separate the singing voice from the instrumental accompaniment using vocal activity information. To evaluate its performance, we construct a new publicly available iKala dataset that features longer durations and higher quality than the existing MIR-1K dataset for singing voice separation. Part of it will be used in the MIREX Singing Voice Separation task. Experimental results on both the MIR-1K dataset and the new iKala dataset confirmed that the more informed the algorithm is, the better the separation results are.
  • Keywords
    principal component analysis; speech processing; MIR-IK dataset; iKala dataset; predefined sparsity pattern; robust principal component analysis; vocal activity informed singing voice separation; Electronic publishing; Harmonic analysis; Information services; Internet; MATLAB; Low-rank and sparse decomposition; informed source separation; singing voice separation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178063
  • Filename
    7178063