• DocumentCode
    1096341
  • Title

    Unsupervised Single-Channel Music Source Separation by Average Harmonic Structure Modeling

  • Author

    Duan, Zhiyao ; Zhang, Yungang ; Zhang, Changshui ; Shi, Zhenwei

  • Author_Institution
    Dept. of Autom., Tsinghua Univ., Beijing
  • Volume
    16
  • Issue
    4
  • fYear
    2008
  • fDate
    5/1/2008 12:00:00 AM
  • Firstpage
    766
  • Lastpage
    778
  • Abstract
    Source separation of musical signals is an appealing but difficult problem, especially in the single-channel case. In this paper, an unsupervised single-channel music source separation algorithm based on average harmonic structure modeling is proposed. Under the assumption of playing in narrow pitch ranges, different harmonic instrumental sources in a piece of music often have different but stable harmonic structures; thus, sources can be characterized uniquely by harmonic structure models. Given the number of instrumental sources, the proposed algorithm learns these models directly from the mixed signal by clustering the harmonic structures extracted from different frames. The corresponding sources are then extracted from the mixed signal using the models. Experiments on several mixed signals, including synthesized instrumental sources, real instrumental sources, and singing voices, show that this algorithm outperforms the general nonnegative matrix factorization (NMF)-based source separation algorithm, and yields good subjective listening quality. As a side effect, this algorithm estimates the pitches of the harmonic instrumental sources. The number of concurrent sounds in each frame is also computed, which is a difficult task for general multipitch estimation (MPE) algorithms.
  • Keywords
    harmonic analysis; matrix algebra; source separation; average harmonic structure modeling; multipitch estimation algorithm; musical signal; nonnegative matrix factorization; unsupervised single-channel music source separation algorithm; Clustering; harmonic structure; multipitch estimation; single-channel source separation;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2008.919073
  • Filename
    4469889