• DocumentCode
    2793029
  • Title
    Blind rhythmic source separation: Nonnegativity and repeatability
  • Author
    Kim, Minje ; Yoo, Jiho ; Kang, Kyeongok ; Choi, Seungjin
  • Author_Institution
    Electron. & Telecommun. Res. Inst. (ETRI), Daejeon, South Korea
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    2006
  • Lastpage
    2009
  • Abstract
    An unsupervised method is proposed for extracting rhythmic sources from commercial polyphonic music that is available in only a single channel. Commercial music signals are rarely provided with more than two channels, yet they often contain multiple instruments, including singing voice. Therefore, instead of conventional approaches such as modeling the mixing environment or statistical characteristics, other source-specific characteristics must be introduced to separate or extract the sources. In this paper, we concentrate on extracting rhythmic sources from a mixture that also contains harmonic sources. An extension of nonnegative matrix factorization (NMF) is used to analyze multiple relationships between spectral and temporal properties in the given input matrices. Moreover, the temporal repeatability of rhythmic sound sources is exploited as a rhythmic property common to the segments of an input mixture signal. The proposed method achieves acceptable separation quality, though not superior to that of the referenced drum source separation systems. However, it is more widely applicable because the separation is blind.
  • Keywords
    acoustic signal processing; blind source separation; music; blind rhythmic source separation; commercial polyphonic music; nonnegative matrix factorization; nonnegativity; repeatability; singing voice; unsupervised method; Blind source separation; Computer science; Feature extraction; Humans; Information analysis; Instruments; Multiple signal classification; Performance analysis; Source separation; Sparse matrices; Nonnegative matrix factorization; musical information research; rhythmic source separation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2010.5495205
  • Filename
    5495205