• DocumentCode
    2170836
  • Title

    Factors in factorization: Does better audio source separation imply better polyphonic music transcription?

  • Author

    Tavares, Tiago Fernandes ; Tzanetakis, G. ; Driessen, Peter

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Victoria, Victoria, BC, Canada
  • fYear
    2013
  • fDate
    Sept. 30 2013-Oct. 2 2013
  • Firstpage
    424
  • Lastpage
    428
  • Abstract
    Spectrogram factorization methods such as Non-Negative Matrix Factorization (NMF) are frequently used as a way to separate individual sound sources from complex sound mixtures. More recently, they have also been used as a first stage for the automatic transcription of polyphonic music. The problem of sound source separation is different (but related) to the problem of automatic music transcription. The output of the first is the separated audio signals corresponding to each sound source, whereas the output of the second is a symbolic representation/music score that encodes the discrete pitches/notes that are played and when they are played. Many variations of factorization methods have been proposed. Two important design choices are the way spectra are represented and what distance measures are used to compare them in the optimization used for factorization. A common assumption has been that a variant that yields better signal separation will result in better automatic transcription. In this work, we investigate experimentally this question and show that this relationship is not necessarily true.
  • Keywords
    audio signal processing; matrix decomposition; music; source separation; audio signals; audio source separation; automatic music transcription; complex sound mixture; nonnegative matrix factorization method; polyphonic music transcription; sound source separation; spectrogram factorization method; Approximation methods; Correlation; Databases; Distortion measurement; Source separation; Spectrogram; Vectors; Beta-Divergence; Music Transcription; Non-negative matrix factorization; Sound Source Separation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing (MMSP), 2013 IEEE 15th International Workshop on
  • Conference_Location
    Pula
  • Type

    conf

  • DOI
    10.1109/MMSP.2013.6659326
  • Filename
    6659326