DocumentCode
2170836
Title
Factors in factorization: Does better audio source separation imply better polyphonic music transcription?
Author
Tavares, Tiago Fernandes ; Tzanetakis, G. ; Driessen, Peter
Author_Institution
Dept. of Comput. Sci., Univ. of Victoria, Victoria, BC, Canada
fYear
2013
fDate
Sept. 30 2013-Oct. 2 2013
Firstpage
424
Lastpage
428
Abstract
Spectrogram factorization methods such as Non-Negative Matrix Factorization (NMF) are frequently used as a way to separate individual sound sources from complex sound mixtures. More recently, they have also been used as a first stage for the automatic transcription of polyphonic music. The problem of sound source separation is different (but related) to the problem of automatic music transcription. The output of the first is the separated audio signals corresponding to each sound source, whereas the output of the second is a symbolic representation/music score that encodes the discrete pitches/notes that are played and when they are played. Many variations of factorization methods have been proposed. Two important design choices are the way spectra are represented and what distance measures are used to compare them in the optimization used for factorization. A common assumption has been that a variant that yields better signal separation will result in better automatic transcription. In this work, we investigate experimentally this question and show that this relationship is not necessarily true.
Keywords
audio signal processing; matrix decomposition; music; source separation; audio signals; audio source separation; automatic music transcription; complex sound mixture; nonnegative matrix factorization method; polyphonic music transcription; sound source separation; spectrogram factorization method; Approximation methods; Correlation; Databases; Distortion measurement; Source separation; Spectrogram; Vectors; Beta-Divergence; Music Transcription; Non-negative matrix factorization; Sound Source Separation;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing (MMSP), 2013 IEEE 15th International Workshop on
Conference_Location
Pula
Type
conf
DOI
10.1109/MMSP.2013.6659326
Filename
6659326
Link To Document