DocumentCode :
1037708
Title :
Normalized Cuts for Predominant Melodic Source Separation
Author :
Lagrange, Mathieu ; Martins, Luis Gustavo ; Murdoch, Jennifer ; Tzanetakis, George
Author_Institution :
Univ. of Victoria, Victoria, BC
Volume :
16
Issue :
2
fYear :
2008
Firstpage :
278
Lastpage :
290
Abstract :
The predominant melodic source, frequently the singing voice, is an important component of musical signals. In this paper, we describe a method for extracting the predominant source and corresponding melody from ldquoreal-worldrdquo polyphonic music. The proposed method is inspired by ideas from computational auditory scene analysis. We formulate predominant melodic source tracking and formation as a graph partitioning problem and solve it using the normalized cut which is a global criterion for segmenting graphs that has been used in computer vision. Sinusoidal modeling is used as the underlying representation. A novel harmonicity cue which we term harmonically wrapped peak similarity is introduced. Experimental results supporting the use of this cue are presented. In addition, we show results for automatic melody extraction using the proposed approach.
Keywords :
audio signal processing; graph theory; information retrieval; music; pattern clustering; signal representation; source separation; spectral analysis; automatic melody extraction; computational auditory scene analysis; graph partitioning problem; harmonically wrapped peak similarity; harmonicity cue; music information retrieval; musical signals; normalized cuts; polyphonic music; predominant melodic source separation; signal representation; sinusoidal modeling; spectral clustering; Computer vision; Data mining; Helium; Image analysis; Independent component analysis; Lagrangian functions; Multiple signal classification; Music information retrieval; Source separation; Statistical analysis; Computational auditory scene analysis (CASA); music information retrieval (MIR); normalized cut; sinusoidal modeling; spectral clustering;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2007.909260
Filename :
4432646
Link To Document :
بازگشت