DocumentCode :
641131
Title :
Unsupervised music segmentation via multi-scale processing of compressive features´ representation
Author :
Theodorakopoulos, Ilias ; Economou, George ; Fotopoulos, Spiros
Author_Institution :
Phys. Dept., Univ. of Patras, Patras, Greece
fYear :
2013
fDate :
1-3 July 2013
Firstpage :
1
Lastpage :
6
Abstract :
We present an automated method for unsupervised detection of structural boundaries in musical recordings. The proposed method utilizes a compressed representation of features capturing timbre and chroma, in an 1-D time series derived via PCA. Time delay embedding and multi-scale comparison using the Wald-Wolfowitz statistical test are incorporated in order to calculate a Self Dissimilarity Matrix. A novelty curve is estimated by convolving an appropriate kernel along the main diagonal of the matrix, while the structural boundaries are located on the local maxima of the derived curve. We evaluate the proposed method on a popular dataset, using two different ground truth annotations. We demonstrate that the 1-D compressed representation of features contains enough information in order to detect boundaries with high precision, outperforming several methods from the literature.
Keywords :
matrix algebra; music; principal component analysis; time series; 1D compressed representation; 1D time series; PCA; Wald-Wolfowitz statistical test; automated method; chroma; compressive feature representation; multiscale comparison; multiscale processing; musical recordings; novelty curve; self dissimilarity matrix; structural boundaries; timbre; time delay embedding; unsupervised detection; unsupervised music segmentation; Delay effects; Feature extraction; Kernel; Timbre; Time series analysis; Vectors; Chroma feature; Multi-Scale; Music Structure; Structural bounraries; Time delay embedding; Trimbe features;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Signal Processing (DSP), 2013 18th International Conference on
Conference_Location :
Fira
ISSN :
1546-1874
Type :
conf
DOI :
10.1109/ICDSP.2013.6622772
Filename :
6622772
Link To Document :
بازگشت