DocumentCode :
177640
Title :
Music segment similarity using 2D-Fourier Magnitude Coefficients
Author :
Nieto, Oriol ; Bello, Juan P.
Author_Institution :
Music & Audio Res. Lab., New York Univ., New York, NY, USA
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
664
Lastpage :
668
Abstract :
Music segmentation is the task of automatically identifying the different segments of a piece. In this work we present a novel approach to cluster the musical segments based on their acoustic similarity by using 2D-Fourier Magnitude Coefficients (2D-FMCs). These coefficients, computed from a chroma representation, significantly simplify the problem of clustering the different segments since they are key transposition and phase shift invariant. We explore various strategies to obtain the 2D-FMC patches that represent entire segments and apply k-means to label them. Finally, we discuss possible ways of estimating k and compare our competitive results with the current state of the art.
Keywords :
Fourier transforms; acoustic signal processing; audio signal processing; learning (artificial intelligence); music; pattern clustering; signal representation; 2D-FMC; 2D-Fourier magnitude coefficients; acoustic similarity; chroma representation; k-means algorithm; music segment similarity; music segmentation; segment clustering; Multiple signal classification; Music; Music information retrieval; Speech; Speech processing; Vectors; 2D-Fourier Transform; Clustering; Music Segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6853679
Filename :
6853679
Link To Document :
بازگشت