Title :
Ensemble Discriminant Sparse Projections Applied to Music Genre Classification
Author :
Kotropoulos, Constantine ; Arce, Gonzalo R. ; Panagakis, Yannis
Author_Institution :
Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki, Greece
Abstract :
Resorting to the rich, psycho-physiologically grounded, properties of the slow temporal modulations of music recordings, a novel classifier ensemble is built, which applies discriminant sparse projections. More specifically, over complete dictionaries are learned and sparse coefficient vectors are extracted to optimally approximate the slow temporal modulations of the training music recordings. The sparse coefficient vectors are then projected to the principal subspaces of their within-class and between-class covariance matrices. Decisions are taken with respect to the minimum Euclidean distance from the class mean sparse coefficient vectors, which undergo the aforementioned projections. The application of majority voting to the decisions taken by 10 individual classifiers, which are trained on the 10 training folds defined by stratified 10-fold cross-validation on the GTZAN dataset, yields a music genre classification accuracy of 84.96% on average. The latter exceeds by 2.46% the highest accuracy previously reported without employing any sparse representations.
Keywords :
audio signal processing; covariance matrices; modulation; music; signal classification; GTZAN dataset; class mean sparse coefficient vectors; classifier ensemble; covariance matrices; ensemble discriminant sparse projection; minimum Euclidean distance; music genre classification; music recording; stratified 10-fold cross-validation; temporal modulation; Accuracy; Covariance matrix; Dictionaries; Modulation; Sparse matrices; Training; Vectors; auditory temporal modulations; music genre classification; sparse representations;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.207