DocumentCode
2483063
Title
Ensemble Discriminant Sparse Projections Applied to Music Genre Classification
Author
Kotropoulos, Constantine ; Arce, Gonzalo R. ; Panagakis, Yannis
Author_Institution
Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki, Greece
fYear
2010
fDate
23-26 Aug. 2010
Firstpage
822
Lastpage
825
Abstract
Resorting to the rich, psycho-physiologically grounded, properties of the slow temporal modulations of music recordings, a novel classifier ensemble is built, which applies discriminant sparse projections. More specifically, over complete dictionaries are learned and sparse coefficient vectors are extracted to optimally approximate the slow temporal modulations of the training music recordings. The sparse coefficient vectors are then projected to the principal subspaces of their within-class and between-class covariance matrices. Decisions are taken with respect to the minimum Euclidean distance from the class mean sparse coefficient vectors, which undergo the aforementioned projections. The application of majority voting to the decisions taken by 10 individual classifiers, which are trained on the 10 training folds defined by stratified 10-fold cross-validation on the GTZAN dataset, yields a music genre classification accuracy of 84.96% on average. The latter exceeds by 2.46% the highest accuracy previously reported without employing any sparse representations.
Keywords
audio signal processing; covariance matrices; modulation; music; signal classification; GTZAN dataset; class mean sparse coefficient vectors; classifier ensemble; covariance matrices; ensemble discriminant sparse projection; minimum Euclidean distance; music genre classification; music recording; stratified 10-fold cross-validation; temporal modulation; Accuracy; Covariance matrix; Dictionaries; Modulation; Sparse matrices; Training; Vectors; auditory temporal modulations; music genre classification; sparse representations;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location
Istanbul
ISSN
1051-4651
Print_ISBN
978-1-4244-7542-1
Type
conf
DOI
10.1109/ICPR.2010.207
Filename
5596055
Link To Document