• DocumentCode
    2483063
  • Title

    Ensemble Discriminant Sparse Projections Applied to Music Genre Classification

  • Author

    Kotropoulos, Constantine ; Arce, Gonzalo R. ; Panagakis, Yannis

  • Author_Institution
    Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki, Greece
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    822
  • Lastpage
    825
  • Abstract
    Resorting to the rich, psycho-physiologically grounded, properties of the slow temporal modulations of music recordings, a novel classifier ensemble is built, which applies discriminant sparse projections. More specifically, over complete dictionaries are learned and sparse coefficient vectors are extracted to optimally approximate the slow temporal modulations of the training music recordings. The sparse coefficient vectors are then projected to the principal subspaces of their within-class and between-class covariance matrices. Decisions are taken with respect to the minimum Euclidean distance from the class mean sparse coefficient vectors, which undergo the aforementioned projections. The application of majority voting to the decisions taken by 10 individual classifiers, which are trained on the 10 training folds defined by stratified 10-fold cross-validation on the GTZAN dataset, yields a music genre classification accuracy of 84.96% on average. The latter exceeds by 2.46% the highest accuracy previously reported without employing any sparse representations.
  • Keywords
    audio signal processing; covariance matrices; modulation; music; signal classification; GTZAN dataset; class mean sparse coefficient vectors; classifier ensemble; covariance matrices; ensemble discriminant sparse projection; minimum Euclidean distance; music genre classification; music recording; stratified 10-fold cross-validation; temporal modulation; Accuracy; Covariance matrix; Dictionaries; Modulation; Sparse matrices; Training; Vectors; auditory temporal modulations; music genre classification; sparse representations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.207
  • Filename
    5596055