• DocumentCode
    2806843
  • Title

    Music genre classification via Topology Preserving Non-Negative Tensor Factorization and sparse representations

  • Author

    Panagakis, Yannis ; Kotropoulos, Constantine

  • Author_Institution
    Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki, Greece
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    249
  • Lastpage
    252
  • Abstract
    Motivated by the rich, psycho-physiologically grounded properties of auditory cortical representations and the power of sparse representation-based classifiers, we propose a robust music genre classification framework. Its first pilar is a novel multilinear subspace analysis method that reduces the dimensionality of cortical representations of music signals, while preserving the topology of the cortical representations. Its second pilar is the sparse representation based classification, that models any test cortical representation as a sparse weighted sum of dictionary atoms, which stem from training cortical representations of known genre, by assuming that the representations of music recordings of the same genre are close enough in the tensor space they lie. Accordingly, the dimensionality reduction is made in a compatible manner to the working principle of the sparse-representation based classification. Music genre classification accuracy of 93.7% and 94.93% is reported on the GTZAN and the ISMIR2004 Genre datasets, respectively. Both accuracies outperform any accuracy ever reported for state of the art music genre classification algorithms applied to the aforementioned datasets.
  • Keywords
    matrix decomposition; music; signal classification; tensors; auditory cortical representation; dimensionality reduction; multilinear subspace analysis method; music genre classification; sparse representation-based classification; topology preserving nonnegative tensor factorization; Dictionaries; Feature extraction; Informatics; Multiple signal classification; Psychology; Robustness; Signal analysis; Tensile stress; Testing; Topology; Music genre classification; non-negative tensor factorization; sparse representations; topology preserving;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495984
  • Filename
    5495984