DocumentCode
2806843
Title
Music genre classification via Topology Preserving Non-Negative Tensor Factorization and sparse representations
Author
Panagakis, Yannis ; Kotropoulos, Constantine
Author_Institution
Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki, Greece
fYear
2010
fDate
14-19 March 2010
Firstpage
249
Lastpage
252
Abstract
Motivated by the rich, psycho-physiologically grounded properties of auditory cortical representations and the power of sparse representation-based classifiers, we propose a robust music genre classification framework. Its first pilar is a novel multilinear subspace analysis method that reduces the dimensionality of cortical representations of music signals, while preserving the topology of the cortical representations. Its second pilar is the sparse representation based classification, that models any test cortical representation as a sparse weighted sum of dictionary atoms, which stem from training cortical representations of known genre, by assuming that the representations of music recordings of the same genre are close enough in the tensor space they lie. Accordingly, the dimensionality reduction is made in a compatible manner to the working principle of the sparse-representation based classification. Music genre classification accuracy of 93.7% and 94.93% is reported on the GTZAN and the ISMIR2004 Genre datasets, respectively. Both accuracies outperform any accuracy ever reported for state of the art music genre classification algorithms applied to the aforementioned datasets.
Keywords
matrix decomposition; music; signal classification; tensors; auditory cortical representation; dimensionality reduction; multilinear subspace analysis method; music genre classification; sparse representation-based classification; topology preserving nonnegative tensor factorization; Dictionaries; Feature extraction; Informatics; Multiple signal classification; Psychology; Robustness; Signal analysis; Tensile stress; Testing; Topology; Music genre classification; non-negative tensor factorization; sparse representations; topology preserving;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495984
Filename
5495984
Link To Document