DocumentCode
3199199
Title
Analyzing the impact of data vectorization on distance relations
Author
Stober, Sebastian ; Nürnberger, Andreas
Author_Institution
Data & Knowledge Eng. Group, Otto-von-Guericke-Univ. Magdeburg, Magdeburg, Germany
fYear
2011
fDate
11-15 July 2011
Firstpage
1
Lastpage
6
Abstract
Some popular algorithms used in Music Information Retrieval (MIR) such as Self-Organizing Maps (SOMs) require the objects they process to be represented as vectors, i.e. elements of a vector space. This is a rather severe restriction and if the data does not adhere to it, some means of vectorization is required. As a common practice, the full distance matrix is computed and each row of the matrix interpreted as an artificial feature vector. This paper empirically investigates the impact of this transformation. Further, an alternative approach for vectorization based on Multidimensional Scaling is pro posed that is able to better preserve the actual distance relations of the objects which is essential for obtaining a good retrieval performance.
Keywords
information retrieval; matrix algebra; music; self-organising feature maps; statistical analysis; artificial feature vector; data vectorization impact analysis; distance relations; full distance matrix; multidimensional scaling; music information retrieval; selforganizing maps; Eigenvalues and eigenfunctions; Equations; Euclidean distance; Instruments; Self organizing feature maps; Training; Aggregation; Distance Measures; Facets; Multidimensional Scaling; Vectorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo (ICME), 2011 IEEE International Conference on
Conference_Location
Barcelona
ISSN
1945-7871
Print_ISBN
978-1-61284-348-3
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2011.6012134
Filename
6012134
Link To Document