DocumentCode :
1364893
Title :
Two-Phase Mapping for Projecting Massive Data Sets
Author :
Paulovich, Fernando V. ; Silva, Cláudio T. ; Nonato, L. Gustavo
Author_Institution :
Univ. de Sao Paulo, São Carlos, Brazil
Volume :
16
Issue :
6
fYear :
2010
Firstpage :
1281
Lastpage :
1290
Abstract :
Most multidimensional projection techniques rely on distance (dissimilarity) information between data instances to embed high-dimensional data into a visual space. When data are endowed with Cartesian coordinates, an extra computational effort is necessary to compute the needed distances, making multidimensional projection prohibitive in applications dealing with interactivity and massive data. The novel multidimensional projection technique proposed in this work, called Part-Linear Multidimensional Projection (PLMP), has been tailored to handle multivariate data represented in Cartesian high-dimensional spaces, requiring only distance information between pairs of representative samples. This characteristic renders PLMP faster than previous methods when processing large data sets while still being competitive in terms of precision. Moreover, knowing the range of variation for data instances in the high-dimensional space, we can make PLMP a truly streaming data projection technique, a trait absent in previous methods.
Keywords :
data mining; data visualisation; rendering (computer graphics); Cartesian coordinates; massive data set projection; part-linear multidimensional projection; streaming data projection technique; two-phase mapping; visual data mining; Approximation methods; Complexity theory; Equations; Force; Principal component analysis; Stress; Visualization; Dimensionality Reduction; Projection Methods; Streaming Technique; Visual Data Mining;
fLanguage :
English
Journal_Title :
Visualization and Computer Graphics, IEEE Transactions on
Publisher :
ieee
ISSN :
1077-2626
Type :
jour
DOI :
10.1109/TVCG.2010.207
Filename :
5613468
Link To Document :
بازگشت