Title :
Analyzing high-dimensional data by subspace validity
Author :
Amir, Amihood ; Kashi, Reuven ; Netanyahu, Nathan S. ; Keim, Daniel ; Wawryniuk, Markus
Author_Institution :
Dept. of Comput. Sci., Bar-Ilan Univ., Ramat-Gan, Israel
Abstract :
We are proposing a novel method that makes it possible to analyze high-dimensional data with arbitrary shaped projected clusters and high noise levels. At the core of our method lies the idea of subspace validity. We map the data in a way that allows us to test the quality of subspaces using statistical tests. Experimental results, both on synthetic and real data sets, demonstrate the potential of our method.
Keywords :
feature extraction; image segmentation; statistical testing; visual databases; arbitrary shaped projected clusters; high-dimensional data analysis; noise levels; real data sets; statistical tests; subspace validity; Automation; Clustering algorithms; Computer science; Data analysis; Humans; Information analysis; Noise level; Space technology; Testing; Topology;
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1250955