Title :
A Rank-by-Feature Framework for Unsupervised Multidimensional Data Exploration Using Low Dimensional Projections
Author :
Seo, Jinwook ; Shneiderman, Ben
Author_Institution :
Dept. of Comput. Sci., Maryland Univ., College Park, MD
Abstract :
Exploratory analysis of multidimensional data sets is challenging because of the difficulty in comprehending more than three dimensions. Two fundamental statistical principles for exploratory analysis are (1) to examine each dimension first and then find relationships among dimensions, and (2) to try graphical displays first and then find numerical summaries (D.S. Moore, 1999). We implement these principles in a novel conceptual framework called the rank-by-feature framework. In the framework, users can choose a ranking criterion that interests them and sort 1D or 2D axis-parallel projections according to that criterion. We introduce the rank-by-feature prism, a color-coded lower-triangular matrix that guides users to desired features. Statistical graphs (histogram, boxplot, and scatterplot) and information visualization techniques (overview, coordination, and dynamic query) are combined to help users effectively traverse 1D and 2D axis-parallel projections and, finally, to help them interactively find interesting features.
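The ranking step described in the abstract can be illustrated with a minimal sketch. It assumes Pearson correlation as the ranking criterion for 2D axis-parallel projections; the abstract does not name specific criteria, so that choice, the function names `pearson` and `rank_2d_projections`, and the toy data are all hypothetical and not taken from the paper.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # assumption: a constant dimension scores as uncorrelated
    return cov / sqrt(var_x * var_y)


def rank_2d_projections(columns, criterion=pearson):
    """Score every pair of dimensions and sort by descending absolute score.

    `columns` maps a dimension name to its list of values. Each pair
    corresponds to one cell of a lower-triangular matrix of projections.
    """
    scored = [
        (a, b, criterion(columns[a], columns[b]))
        for a, b in combinations(sorted(columns), 2)
    ]
    return sorted(scored, key=lambda item: abs(item[2]), reverse=True)


# Hypothetical toy data, not from the paper.
data = {
    "height": [1.0, 2.0, 3.0, 4.0],
    "weight": [2.0, 4.0, 6.0, 8.0],
    "noise": [1.0, -1.0, -1.0, 1.0],
}
ranking = rank_2d_projections(data)
for dim_a, dim_b, score in ranking:
    print(f"{dim_a} vs {dim_b}: {score:+.2f}")
```

Passing a different function as `criterion` swaps the ranking feature without changing the sorting logic, which mirrors the idea of letting users pick the criterion that interests them. A color scale applied to the scores would give a rough analogue of the color-coded matrix the abstract mentions.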
Keywords :
computational complexity; computational geometry; computer displays; data analysis; data mining; data visualisation; feature extraction; graph theory; interactive systems; statistical analysis; very large databases; axis-parallel projections; boxplot; color-coded lower-triangular matrix; dynamic query; exploratory data analysis; feature detection; feature selection; graphical displays; histogram; information visualization; rank-by-feature prism; scatterplot; statistical graphs; unsupervised multidimensional data exploration; Computer science; Computer vision; Data visualization; Displays; Educational institutions; Laboratories; Multidimensional systems; Principal component analysis; feature detection/selection; statistical graphics
Conference_Title :
Information Visualization, 2004. INFOVIS 2004. IEEE Symposium on
Conference_Location :
Austin, TX
Print_ISBN :
0-7803-8779-3
DOI :
10.1109/INFVIS.2004.3