Title :
Quantifying and Comparing Features in High-Dimensional Datasets
Author :
Piringer, Harald ; Berger, Wolfgang ; Hauser, Helwig
Author_Institution :
VRVis Res. Center, Vienna
Abstract :
Linking and brushing is a proven approach to analyzing multi-dimensional datasets in the context of multiple coordinated views. Nevertheless, most of the respective visualization techniques only offer qualitative visual results. Many user tasks, however, also require precise quantitative results as, for example, offered by statistical analysis. In succession of the useful Rank-by-Feature Framework, this paper describes a joint visual and statistical approach for guiding the user through a high-dimensional dataset by ranking dimensions (1D case) and pairs of dimensions (2D case) according to statistical summaries. While the original Rank-by-Feature Framework is limited to global features, the most important novelty here is the concept to consider local features, i.e., data subsets defined by brushing in linked views. The ability to compare subsets to other subsets and subsets to the whole dataset in the context of a large number of dimensions significantly extends the benefits of the approach especially in later stages of an exploratory data analysis. A case study illustrates the workflow by analyzing counts of keywords for classifying e-mails as spam or no-spam.
Keywords :
data analysis; data visualisation; feature extraction; dimension ranking; exploratory data analysis; feature comparison; feature quantification; high-dimensional dataset; multidimensional dataset analysis; multiple coordinated views; rank-by-feature framework; statistical analysis; statistical summaries; visual approach; Data analysis; Data mining; Data visualization; Decision making; Electronic mail; Informatics; Information analysis; Joining processes; Statistical analysis; Statistics; High Dimensionality; Local Features; Ranking; Statistics;
Conference_Titel :
Information Visualisation, 2008. IV '08. 12th International Conference
Conference_Location :
London
Print_ISBN :
978-0-7695-3268-4