DocumentCode :
2292106
Title :
Quantifying and Comparing Features in High-Dimensional Datasets
Author :
Piringer, Harald ; Berger, Wolfgang ; Hauser, Helwig
Author_Institution :
VRVis Res. Center, Vienna
fYear :
2008
fDate :
9-11 July 2008
Firstpage :
240
Lastpage :
245
Abstract :
Linking and brushing is a proven approach to analyzing multi-dimensional datasets in the context of multiple coordinated views. Nevertheless, most of the respective visualization techniques only offer qualitative visual results. Many user tasks, however, also require precise quantitative results as, for example, offered by statistical analysis. In succession of the useful Rank-by-Feature Framework, this paper describes a joint visual and statistical approach for guiding the user through a high-dimensional dataset by ranking dimensions (1D case) and pairs of dimensions (2D case) according to statistical summaries. While the original Rank-by-Feature Framework is limited to global features, the most important novelty here is the concept to consider local features, i.e., data subsets defined by brushing in linked views. The ability to compare subsets to other subsets and subsets to the whole dataset in the context of a large number of dimensions significantly extends the benefits of the approach especially in later stages of an exploratory data analysis. A case study illustrates the workflow by analyzing counts of keywords for classifying e-mails as spam or no-spam.
Keywords :
data analysis; data visualisation; feature extraction; dimension ranking; exploratory data analysis; feature comparison; feature quantification; high-dimensional dataset; multidimensional dataset analysis; multiple coordinated views; rank-by-feature framework; statistical analysis; statistical summaries; visual approach; Data analysis; Data mining; Data visualization; Decision making; Electronic mail; Informatics; Information analysis; Joining processes; Statistical analysis; Statistics; High Dimensionality; Local Features; Ranking; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Visualisation, 2008. IV '08. 12th International Conference
Conference_Location :
London
ISSN :
1550-6037
Print_ISBN :
978-0-7695-3268-4
Type :
conf
DOI :
10.1109/IV.2008.17
Filename :
4577954
Link To Document :
بازگشت