Title :
An Approach to Supporting Incremental Visual Data Classification
Author :
Paiva, Jose Gustavo S. ; Schwartz, William Robson ; Pedrini, Helio ; Minghim, Rosane
Author_Institution :
Fac. of Comput. Sci., Fed. Univ. of Uberlandia-UFU, Uberlandia, Brazil
Abstract :
Automatic data classification is a computationally intensive task that presents variable precision and is considerably sensitive to the classifier configuration and to data representation, particularly for evolving data sets. Some of these issues can best be handled by methods that support users´ control over the classification steps. In this paper, we propose a visual data classification methodology that supports users in tasks related to categorization such as training set selection; model creation, application and verification; and classifier tuning. The approach is then well suited for incremental classification, present in many applications with evolving data sets. Data set visualization is accomplished by means of point placement strategies, and we exemplify the method through multidimensional projections and Neighbor Joining trees. The same methodology can be employed by a user who wishes to create his or her own ground truth (or perspective) from a previously unlabeled data set. We validate the methodology through its application to categorization scenarios of image and text data sets, involving the creation, application, verification, and adjustment of classification models.
Keywords :
data visualisation; learning (artificial intelligence); pattern classification; application and verification; classification models; classifier configuration; classifier tuning; data representation; ground truth; incremental visual data classification; model creation; multidimensional projection; neighbor joining trees; training set selection; Computational modeling; Data models; Data visualization; Layout; Mathematical model; Training; Visualization; Visual image classification; information visualization; multidimensional point placement;
Journal_Title :
Visualization and Computer Graphics, IEEE Transactions on
DOI :
10.1109/TVCG.2014.2331979