Title :
UnTangle: Visual Mining for Data with Uncertain Multi-labels via Triangle Map
Author :
Yu-Ru Lin ; Cao, Nianxia ; Gotz, David ; Lu Lu
Author_Institution :
Univ. of Pittsburgh, Pittsburgh, PA, USA
Abstract :
Data with multiple uncertain labels are common in many situations. For examples, a movie may be associated with multiple genres with different levels of confidence, and a protein sequence may be probabilistically assigned to several structural subcategories. Despite their ubiquity, the problem of visualizing uncertain labels has not been adequately addressed. Existing approaches often either discard the uncertainty information, or map the data to a low-dimensional subspace where their associations with multiple labels are obscured. In this paper, we propose a novel visual mining technique, UnTangle, for visualizing uncertain multi-labels. In our proposed visualization, data items are placed inside a web of connected triangles, with labels assigned to the triangle vertices such that nearby labels are more relevant to each other. The positions of the data items are determined based on the probabilistic associations between items and labels. UnTangle provides both (a) an automatic label placement algorithm, and (b) adaptive interaction mechanisms that allow users to control the label positioning for different visual queries. Our work makes a unique contribution by providing an effective way to investigate the relationship between data items and their uncertain labels, as well as the relationships among labels. Our user study suggests that the visualization effectively helps users discover emergent patterns and compare the nuances of uncertainty information in the data labels.
Keywords :
data mining; data visualisation; UnTangle; adaptive interaction mechanisms; automatic label placement algorithm; data items; data mining; emergent pattern discovery; label positioning control; low-dimensional subspace; multiple genres; multiple uncertain labels; probabilistic associations; protein sequence; triangle map; uncertain multilabel visualization; uncertain multilabels; uncertainty information; visual mining; visual queries; visualizing uncertain labels; Data mining; Data visualization; Distributed databases; Motion pictures; Probabilistic logic; Uncertainty; Visualization; multi-labels; probablistic labels; ternary plot; uncertainty data; visual mining;
Conference_Titel :
Data Mining (ICDM), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4303-6
DOI :
10.1109/ICDM.2014.24