DocumentCode
21369
Title
Visualizing Statistical Mix Effects and Simpson's Paradox
Author
Armstrong, Zan ; Wattenberg, Martin
Volume
20
Issue
12
fYear
2014
fDate
Dec. 31 2014
Firstpage
2132
Lastpage
2141
Abstract
We discuss how “mix effects” can surprise users of visualizations and potentially lead them to incorrect conclusions. This statistical issue (also known as “omitted variable bias” or, in extreme cases, as “Simpson´s paradox”) is widespread and can affect any visualization in which the quantity of interest is an aggregated value such as a weighted sum or average. Our first contribution is to document how mix effects can be a serious issue for visualizations, and we analyze how mix effects can cause problems in a variety of popular visualization techniques, from bar charts to treemaps. Our second contribution is a new technique, the “comet chart,” that is meant to ameliorate some of these issues.
Keywords
data visualisation; statistical analysis; Simpson paradox; average value; bar charts; comet chart technique; data visualization; omitted variable bias; statistical mix effects; treemaps; visualization techniques; weighted sum value; Data visualization; Image color analysis; Image segmentation; Statistics; Mix effects; Omitted variable bias; Simpson´s paradox; Statistics;
fLanguage
English
Journal_Title
Visualization and Computer Graphics, IEEE Transactions on
Publisher
ieee
ISSN
1077-2626
Type
jour
DOI
10.1109/TVCG.2014.2346297
Filename
6875927
Link To Document