• DocumentCode
    21369
  • Title

    Visualizing Statistical Mix Effects and Simpson's Paradox

  • Author

    Armstrong, Zan ; Wattenberg, Martin

  • Volume
    20
  • Issue
    12
  • fYear
    2014
  • fDate
    Dec. 31 2014
  • Firstpage
    2132
  • Lastpage
    2141
  • Abstract
    We discuss how “mix effects” can surprise users of visualizations and potentially lead them to incorrect conclusions. This statistical issue (also known as “omitted variable bias” or, in extreme cases, as “Simpson´s paradox”) is widespread and can affect any visualization in which the quantity of interest is an aggregated value such as a weighted sum or average. Our first contribution is to document how mix effects can be a serious issue for visualizations, and we analyze how mix effects can cause problems in a variety of popular visualization techniques, from bar charts to treemaps. Our second contribution is a new technique, the “comet chart,” that is meant to ameliorate some of these issues.
  • Keywords
    data visualisation; statistical analysis; Simpson paradox; average value; bar charts; comet chart technique; data visualization; omitted variable bias; statistical mix effects; treemaps; visualization techniques; weighted sum value; Data visualization; Image color analysis; Image segmentation; Statistics; Mix effects; Omitted variable bias; Simpson´s paradox; Statistics;
  • fLanguage
    English
  • Journal_Title
    Visualization and Computer Graphics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1077-2626
  • Type

    jour

  • DOI
    10.1109/TVCG.2014.2346297
  • Filename
    6875927