Title :
Searching for effects in big data: Why p-values are not advised and what to use instead
Author :
Marko A. Hofmann
Author_Institution :
ITIS, University of the Federal Armed Forces Munich, Werner-Heisenberg-Weg 39, Neubiberg 85577, Germany
Abstract :
P-values of null hypothesis significance testing have long been the standard and decisive measure of deductive statistics. However, for decades, top statistical methodologists have argued that focusing on p-values is not conducive to science, and that these tests are regularly misunderstood. The standard replacement or at least complement proposed for p-values by those critics are confidence intervals and statistical effects sizes. Regrettably, analyzing and comparing huge data sets (from data mining or simulation based data farming) with two measures is awkward. As a single-value measure of first interpretation for the scanning of Big Data this article proposes statistically secured effect sizes either based on exact, mathematically sophisticated confidence intervals for effect sizes or simplified approximations. It is further argued that simplified secured effect sizes are among the most instructive single measures of statistical interpretation completely perspicuous for the layman.
Keywords :
"Standards","Data models","Big data","Sociology","Statistics","Cognition","Testing"
Conference_Titel :
Winter Simulation Conference (WSC), 2015
Electronic_ISBN :
1558-4305
DOI :
10.1109/WSC.2015.7408210