Title of article :
Not the First Digit! Using Benford’s Law to Detect Fraudulent Scientific Data
Author/Authors :
Andreas Diekmann (author)
Issue Information :
Journal with serial issue number, year 2007
Pages :
9
From page :
321
To page :
329
Abstract :
Digits in statistical data produced by natural or social processes are often distributed in a manner described by ‘Benford’s law’. Recently, a test against this distribution was used to identify fraudulent accounting data. This test is based on the supposition that first, second, third, and other digits in real data follow the Benford distribution while the digits in fabricated data do not. Is it possible to apply Benford tests to detect fabricated or falsified scientific data as well as fraudulent financial data? We approached this question in two ways. First, we examined the use of the Benford distribution as a standard by checking the frequencies of the nine possible first and ten possible second digits in published statistical estimates. Second, we conducted experiments in which subjects were asked to fabricate statistical estimates (regression coefficients). The digits in these experimental data were scrutinized for possible deviations from the Benford distribution. There were two main findings. First, both digits of the published regression coefficients were approximately Benford distributed or at least followed a pattern of monotonic decline. Second, the experimental results yielded new insights into the strengths and weaknesses of Benford tests. Surprisingly, first digits of faked data also exhibited a pattern of monotonic decline, while second, third, and fourth digits were distributed less in accordance with Benford’s law. At least in the case of regression coefficients, there were indications that checks for digit-preference anomalies should focus less on the first (i.e. leftmost) and more on later digits.
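The digit checks described in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the author's procedure: the Benford probabilities for the first and second significant digits are the standard ones, but the chi-square statistic used here as the goodness-of-fit measure is an assumption, and the function names (`benford_first`, `benford_second`, `significant_digit`, `chi_square`) are hypothetical.

```python
import math
from collections import Counter
from decimal import Decimal


def benford_first():
    """Benford probabilities for the first significant digit (1..9)."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def benford_second():
    """Benford probabilities for the second significant digit (0..9)."""
    return {
        d2: sum(math.log10(1 + 1 / (10 * d1 + d2)) for d1 in range(1, 10))
        for d2 in range(10)
    }


def significant_digit(value, position):
    """Return the significant digit at `position` (1 = leftmost non-zero digit)."""
    # Decimal(repr(x)) keeps the shortest decimal form of a float,
    # so 0.0342 yields the digit tuple (3, 4, 2) without leading zeros.
    digits = Decimal(repr(value)).as_tuple().digits
    if not any(digits):
        raise ValueError("zero, NaN, and infinity have no significant digits")
    # Positions past the stored digits are implied trailing zeros.
    return digits[position - 1] if position <= len(digits) else 0


def chi_square(values, position):
    """Chi-square statistic of observed digits against Benford (position 1 or 2).

    Assumption: chi-square is used here only as one common goodness-of-fit
    measure; the source does not say which test statistic was applied.
    """
    if position not in (1, 2):
        raise ValueError("only the first and second digits are supported")
    expected = benford_first() if position == 1 else benford_second()
    counts = Counter(significant_digit(v, position) for v in values)
    n = len(values)
    return sum((counts.get(d, 0) - n * p) ** 2 / (n * p) for d, p in expected.items())
```

Under this sketch, a large statistic for the second digit alongside a small one for the first digit would match the pattern the abstract reports for fabricated coefficients. The statistic would be compared against a chi-square distribution with 8 degrees of freedom for first digits and 9 for second digits.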
Keywords :
Digital analysis, data fabrication, first digit law, distribution of digits from regression coefficients, Benford
Journal title :
JOURNAL OF APPLIED STATISTICS
Serial Year :
2007
Record number :
712114