DocumentCode
384285
Title
On machine learning, ROC analysis, and statistical tests of significance
Author
Maloof, Marcus A.
Author_Institution
Dept. of Comput. Sci., Georgetown Univ., Washington, DC, USA
Volume
2
fYear
2002
fDate
2002
Firstpage
204
Abstract
Receiver operating characteristic (ROC) analysis is being used with greater frequency as an evaluation methodology in machine learning and pattern recognition. Researchers have used ANOVA to determine if the results from such analysis are statistically significant. Yet, in the medical decision making community, the prevailing method is LABMRMC. Although this latter method uses ANOVA, before doing so, it applies the Jackknife method to account for case-sample variance. To determine whether these two tests make the same decisions regarding statistical significance, we conducted a Monte Carlo simulation using several problems derived from Gaussian distributions, three machine-learning algorithms, ROC analysis, ANOVA, and LABMRMC. Results suggest that the decisions these tests make are not the same, even for simple problems. Furthermore, the larger issue is that since ANOVA does not account for case-sample variance, one cannot generalize experimental results to the population from which the data were drawn.
Keywords
Gaussian distribution; Monte Carlo methods; decision making; learning (artificial intelligence); medical expert systems; sensitivity analysis; ANOVA; Gaussian distributions; Jackknife method; LABMRMC; Monte Carlo simulation; ROC analysis; case-sample variance; machine learning; machine-learning algorithms; medical decision making community; pattern recognition; receiver operating characteristic analysis; statistical significance; statistical tests of significance; Algorithm design and analysis; Analysis of variance; Computer science; Decision making; Humans; Learning systems; Machine learning; Monte Carlo methods; Performance analysis; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2002. Proceedings. 16th International Conference on
ISSN
1051-4651
Print_ISBN
0-7695-1695-X
Type
conf
DOI
10.1109/ICPR.2002.1048273
Filename
1048273
Link To Document