• DocumentCode
    384285
  • Title

    On machine learning, ROC analysis, and statistical tests of significance

  • Author

    Maloof, Marcus A.

  • Author_Institution
    Dept. of Comput. Sci., Georgetown Univ., Washington, DC, USA
  • Volume
    2
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    204
  • Abstract
    Receiver operating characteristic (ROC) analysis is being used with greater frequency as an evaluation methodology in machine learning and pattern recognition. Researchers have used ANOVA to determine if the results from such analysis are statistically significant. Yet, in the medical decision making community, the prevailing method is LABMRMC. Although this latter method uses ANOVA, before doing so, it applies the Jackknife method to account for case-sample variance. To determine whether these two tests make the same decisions regarding statistical significance, we conducted a Monte Carlo simulation using several problems derived from Gaussian distributions, three machine-learning algorithms, ROC analysis, ANOVA, and LABMRMC. Results suggest that the decisions these tests make are not the same, even for simple problems. Furthermore, the larger issue is that since ANOVA does not account for case-sample variance, one cannot generalize experimental results to the population from which the data were drawn.
  • Keywords
    Gaussian distribution; Monte Carlo methods; decision making; learning (artificial intelligence); medical expert systems; sensitivity analysis; ANOVA; Gaussian distributions; Jackknife method; LABMRMC; Monte Carlo simulation; ROC analysis; case-sample variance; machine learning; machine-learning algorithms; medical decision making community; pattern recognition; receiver operating characteristic analysis; statistical significance; statistical tests of significance; Algorithm design and analysis; Analysis of variance; Computer science; Decision making; Humans; Learning systems; Machine learning; Monte Carlo methods; Performance analysis; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2002. Proceedings. 16th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-1695-X
  • Type

    conf

  • DOI
    10.1109/ICPR.2002.1048273
  • Filename
    1048273