• DocumentCode
    2514075
  • Title

    The Binormal Assumption on Precision-Recall Curves

  • Author

    Brodersen, Kay H. ; Ong, Cheng Soon ; Stephan, Klaas E. ; Buhmann, Joachim M.

  • Author_Institution
    Dept. of Comput. Sci., ETH Zurich, Zurich, Switzerland
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    4263
  • Lastpage
    4266
  • Abstract
    The precision-recall curve (PRC) has become a widespread conceptual basis for assessing classification performance. The curve relates the positive predictive value of a classifier to its true positive rate and often provides a useful alternative to the well-known receiver operating characteristic (ROC). The empirical PRC, however, turns out to be a highly imprecise estimate of the true curve, especially in the case of a small sample size and class imbalance in favour of negative examples. Ironically, this situation tends to occur precisely in those applications where the curve would be most useful, e.g., in anomaly detection or information retrieval. Here, we propose to estimate the PRC on the basis of a simple distributional assumption about the decision values that generalizes the established binormal model for estimating smooth ROC curves. Using simulations, we show that our approach outperforms empirical estimates, and that an account of the class imbalance is crucial for obtaining unbiased PRC estimates.
  • Keywords
    pattern classification; anomaly detection; binormal assumption; binormal model; classification performance; decision values; information retrieval; precision-recall curves; receiver operating characteristic; Accuracy; Computational modeling; Data models; Estimation; Mathematical model; Predictive models; Solid modeling; classification performance; false discovery rate; generalizability; information retrieval; receiver operating characteristic;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.1036
  • Filename
    5597760