• DocumentCode
    2864997
  • Title

    Ranking-based evaluation of regression models

  • Author

    Rosset, Saharon ; Perlich, Claudia ; Zadrozny, Bianca

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2005
  • fDate
    27-30 Nov. 2005
  • Abstract
    We suggest the use of ranking-based evaluation measures for regression models, as a complement to the commonly used residual-based evaluation. We argue that in some cases, such as the case study we present, ranking can be the main underlying goal in building a regression model, and ranking performance is the correct evaluation metric. However, even when ranking is not the contextually correct performance metric, the measures we explore still have significant advantages: They are robust against extreme outliers in the evaluation set; and they are interpretable. The two measures we consider correspond closely to non-parametric correlation coefficients commonly used in data analysis (Spearman\´s p and Kendall\´s r); and they both have interesting graphical representations, which, similarly to ROC curves, offer useful "partial" model performance views, in addition to a one-number summary in the area under the curve. We illustrate our methods on a case study of evaluating IT wallet size estimation models for IBM\´s customers.
  • Keywords
    correlation methods; data analysis; regression analysis; data analysis; evaluation metric; graphical representation; model performance views; nonparametric correlation coefficients; ranking performance; ranking-based evaluation; regression model; residual-based evaluation; Area measurement; Costs; Data analysis; Data mining; Marketing and sales; Pediatrics; Predictive models; Robustness; Statistics; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, Fifth IEEE International Conference on
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2278-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2005.126
  • Filename
    1565701