• DocumentCode
    2370022
  • Title

    Cost-sensitive learning by cost-proportionate example weighting

  • Author

    Zadrozny, Bianca ; Langford, John ; Abe, Naoki

  • Author_Institution
    Dept. of Math. Sci., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2003
  • fDate
    19-22 Nov. 2003
  • Firstpage
    435
  • Lastpage
    442
  • Abstract
    We propose and evaluate a family of methods for converting classifier learning algorithms and classification theory into cost-sensitive algorithms and theory. The proposed conversion is based on cost-proportionate weighting of the training examples, which can be realized either by feeding the weights to the classification algorithm (as often done in boosting), or by careful subsampling. We give some theoretical performance guarantees on the proposed methods, as well as empirical evidence that they are practical alternatives to existing approaches. In particular, we propose costing, a method based on cost-proportionate rejection sampling and ensemble aggregation, which achieves excellent predictive performance on two publicly available datasets, while drastically reducing the computation required by other methods.
  • Keywords
    cost-benefit analysis; learning (artificial intelligence); pattern classification; sampling methods; support vector machines; classification algorithm; cost-proportionate example weighting; cost-proportionate rejection sampling; cost-sensitive learning algorithms; support vector machines; Boosting; Classification algorithms; Computer crime; Costing; Costs; Data mining; High performance computing; Machine learning; Medical diagnosis; Sampling methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
  • Print_ISBN
    0-7695-1978-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2003.1250950
  • Filename
    1250950