DocumentCode
2370022
Title
Cost-sensitive learning by cost-proportionate example weighting
Author
Zadrozny, Bianca ; Langford, John ; Abe, Naoki
Author_Institution
Dept. of Math. Sci., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
2003
fDate
19-22 Nov. 2003
Firstpage
435
Lastpage
442
Abstract
We propose and evaluate a family of methods for converting classifier learning algorithms and classification theory into cost-sensitive algorithms and theory. The proposed conversion is based on cost-proportionate weighting of the training examples, which can be realized either by feeding the weights to the classification algorithm (as often done in boosting), or by careful subsampling. We give some theoretical performance guarantees on the proposed methods, as well as empirical evidence that they are practical alternatives to existing approaches. In particular, we propose costing, a method based on cost-proportionate rejection sampling and ensemble aggregation, which achieves excellent predictive performance on two publicly available datasets, while drastically reducing the computation required by other methods.
Keywords
cost-benefit analysis; learning (artificial intelligence); pattern classification; sampling methods; support vector machines; classification algorithm; cost-proportionate example weighting; cost-proportionate rejection sampling; cost-sensitive learning algorithms; support vector machines; Boosting; Classification algorithms; Computer crime; Costing; Costs; Data mining; High performance computing; Machine learning; Medical diagnosis; Sampling methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN
0-7695-1978-4
Type
conf
DOI
10.1109/ICDM.2003.1250950
Filename
1250950
Link To Document