DocumentCode
2006043
Title
Calibrating Random Forests
Author
Bostrom, Henrik
Author_Institution
Inf. Res. Centre, Univ. of Skovde, Skovde
fYear
2008
fDate
11-13 Dec. 2008
Firstpage
121
Lastpage
126
Abstract
When using the output of classifiers to calculate the expected utility of different alternatives in decision situations, the correctness of predicted class probabilities may be of crucial importance. However, even very accurate classifiers may output class probabilities of rather poor quality. One way of overcoming this problem is by means of calibration, i.e., mapping the original class probabilities to more accurate ones. Previous studies have however indicated that random forests are difficult to calibrate by standard calibration methods. In this work, a novel calibration method is introduced, which is based on a recent finding that probabilities predicted by forests of classification trees have a lower squared error compared to those predicted by forests of probability estimation trees (PETs). The novel calibration method is compared to the two standard methods, Platt scaling and isotonic regression, on 34 datasets from the UCI repository. The experiment shows that random forests of PETs calibrated by the novel method significantly outperform uncalibrated random forests of both PETs and classification trees, as well as random forests calibrated with the two standard methods, with respect to the squared error of predicted class probabilities.
Keywords
calibration; learning (artificial intelligence); probability; trees (mathematics); class probability prediction; probability estimation trees; random forests; square error methods; standard calibration methods; Area measurement; Calibration; Classification tree analysis; Decision trees; Informatics; Machine learning; Positron emission tomography; Probability distribution; Regression tree analysis; Utility theory; Brier score; calibration; probability-estimation trees; random forests;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
Conference_Location
San Diego, CA
Print_ISBN
978-0-7695-3495-4
Type
conf
DOI
10.1109/ICMLA.2008.107
Filename
4724964
Link To Document