• DocumentCode
    3164353
  • Title

    Conformal Prediction Using Decision Trees

  • Author

    Johansson, Ulf ; Bostrom, Henrik ; Lofstrom, Tuve

  • Author_Institution
    Sch. of Bus. & IT, Univ. of Boras, Boras, Sweden
  • fYear
    2013
  • fDate
    7-10 Dec. 2013
  • Firstpage
    330
  • Lastpage
    339
  • Abstract
    Conformal prediction is a relatively new framework in which the predictive models output sets of predictions with a bound on the error rate, i.e., in a classification context, the probability of excluding the correct class label is lower than a predefined significance level. An investigation of the use of decision trees within the conformal prediction framework is presented, with the overall purpose to determine the effect of different algorithmic choices, including split criterion, pruning scheme and way to calculate the probability estimates. Since the error rate is bounded by the framework, the most important property of conformal predictors is efficiency, which concerns minimizing the number of elements in the output prediction sets. Results from one of the largest empirical investigations to date within the conformal prediction framework are presented, showing that in order to optimize efficiency, the decision trees should be induced using no pruning and with smoothed probability estimates. The choice of split criterion to use for the actual induction of the trees did not turn out to have any major impact on the efficiency. Finally, the experimentation also showed that when using decision trees, standard inductive conformal prediction was as efficient as the recently suggested method cross-conformal prediction. This is an encouraging results since cross-conformal prediction uses several decision trees, thus sacrificing the interpretability of a single decision tree.
  • Keywords
    conformal mapping; decision trees; pattern classification; prediction theory; probability; class label; conformal prediction framework; cross-conformal prediction; decision trees; output prediction sets; predictive models; probability estimates; pruning scheme; split criterion; Accuracy; Calibration; Decision trees; Prediction algorithms; Predictive models; Probability; Standards; Conformal prediction; Decision trees;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2013 IEEE 13th International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2013.85
  • Filename
    6729517