• DocumentCode
    2851230
  • Title

    Improving the reliability of decision tree and naive Bayes learners

  • Author

    Lindsay, David ; Cox, Siân

  • Author_Institution
    Comput. Learning Res. Centre, London Univ., Egham, UK
  • fYear
    2004
  • fDate
    1-4 Nov. 2004
  • Firstpage
    459
  • Lastpage
    462
  • Abstract
    The C4.5 decision tree and naive Bayes learners are known to produce unreliable probability forecasts. We have used simple binning (Zadrozny and Elkan, 2001) and Laplace transform (Cestnik, 2001) techniques to improve the reliability of these learners and compare their effectiveness with that of the newly developed Venn probability machine (VPM) meta-learner (Vovk et al., 2003). We assess improvements in reliability using loss functions, receiver operator characteristic (ROC) curves and empirical reliability curves (ERC). The VPM outperforms the simple techniques to improve reliability, although at the cost of increased computational intensity and slight increase in error rate. These trade-offs are discussed.
  • Keywords
    Bayes methods; Laplace transforms; decision trees; forecasting theory; learning (artificial intelligence); probability; reliability theory; Laplace transform; Venn probability machine meta-learner; decision tree; empirical reliability curves; naive Bayes learners; receiver operator characteristic curves; simple binning; unreliable probability forecast; Biology computing; Computational efficiency; Decision trees; Error analysis; Frequency; Laplace equations; Machine learning; Pattern recognition; Supervised learning; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
  • Print_ISBN
    0-7695-2142-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2004.10037
  • Filename
    1410335