• DocumentCode
    671567
  • Title

    Incremental decision tree based on order statistics

  • Author

    Salperwyck, Christophe ; Lemaire, Vincent

  • Author_Institution
    Profiling & Datamining, Orange Labs., Lannion, France
  • fYear
    2013
  • fDate
    4-9 Aug. 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    New application domains generate data which are not persistent anymore but volatile: network management, web profile modeling... These data arrive quickly, massively and are visible just once. Thus they necessarily have to be learnt according to their arrival orders. For classification problems online decision trees are known to perform well and are widely used on streaming data. In this paper, we propose a new decision tree method based on order statistics. The construction of an online tree usually needs summaries in the leaves. Our solution uses bounded error quantiles summaries. A robust and performing discretization or grouping method uses these summaries to provide, at the same time, a criterion to find the best split and better density estimations. This estimation is then used to build a näıve Bayes classifier in the leaves to improve the prediction in the early learning stage.
  • Keywords
    Bayes methods; decision trees; learning (artificial intelligence); pattern classification; statistics; Web profile modeling; incremental decision tree; naive Bayes classifier; network management; online decision trees; order statistics; streaming data; Decision trees; Entropy; Estimation; Memory management; Numerical models; Robustness; Transforms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2013 International Joint Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4673-6128-6
  • Type

    conf

  • DOI
    10.1109/IJCNN.2013.6706907
  • Filename
    6706907