• DocumentCode
    2914236
  • Title

    A clustering-based decision tree induction algorithm

  • Author

    Barros, Rodrigo C. ; De Carvalho, André C P L F ; Basgalupp, Márcio P. ; Quiles, Marcos G.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Sao Paulo (USP), São Carlos, Brazil
  • fYear
    2011
  • fDate
    22-24 Nov. 2011
  • Firstpage
    543
  • Lastpage
    550
  • Abstract
    Decision tree induction algorithms are well known techniques for assigning objects to predefined categories in a transparent fashion. Most decision tree induction algorithms rely on a greedy top-down recursive strategy for growing the tree, and pruning techniques to avoid overfitting. Even though such a strategy has been quite successful in many problems, it falls short in several others. For instance, there are cases in which the hyper-rectangular surfaces generated by these algorithms can only map the problem description after several sub-sequential partitions, which results in a large and incomprehensible tree. Hence, we propose a new decision tree induction algorithm based on clustering which seeks to provide more accurate models and/or shorter descriptions more comprehensible for the end-user. We do not base our performance analysis solely on the straightforward comparison of our proposed algorithm to baseline methods. Instead, we propose a data-dependent analysis in order to look for evidences which may explain in which situations our algorithm outperforms a well-known decision tree induction algorithm.
  • Keywords
    data analysis; decision trees; learning (artificial intelligence); pattern clustering; clustering-based decision tree induction algorithm; data-dependent analysis; greedy top-down recursive strategy; hyper-rectangular surfaces; pruning techniques; transparent fashion; Accuracy; Algorithm design and analysis; Clustering algorithms; Complexity theory; Decision trees; Machine learning algorithms; Training; clustering; data-dependency analysis; decision trees; hybrid intelligent systems; machine learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
  • Conference_Location
    Cordoba
  • ISSN
    2164-7143
  • Print_ISBN
    978-1-4577-1676-8
  • Type

    conf

  • DOI
    10.1109/ISDA.2011.6121712
  • Filename
    6121712