• DocumentCode
    974740
  • Title

    Constrained cascade generalization of decision trees

  • Author

    Zhao, Huimin ; Ram, Sudha

  • Author_Institution
    Sch. of Bus. Adm., Wisconsin Univ., Milwaukee, WI, USA
  • Volume
    16
  • Issue
    6
  • fYear
    2004
  • fDate
    6/1/2004 12:00:00 AM
  • Firstpage
    727
  • Lastpage
    739
  • Abstract
    While decision tree techniques have been widely used in classification applications, a shortcoming of many decision tree inducers is that they do not learn intermediate concepts, i.e., at each node, only one of the original features is involved in the branching decision. Combining other classification methods, which learn intermediate concepts, with decision tree inducers can produce more flexible decision boundaries that separate different classes, potentially improving classification accuracy. We propose a generic algorithm for cascade generalization of decision tree inducers with the maximum cascading depth as a parameter to constrain the degree of cascading. Cascading methods proposed in the past, i.e., loose coupling and tight coupling, are strictly special cases of this new algorithm. We have empirically evaluated the proposed algorithm using logistic regression and C4.5 as base inducers on 32 UCI data sets and found that neither loose coupling nor tight coupling is always the best cascading strategy and that the maximum cascading depth in the proposed algorithm can be tuned for better classification accuracy. We have also empirically compared the proposed algorithm and ensemble methods such as bagging and boosting and found that the proposed algorithm performs marginally better than bagging and boosting on the average.
  • Keywords
    data mining; decision trees; generalisation (artificial intelligence); learning (artificial intelligence); pattern classification; regression analysis; classification methods; constrained cascade generalization; data mining; decision trees; flexible decision boundaries; logistic regression; machine learning; Bagging; Boosting; Classification tree analysis; Data mining; Decision trees; Logic testing; Logistics; Machine learning algorithms; Predictive models; Sequential analysis; 65; Machine learning; cascade generalization.; classification; data mining; decision tree;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2004.3
  • Filename
    1294893