Title :
Constrained cascade generalization of decision trees
Author :
Zhao, Huimin ; Ram, Sudha
Author_Institution :
Sch. of Bus. Adm., Wisconsin Univ., Milwaukee, WI, USA
fDate :
6/1/2004 12:00:00 AM
Abstract :
While decision tree techniques have been widely used in classification applications, a shortcoming of many decision tree inducers is that they do not learn intermediate concepts, i.e., at each node, only one of the original features is involved in the branching decision. Combining other classification methods, which learn intermediate concepts, with decision tree inducers can produce more flexible decision boundaries that separate different classes, potentially improving classification accuracy. We propose a generic algorithm for cascade generalization of decision tree inducers with the maximum cascading depth as a parameter to constrain the degree of cascading. Cascading methods proposed in the past, i.e., loose coupling and tight coupling, are strictly special cases of this new algorithm. We have empirically evaluated the proposed algorithm using logistic regression and C4.5 as base inducers on 32 UCI data sets and found that neither loose coupling nor tight coupling is always the best cascading strategy and that the maximum cascading depth in the proposed algorithm can be tuned for better classification accuracy. We have also empirically compared the proposed algorithm and ensemble methods such as bagging and boosting and found that the proposed algorithm performs marginally better than bagging and boosting on the average.
Keywords :
data mining; decision trees; generalisation (artificial intelligence); learning (artificial intelligence); pattern classification; regression analysis; classification methods; constrained cascade generalization; data mining; decision trees; flexible decision boundaries; logistic regression; machine learning; Bagging; Boosting; Classification tree analysis; Data mining; Decision trees; Logic testing; Logistics; Machine learning algorithms; Predictive models; Sequential analysis; 65; Machine learning; cascade generalization.; classification; data mining; decision tree;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2004.3