Title :
Functional trees for classification
Author_Institution :
LIACC, Porto Univ., Portugal
Abstract :
The design of algorithms that explore multiple representation languages and explore different search spaces has an intuitive appeal. In the context of classification problems, algorithms that generate multivariate trees are able to explore multiple representation languages by using decision tests based on a combination of attributes. The same applies to model-tree algorithms in regression domains, but using linear models at leaf nodes. In this paper, we study where to use combinations of attributes in decision tree learning. We present an algorithm for multivariate tree learning that combines a univariate decision tree with a discriminant function by means of constructive induction. This algorithm is able to use decision nodes with multivariate tests, and leaf nodes that predict a class using a discriminant function. Multivariate decision nodes are built when growing the tree, while functional leaves are built when pruning the tree. Functional trees can be seen as a generalization of multivariate trees. Our algorithm was compared against to its components and two simplified versions using 30 benchmark data sets. The experimental evaluation shows that our algorithm has clear advantages with respect to the generalization ability and model sizes at statistically significant confidence levels
Keywords :
data mining; decision trees; generalisation (artificial intelligence); learning by example; pattern classification; tree searching; algorithm design; attribute combinations; benchmark data sets; classification; constructive induction; decision nodes; decision tests; decision tree learning; discriminant function; functional leaves; functional trees; generalization ability; leaf nodes; linear models; model sizes; model-tree algorithms; multiple representation languages; multivariate tests; multivariate tree generation; regression domains; search spaces; statistically significant confidence levels; tree pruning; univariate decision tree; Algorithm design and analysis; Classification tree analysis; Decision trees; Regression tree analysis; Space exploration; Supervised learning; Testing;
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989512