DocumentCode
408369
Title
Building decision trees using functional dependencies
Author
Lam, Kwok-wa ; Lee, Victor C S
Author_Institution
Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon, China
Volume
2
fYear
2004
fDate
5-7 April 2004
Firstpage
470
Abstract
Decision tree (DT) induction is regarded as a representative of traditional approaches to classification rule mining which is an important technique for many data mining applications. Using a heuristic-based local search, DT induction appends attribute at a time to rules in the order of goodness. This method may eliminate some typical structures that several attributes collectively determine the class. Recently, there has been growing interest in the problem of discovering functional dependencies (FDs) from existing databases [[Flach et al.], [Y. Huhtala et al., (1999)], [Lopes et al.], [Novelli et al.]]. Some efficient and scalable algorithms have been proposed. In this paper, we present a new method to build a DT classifier using approximate FDs [Y. Huhtala et al., (1999)]. The new method is different from the traditional ways of building DTs in that it searches composite attributes for individual node of a DT which leads to substantially smaller and more understandable DTs without adversely affecting the accuracy gains. Experiments showed that the new method not only builds more accurate classifiers, but also does this with more compact structures.
Keywords
data mining; decision trees; pattern classification; relational databases; search problems; classification rule mining; data mining; decision tree; functional dependency; heuristic-based local search; relational database; Application software; Buildings; Classification tree analysis; Computer science; Data mining; Data warehouses; Decision trees; Information technology; Relational databases; Tree data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN
0-7695-2108-8
Type
conf
DOI
10.1109/ITCC.2004.1286698
Filename
1286698
Link To Document