• DocumentCode
    408369
  • Title

    Building decision trees using functional dependencies

  • Author

    Lam, Kwok-wa ; Lee, Victor C S

  • Author_Institution
    Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon, China
  • Volume
    2
  • fYear
    2004
  • fDate
    5-7 April 2004
  • Firstpage
    470
  • Abstract
    Decision tree (DT) induction is regarded as a representative of traditional approaches to classification rule mining which is an important technique for many data mining applications. Using a heuristic-based local search, DT induction appends attribute at a time to rules in the order of goodness. This method may eliminate some typical structures that several attributes collectively determine the class. Recently, there has been growing interest in the problem of discovering functional dependencies (FDs) from existing databases [[Flach et al.], [Y. Huhtala et al., (1999)], [Lopes et al.], [Novelli et al.]]. Some efficient and scalable algorithms have been proposed. In this paper, we present a new method to build a DT classifier using approximate FDs [Y. Huhtala et al., (1999)]. The new method is different from the traditional ways of building DTs in that it searches composite attributes for individual node of a DT which leads to substantially smaller and more understandable DTs without adversely affecting the accuracy gains. Experiments showed that the new method not only builds more accurate classifiers, but also does this with more compact structures.
  • Keywords
    data mining; decision trees; pattern classification; relational databases; search problems; classification rule mining; data mining; decision tree; functional dependency; heuristic-based local search; relational database; Application software; Buildings; Classification tree analysis; Computer science; Data mining; Data warehouses; Decision trees; Information technology; Relational databases; Tree data structures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
  • Print_ISBN
    0-7695-2108-8
  • Type

    conf

  • DOI
    10.1109/ITCC.2004.1286698
  • Filename
    1286698