• DocumentCode
    1871691
  • Title

    Learning acyclic decision trees with Functional Dependency Network and MDL Genetic Programming

  • Author

    Shum, Wing-Ho ; Leung, Kwong-Sak ; Wong, Man-Leung

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong
  • fYear
    2006
  • fDate
    1-3 Aug. 2006
  • Firstpage
    25
  • Lastpage
    25
  • Abstract
    One objective of data mining is to discover parent-child relationships among a set of variables in the domain. Moreover, showing parents´ importance can further help to improve decision makings´ quality. Bayesian network (BN) is a useful model for multi-class problems and can illustrate parent-child relationships with no cycle. But it cannot show parents´ importance. In contrast, decision trees state parents´ importance clearly, for instance, the most important parent is put in the first level. However, decision trees are proposed for single-class problems only, when they are applied to multi-class ones, they are likely to produce cycles representing tautologic. In this paper, we propose to use MDL genetic programming (MDLGP) and functional dependency network (FDN) to learn a set of acyclic decision trees (Shum et al., 2005). The FDN is an extension of BN; it can handle all of discrete, continuous, interval and ordinal values; it guarantees to produce decision trees with no cycle; its learning search space is smaller than decision trees´; and it can represent higher-order relationships among variables. The MDLGP is a robust genetic programming (GP) proposed to learn the FDN. We also propose a method to derive acyclic decision trees from the FDN. The experimental results demonstrate that the proposed method can successfully discover the target decision trees, which have no cycle and have the accurate classification results
  • Keywords
    belief networks; data mining; decision trees; genetic algorithms; learning (artificial intelligence); MDL genetic programming; acyclic decision tree learning; data mining; functional dependency network; parent-child relationships; Asia; Bayesian methods; Classification tree analysis; Computer science; Data engineering; Decision making; Decision trees; Genetic programming; History; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing in the Global Information Technology, 2006. ICCGI '06. International Multi-Conference on
  • Conference_Location
    Bucharest
  • Print_ISBN
    0-7695-2690-X
  • Electronic_ISBN
    0-7695-2690-X
  • Type

    conf

  • DOI
    10.1109/ICCGI.2006.46
  • Filename
    4124044