DocumentCode
2461234
Title
Learning non-overlapping rules A method based on Functional Dependency Network and MDL Genetic Programming
Author
Shum, Wing-Ho ; Leung, Kwong-Sak ; Wong, Man-Leung
Author_Institution
Chinese Univ. of Hong Kong, Hong Kong
fYear
0
fDate
0-0 0
Firstpage
702
Lastpage
709
Abstract
Classification rule is a useful model in data mining. Given variable values, rules classify data items into different classes. Different rule learning algorithms are proposed, like genetic algorithm (GA) and genetic programming (GP). Rules can also be extracted from Bayesian network (BN) and decision trees. However, all of them have disadvantages and may fail to get the best results. Both of GA and GP cannot handle cooperation among rules and thus, the learnt rules are likely to have many overlappings, i.e. more than one rules classify the same data items and different rules have different predictions. The conflicts among the rules reduce their understandability and increase their usage difficulty for expert systems. In contrast, rules extracted from BN and decision trees have no overlapping in nature. But BN can handle discrete values only and cannot represent higher-order relationships among variables. Moreover, the search space for decision tree learning is huge and thus, it is difficult to reach the global optimum. In this paper, we propose to use functional dependency network (FDN) and MDL genetic programming (MDLGP) to learn a set of non-overlapping classification rules. The FDN is an extension of BN; it can handle all kind of values; it can represent higher-order relationships among variables; and its learning search space is smaller than decision trees´. The experimental results demonstrate that the proposed method can successfully discover the target rules, which have no overlapping and have the highest classification accuracies
Keywords
belief networks; data mining; decision trees; genetic algorithms; learning (artificial intelligence); pattern classification; Bayesian network; MDL genetic programming; classification rule; data mining; decision tree learning; expert systems; functional dependency network; genetic algorithm; higher-order relationships; learning search space; nonoverlapping rules; rule learning algorithms; Bayesian methods; Computer networks; Computer science; Data engineering; Data mining; Decision trees; Expert systems; Genetic algorithms; Genetic engineering; Genetic programming;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation, 2006. CEC 2006. IEEE Congress on
Conference_Location
Vancouver, BC
Print_ISBN
0-7803-9487-9
Type
conf
DOI
10.1109/CEC.2006.1688380
Filename
1688380
Link To Document