DocumentCode :
3028547
Title :
CompactLEM2: A scalable rough set based knowledge acquisition method that generates small number of short rules
Author :
Liu, Yang ; Bai, Guohua ; Feng, BoQin
Author_Institution :
Dept. of Comput. Sci. & Technol., Xi´´an Jiaotong Univ., Xi´´an
fYear :
2008
fDate :
14-16 Aug. 2008
Firstpage :
215
Lastpage :
222
Abstract :
The complexity of knowledge plays an important role in the success of any types of knowledge acquisition algorithms performing on large-scale database. LERS (learning from examples based on rough sets) system is a rule based knowledge acquisition system that is characterized by excellent accuracy, but the complexity of generated rule set is not taken into account. This may cause interpretation problems for human and the classification knowledge may over fit training data. In this paper, CompactLEM2 is proposed as a scalable knowledge acquisition method that extracts rule set with easily understood rule forms, i.e., small size of rule set and short rule forms, without sacrificing classification accuracy. The main advantage of CompactLEM2 is its high efficiency. It can also produce compact rule set that fully or approximately describes classifications of given examples. We theoretically and experimentally show that CompactLEM2 exhibits log-linear asymptotic complexity with the number of training examples in most cases. We also present an example to illustrate characteristics of this algorithm. Finally, the capabilities of our method are demonstrated on eleven datasets. Experimental results are encouraging, and show that the length of extracted rule forms are short, and size of rule set is small, keeping the same level of classification accuracy of other rule acquisition methods in LERS system.
Keywords :
computational complexity; knowledge acquisition; learning by example; pattern classification; rough set theory; CompactLEM2; LERS system; large-scale database; pattern classification; rough set theory; rule based knowledge acquisition algorithm; short rule generation; Computer science; Data engineering; Data mining; Databases; Knowledge acquisition; Knowledge engineering; Large-scale systems; Rough sets; Set theory; Training data; Knowledge acquisition; LERS data mining system; classification; rough set; rule induction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cognitive Informatics, 2008. ICCI 2008. 7th IEEE International Conference on
Conference_Location :
Stanford, CA
Print_ISBN :
978-1-4244-2538-9
Type :
conf
DOI :
10.1109/COGINF.2008.4639171
Filename :
4639171
Link To Document :
بازگشت