DocumentCode :
3372642
Title :
Mining knowledge rules from databases: a rough set approach
Author :
Hu, Xiaohua ; Cercone, Nick
Author_Institution :
Bell-Northern Res., Ottawa, Ont., Canada
fYear :
1996
fDate :
26 Feb-1 Mar 1996
Firstpage :
96
Lastpage :
105
Abstract :
The principle and experimental results of an attribute oriented rough set approach for knowledge discovery in databases are described. Our method integrates the database operation, rough set theory and machine learning techniques. In this method the learning procedure consists of two phases: data generalization and data reduction. In the data generalization phase, attribute oriented induction is performed attribute by attribute using attribute removal and concept ascension, some undesirable attributes to the discovery task are removed and the primitive data is generalized to the desirable level; thus a set of tuples may be generalized to the same generalized tuple. This procedure substantially reduces the computational complexity of the database learning process. Subsequently, in data reduction phase, the rough set method is applied to the generalized relation to find a minimal attribute set relevant to the learning task. The generalized relation is reduced further by removing those attributes which are irrelevant and/or unimportant to the learning task. Finally the tuples in the reduced relation are transformed into different knowledge rules based on different knowledge discovery algorithms. Based upon these principles, a prototype knowledge discovery system, DBROUGH has been constructed. In DBROUGH, a variety of knowledge discovery algorithms are incorporated and different kinds of knowledge rules, such as characteristic rules, classification rules, decision rules, maximal generalized rules can be discovered efficiently and effectively from large databases
Keywords :
computational complexity; database theory; deductive databases; knowledge acquisition; learning (artificial intelligence); set theory; DBROUGH; attribute oriented induction; attribute oriented rough set approach; attribute removal; characteristic rules; classification rules; computational complexity; concept ascension; data generalization; data reduction; database learning process; database operation; decision rules; generalized tuple; knowledge discovery algorithms; knowledge rule mining; knowledge rules; machine learning techniques; maximal generalized rules; minimal attribute set; prototype knowledge discovery system; Data mining; Databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 1996. Proceedings of the Twelfth International Conference on
Conference_Location :
New Orleans, LA
ISSN :
1063-6382
Print_ISBN :
0-8186-7240-4
Type :
conf
DOI :
10.1109/ICDE.1996.492093
Filename :
492093
Link To Document :
بازگشت