DocumentCode
1739901
Title
A multi-mechanism rule-extraction pipeline for use on unannotated datasets
Author
Goh, Alwyn ; Meng, Hoe Kok
Author_Institution
Sch. of Comput. Sci., Univ. Sains Malaysia, Penang, Malaysia
Volume
1
fYear
2000
fDate
2000
Firstpage
382
Abstract
We outline a hybrid methodology (incorporating both supervised and unsupervised-learning components) for rule-based knowledge discovery from unannotated data i.e. when the classification information is unknown. The motivation for our work stems from the individual effectiveness of various data mining mechanisms i.e.: (1) class identification via unsupervised datavector cluster formation, (2) datavector simplification and feature selection via attribute discretisation, and (3) symbolic rule extraction via the association of symbolic rules with the structural parameters of a trained neural network (NN). The basic operational concept involves the pipelined application of various unsupervised and supervised mechanisms i.e.: (1) k-means, (2) Chi-2, (3) local cluster (LC) network training, and (4) rule extraction from a trained LC network. The methodology will be tested and analysed using several well-known datasets
Keywords
data mining; knowledge representation; neural nets; pattern clustering; statistical analysis; unsupervised learning; Chi-2; attribute discretisation; class identification; data mining; datavector simplification; feature selection; k-means; knowledge representation; local cluster; multi-mechanism; network training; rule extraction; rule-based knowledge discovery; rule-extraction pipeline; supervised learning; symbolic rule extraction; trained LC network; trained neural network; unannotated data; unannotated datasets; unsupervised-learning; Artificial intelligence; Clustering algorithms; Data analysis; Data mining; Electronic mail; Neural networks; Pipelines; Structural engineering; Testing; Unsupervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2000. Proceedings
Conference_Location
Kuala Lumpur
Print_ISBN
0-7803-6355-8
Type
conf
DOI
10.1109/TENCON.2000.893694
Filename
893694
Link To Document