DocumentCode :
1739901
Title :
A multi-mechanism rule-extraction pipeline for use on unannotated datasets
Author :
Goh, Alwyn ; Meng, Hoe Kok
Author_Institution :
Sch. of Comput. Sci., Univ. Sains Malaysia, Penang, Malaysia
Volume :
1
fYear :
2000
fDate :
2000
Firstpage :
382
Abstract :
We outline a hybrid methodology (incorporating both supervised and unsupervised-learning components) for rule-based knowledge discovery from unannotated data i.e. when the classification information is unknown. The motivation for our work stems from the individual effectiveness of various data mining mechanisms i.e.: (1) class identification via unsupervised datavector cluster formation, (2) datavector simplification and feature selection via attribute discretisation, and (3) symbolic rule extraction via the association of symbolic rules with the structural parameters of a trained neural network (NN). The basic operational concept involves the pipelined application of various unsupervised and supervised mechanisms i.e.: (1) k-means, (2) Chi-2, (3) local cluster (LC) network training, and (4) rule extraction from a trained LC network. The methodology will be tested and analysed using several well-known datasets
Keywords :
data mining; knowledge representation; neural nets; pattern clustering; statistical analysis; unsupervised learning; Chi-2; attribute discretisation; class identification; data mining; datavector simplification; feature selection; k-means; knowledge representation; local cluster; multi-mechanism; network training; rule extraction; rule-based knowledge discovery; rule-extraction pipeline; supervised learning; symbolic rule extraction; trained LC network; trained neural network; unannotated data; unannotated datasets; unsupervised-learning; Artificial intelligence; Clustering algorithms; Data analysis; Data mining; Electronic mail; Neural networks; Pipelines; Structural engineering; Testing; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
TENCON 2000. Proceedings
Conference_Location :
Kuala Lumpur
Print_ISBN :
0-7803-6355-8
Type :
conf
DOI :
10.1109/TENCON.2000.893694
Filename :
893694
Link To Document :
بازگشت