• DocumentCode
    1739901
  • Title

    A multi-mechanism rule-extraction pipeline for use on unannotated datasets

  • Author

    Goh, Alwyn ; Meng, Hoe Kok

  • Author_Institution
    Sch. of Comput. Sci., Univ. Sains Malaysia, Penang, Malaysia
  • Volume
    1
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    382
  • Abstract
    We outline a hybrid methodology (incorporating both supervised and unsupervised-learning components) for rule-based knowledge discovery from unannotated data i.e. when the classification information is unknown. The motivation for our work stems from the individual effectiveness of various data mining mechanisms i.e.: (1) class identification via unsupervised datavector cluster formation, (2) datavector simplification and feature selection via attribute discretisation, and (3) symbolic rule extraction via the association of symbolic rules with the structural parameters of a trained neural network (NN). The basic operational concept involves the pipelined application of various unsupervised and supervised mechanisms i.e.: (1) k-means, (2) Chi-2, (3) local cluster (LC) network training, and (4) rule extraction from a trained LC network. The methodology will be tested and analysed using several well-known datasets
  • Keywords
    data mining; knowledge representation; neural nets; pattern clustering; statistical analysis; unsupervised learning; Chi-2; attribute discretisation; class identification; data mining; datavector simplification; feature selection; k-means; knowledge representation; local cluster; multi-mechanism; network training; rule extraction; rule-based knowledge discovery; rule-extraction pipeline; supervised learning; symbolic rule extraction; trained LC network; trained neural network; unannotated data; unannotated datasets; unsupervised-learning; Artificial intelligence; Clustering algorithms; Data analysis; Data mining; Electronic mail; Neural networks; Pipelines; Structural engineering; Testing; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2000. Proceedings
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    0-7803-6355-8
  • Type

    conf

  • DOI
    10.1109/TENCON.2000.893694
  • Filename
    893694