• DocumentCode
    3112234
  • Title

    A Minimal Coverage-based Classification method and its application in predictive toxicology data mining

  • Author

    Guo, Gongde ; Huang, Yu

  • Author_Institution
    Sch. of Math. & Comput. Sci., Fujian Normal Univ., Fuzhou
  • fYear
    2008
  • fDate
    12-15 Oct. 2008
  • Firstpage
    1243
  • Lastpage
    1246
  • Abstract
    A robust method, MCC (minimal coverage-based classification), for toxicity prediction of chemical compounds is proposed. The MCC method mainly considers the local distribution of each class around a new tuple to be classified and uses minimal coverage principle - covering minimal number of tuples with different classes - to classify this new tuple. The merits of MCC over other machine learning algorithms are threefold: (1) uniform approach for both numerical and categorical data; (2) deals with missing values; (3) given a new data tuple, it provides values for all classes which measure the likelihood of the tuple being in each class. The experimental results of MCC conducted on seven toxicity data sets from real-world applications are compared with the results of IBL, DT, Ripper, MLP and SVM in terms of classification performance. This application shows that MCC is a promising method for the toxicity prediction of chemical compounds.
  • Keywords
    chemical hazards; chemistry computing; data mining; learning (artificial intelligence); pattern classification; toxicology; chemical compound; data mining; data tuple; machine learning; minimal coverage-based classification method; toxicity prediction; Application software; Chemical compounds; Computer science; Data mining; Machine learning algorithms; Mathematics; Merging; Multidimensional systems; Noise cancellation; Toxicology; classification; hyper tuples; minimal coverage model; performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
  • Conference_Location
    Singapore
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-2383-5
  • Electronic_ISBN
    1062-922X
  • Type

    conf

  • DOI
    10.1109/ICSMC.2008.4811453
  • Filename
    4811453