Title :
A Minimal Coverage-based Classification method and its application in predictive toxicology data mining
Author :
Guo, Gongde ; Huang, Yu
Author_Institution :
Sch. of Math. & Comput. Sci., Fujian Normal Univ., Fuzhou
Abstract :
A robust method, MCC (minimal coverage-based classification), for toxicity prediction of chemical compounds is proposed. The MCC method mainly considers the local distribution of each class around a new tuple to be classified and uses minimal coverage principle - covering minimal number of tuples with different classes - to classify this new tuple. The merits of MCC over other machine learning algorithms are threefold: (1) uniform approach for both numerical and categorical data; (2) deals with missing values; (3) given a new data tuple, it provides values for all classes which measure the likelihood of the tuple being in each class. The experimental results of MCC conducted on seven toxicity data sets from real-world applications are compared with the results of IBL, DT, Ripper, MLP and SVM in terms of classification performance. This application shows that MCC is a promising method for the toxicity prediction of chemical compounds.
Keywords :
chemical hazards; chemistry computing; data mining; learning (artificial intelligence); pattern classification; toxicology; chemical compound; data mining; data tuple; machine learning; minimal coverage-based classification method; toxicity prediction; Application software; Chemical compounds; Computer science; Data mining; Machine learning algorithms; Mathematics; Merging; Multidimensional systems; Noise cancellation; Toxicology; classification; hyper tuples; minimal coverage model; performance;
Conference_Titel :
Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-2383-5
Electronic_ISBN :
1062-922X
DOI :
10.1109/ICSMC.2008.4811453