Title of article :
Prediction of the acute toxicity of chemical compounds to the fathead minnow by machine learning approaches
Author/Authors :
Tan، نويسنده , , Ningxin and Li، نويسنده , , Ping-Fan Rao، نويسنده , , Han-Bing and Li، نويسنده , , Ze-Rong and Li، نويسنده , , Xiang-Yuan، نويسنده ,
Issue Information :
دوفصلنامه با شماره پیاپی سال 2010
Pages :
8
From page :
66
To page :
73
Abstract :
Support vector machines (SVM) and artificial neural networks (ANN) are applied for prediction of the acute toxicity of compounds to fathead minnow from molecular structure. A diverse set of 611 compounds, including 442 fathead minnow toxicity (FMT) agents and 169 non-FMT agents, are adopted to develop the classification models. A hybrid feature selection method, which combines Fischerʹs score and Monte Carlo simulated annealing embedded in the SVM approach, is used to select the relevant descriptors from 1559 molecular descriptors. Five-fold cross-validation method is used to optimize the model parameters and select the relevant descriptors. Using the 60 selected descriptors, SVM model gives an averaged prediction accuracy of 95.5% for FMT, 79.3% for non-FMT and 91.0% for all samples, while the corresponding values of the ANN model are 92.5%, 75.2% and 87.7%, respectively. The study indicates that the hybrid feature selection method is very efficient and the selected descriptors from the SVM approach have also a good performance for the ANN approach. A hold-out method is used to build the final classification models by using the selected descriptors and optimized model parameters from the 5-fold cross-validation. The SVM model gives an excellent prediction accuracy of 96.6% for FMT, 93.0% for non-FMT and 95.1% for all samples, while the corresponding values of the ANN model are 91.4%, 90.7% and 91.1%, respectively.
Keywords :
Support vector machine , Fathead minnow , feature selection , Artificial neural networks , Monte Carlo simulated annealing
Journal title :
Chemometrics and Intelligent Laboratory Systems
Serial Year :
2010
Journal title :
Chemometrics and Intelligent Laboratory Systems
Record number :
1489654
Link To Document :
بازگشت