DocumentCode :
3394548
Title :
Application of machine learning approaches on quantitative structure activity relationships
Author :
Butkiewicz, Mariusz ; Mueller, Ralf ; Selic, Danilo ; Dawson, Eric ; Meiler, Jens
Author_Institution :
Center for Struct. Biol., Vanderbilt Univ., Nashville, TN
fYear :
2009
fDate :
March 30 2009-April 2 2009
Firstpage :
255
Lastpage :
262
Abstract :
Machine Learning techniques are successfully applied to establish quantitative relations between chemical structure and biological activity (QSAR), i.e. classify compounds as active or inactive with respect to a specific target biological system. This paper presents a comparison of artificial neural networks (ANN), support vector machines (SVM), and decision trees (DT) in an effort to identify potentiators of metabotropic glutamate receptor 5 (mGluR5), compounds that have potential as novel treatments against schizophrenia. When training and testing each of the three techniques on the same dataset enrichments of 61, 64, and 43 were obtained and an area under the curve (AUC) of 0.77, 0.78, and 0.63 was determined for ANNs, SVMs, and DTs, respectively. For the top percentile of predicted active compounds, the true positives for all three methods were highly similar, while the inactives were diverse offering the potential use of jury approaches to improve prediction accuracy.
Keywords :
decision trees; drugs; learning (artificial intelligence); medical computing; neural nets; support vector machines; ANN; SVM; area under the curve; artificial neural networks; biological activity; chemical structure; decision trees; machine learning; metabotropic glutamate receptor 5; quantitative structure activity relationships; schizophrenia; support vector machines; Amino acids; Artificial neural networks; Data mining; Decision trees; High temperature superconductors; Human immunodeficiency virus; Machine learning; Support vector machine classification; Support vector machines; Testing; Artificial Neural Network (ANN); Decision Trees (DT); Machine Learning; Support Vector Machine (SVM); area under the curve (AUC); high-throughput screening (HTS); quantitative structure activity relationship (QSAR); receiver operator characteristics (ROC);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2009. CIBCB '09. IEEE Symposium on
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-2756-7
Type :
conf
DOI :
10.1109/CIBCB.2009.4925736
Filename :
4925736
Link To Document :
بازگشت