• DocumentCode
    3394548
  • Title

    Application of machine learning approaches on quantitative structure activity relationships

  • Author

    Butkiewicz, Mariusz ; Mueller, Ralf ; Selic, Danilo ; Dawson, Eric ; Meiler, Jens

  • Author_Institution
    Center for Struct. Biol., Vanderbilt Univ., Nashville, TN
  • fYear
    2009
  • fDate
    March 30 2009-April 2 2009
  • Firstpage
    255
  • Lastpage
    262
  • Abstract
    Machine Learning techniques are successfully applied to establish quantitative relations between chemical structure and biological activity (QSAR), i.e. classify compounds as active or inactive with respect to a specific target biological system. This paper presents a comparison of artificial neural networks (ANN), support vector machines (SVM), and decision trees (DT) in an effort to identify potentiators of metabotropic glutamate receptor 5 (mGluR5), compounds that have potential as novel treatments against schizophrenia. When training and testing each of the three techniques on the same dataset enrichments of 61, 64, and 43 were obtained and an area under the curve (AUC) of 0.77, 0.78, and 0.63 was determined for ANNs, SVMs, and DTs, respectively. For the top percentile of predicted active compounds, the true positives for all three methods were highly similar, while the inactives were diverse offering the potential use of jury approaches to improve prediction accuracy.
  • Keywords
    decision trees; drugs; learning (artificial intelligence); medical computing; neural nets; support vector machines; ANN; SVM; area under the curve; artificial neural networks; biological activity; chemical structure; decision trees; machine learning; metabotropic glutamate receptor 5; quantitative structure activity relationships; schizophrenia; support vector machines; Amino acids; Artificial neural networks; Data mining; Decision trees; High temperature superconductors; Human immunodeficiency virus; Machine learning; Support vector machine classification; Support vector machines; Testing; Artificial Neural Network (ANN); Decision Trees (DT); Machine Learning; Support Vector Machine (SVM); area under the curve (AUC); high-throughput screening (HTS); quantitative structure activity relationship (QSAR); receiver operator characteristics (ROC);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology, 2009. CIBCB '09. IEEE Symposium on
  • Conference_Location
    Nashville, TN
  • Print_ISBN
    978-1-4244-2756-7
  • Type

    conf

  • DOI
    10.1109/CIBCB.2009.4925736
  • Filename
    4925736