DocumentCode :
2891976
Title :
A Comparative Assessment of Conventional and Machine-Learning-Based Scoring Functions in Predicting Binding Affinities of Protein-Ligand Complexes
Author :
Ashtawy, Hossam M. ; Mahapatra, Nihar R.
Author_Institution :
Dept. of Electr. & Comput. Eng., Michigan State Univ., East Lansing, MI, USA
fYear :
2011
fDate :
12-15 Nov. 2011
Firstpage :
627
Lastpage :
630
Abstract :
Accurately predicting the binding affinities of large sets of protein-ligand complexes is a key challenge in computational biomolecular science, with applications in drug discovery, chemical biology, and structural biology. Since a scoring function (SF) is used to score, rank, and identify drug leads, the fidelity with which it predicts the affinity of a ligand candidate for a protein´s binding site has a significant bearing on the accuracy of virtual screening. Despite intense efforts in developing conventional SFs, which are either force-field based, knowledge-based, or empirical, their limited predictive power has been a major roadblock toward cost-effective drug discovery. Therefore, in this work, we explore a range of novel SFs employing different machine-learning (ML) approaches in conjunction with physicochemical features characterizing protein-ligand complexes. We assess the scoring accuracies of these new ML-based SFs as well as those of conventional SFs in the context of the 2007 PDBbind benchmark dataset on both diverse and protein-family-specific test sets. We find that the best performing ML-based SF has a Pearson´s correlation coefficient of 0.771 between predicted and measured binding affinities compared to 0.644 achieved by a state-of-the-art conventional SF.
Keywords :
drugs; learning (artificial intelligence); medical computing; proteins; binding affinity prediction; chemical biology; comparative assessment; computational biomolecular science; drug discovery; empirical SF; force-field based SF; knowledge-based SF; machine-learning-based scoring functions; protein-family-specific test sets; protein-ligand complex; structural biology; virtual screening; Correlation; Drugs; Proteins; Radio frequency; Software; Support vector machines; Training; Machine learning; protein-ligand binding affinity; scoring function; scoring power; structure-based drug design;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1799-4
Type :
conf
DOI :
10.1109/BIBM.2011.128
Filename :
6120516
Link To Document :
بازگشت