Title of article :
SmoPSI: Analysis and Prediction of Small Molecule Binding Sites Based on Protein Sequence Information
Author/Authors :
Wang, Wei Department of Computer Science and Technology - College of Computer and Information Engineering - Henan Normal University - Xinxiang - Henan Province, China , Li, Keliang Department of Computer Science and Technology - College of Computer and Information Engineering - Henan Normal University - Xinxiang - Henan Province, China , Lv, Hehe Department of Computer Science and Technology - College of Computer and Information Engineering - Henan Normal University - Xinxiang - Henan Province, China , Zhang, Hongjun School of Aviation Engineering - Anyang University - Anyang - Henan Province, China , Wang, Shixun Department of Computer Science and Technology - College of Computer and Information Engineering - Henan Normal University - Xinxiang - Henan Province, China , Huang, Junwei Department of Computer Science and Technology - College of Computer and Information Engineering - Henan Normal University - Xinxiang - Henan Province, China
Abstract :
(e analysis and prediction of small molecule binding sites is very important for drug discovery and drug design. (e traditional
experimental methods for detecting small molecule binding sites are usually expensive and time consuming, and the tools for
single species small molecule research are equally inefficient. In recent years, some algorithms for predicting binding sites of
protein-small molecules have been developed based on the geometric and sequence characteristics of proteins. In this paper, we
have proposed SmoPSI, a classification model based on the XGBoost algorithm for predicting the binding sites of small molecules,
using protein sequence information. (e model achieved better results with an AUC of 0.918 and an ACC of 0.913. (e experimental results demonstrate that our method achieves high performances and outperforms many existing predictors. In
addition, we also analyzed the binding residues and nonbinding residues and finally found the PSSM; hydrophilicity, hydrophobicity, charge, and hydrogen bonding have obviously different effects on the binding-site predictions.
Keywords :
SmoPSI , Molecule , Protein , ACC
Journal title :
Computational and Mathematical Methods in Medicine