Title of article :
Prediction of protein interaction hot spots using rough set-based multiple criteria linear programming
Author/Authors :
Chen، نويسنده , , Ruoying and Zhang، نويسنده , , Zhiwang and Wu، نويسنده , , Di and Zhang، نويسنده , , Peng and Zhang، نويسنده , , Xinyang and Wang، نويسنده , , Yong and Shi، نويسنده , , Yong، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2011
Abstract :
Protein–protein interactions are fundamentally important in many biological processes and it is in pressing need to understand the principles of protein–protein interactions. Mutagenesis studies have found that only a small fraction of surface residues, known as hot spots, are responsible for the physical binding in protein complexes. However, revealing hot spots by mutagenesis experiments are usually time consuming and expensive. In order to complement the experimental efforts, we propose a new computational approach in this paper to predict hot spots. Our method, Rough Set-based Multiple Criteria Linear Programming (RS-MCLP), integrates rough sets theory and multiple criteria linear programming to choose dominant features and computationally predict hot spots. Our approach is benchmarked by a dataset of 904 alanine-mutated residues and the results show that our RS-MCLP method performs better than other methods, e.g., MCLP, Decision Tree, Bayes Net, and the existing HotSprint database. In addition, we reveal several biological insights based on our analysis. We find that four features (the change of accessible surface area, percentage of the change of accessible surface area, size of a residue, and atomic contacts) are critical in predicting hot spots. Furthermore, we find that three residues (Tyr, Trp, and Phe) are abundant in hot spots through analyzing the distribution of amino acids.
Keywords :
Predictive performance , Alanine mutation , Combined Model , binding free energy , Residue’s features
Journal title :
Journal of Theoretical Biology
Journal title :
Journal of Theoretical Biology