DocumentCode :
2568712
Title :
Interaction prediction of PDZ domains using a machine learning approach
Author :
Kalyoncu, Sibel ; Keskin, Ozlem ; Gursoy, Attila
Author_Institution :
Chem. & Biol. Eng., Koc Univ., Istanbul, Turkey
fYear :
2010
fDate :
20-22 April 2010
Firstpage :
121
Lastpage :
124
Abstract :
Protein interaction domains play crucial roles in many complex cellular pathways. PDZ domains are one of the most common protein interaction domains. Prediction of binding specificity of PDZ domains by a computational manner could eliminate unnecessary, time-consuming experiments. In this study, interactions of PDZ domains are predicted by using a machine learning approach in which only primary sequences of PDZ domains and peptides are used. In order to encode feature vectors for each interaction, trigram frequencies of primary sequences of PDZ domains and corresponding peptides are calculated. After construction of numerical interaction dataset, we compared different classifiers and ended up with Random Forest (RF) algorithm which gave the top performance. We obtained very high prediction accuracy (91.4%) for binary interaction prediction which outperforms all previous similar methods.
Keywords :
biological techniques; biology computing; cellular biophysics; learning (artificial intelligence); molecular biophysics; proteins; trees (mathematics); PDZ domain binding specificity; PDZ domain interaction prediction; PDZ domain primary sequences; cellular pathways; feature vectors; machine learning approach; numerical interaction dataset; peptide primary sequences; protein interaction domains; random forest algorithm; Amino acids; Biological information theory; Biology computing; Chemical engineering; Machine learning; Peptides; Predictive models; Protein engineering; Radio frequency; Sequences; PDZ domains; protein-protein interactions; random forest;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Health Informatics and Bioinformatics (HIBIT), 2010 5th International Symposium on
Conference_Location :
Antalya
Print_ISBN :
978-1-4244-5968-1
Type :
conf
DOI :
10.1109/HIBIT.2010.5478896
Filename :
5478896
Link To Document :
بازگشت