Title :
A model based on minimotifs for classification of stable protein-protein complexes
Author :
Rueda, Laura ; Pandit, M.
Author_Institution :
Sch. of Comput. Sci., Univ. of Windsor, Windsor, ON, Canada
Abstract :
Prediction of protein-protein interactions (PPIs) is an important problem in biology, since interactions play key role in most biological processes and functions in living cells. PPIs have been studied from many perspectives. Of these, an important problem is prediction of different complex types such as obligate vs. non obligate and transient vs. permanent, among others. We focus on prediction of obligate protein complexes, which are more stable and perform a specific function, as opposed to transient and non-obligate complexes which last for a short period of time. We have modeled the prediction problem using minimotifs, aka short-linear motifs, to extract information contained in the protein sequences to distinguish between obligate and non-obligate PPIs. Incorporating different classifiers such as the k-nearest neighbor (k-NN), the support vector machine (SVM) and linear dimensionality reduction (LDR) yields a very powerful scheme for prediction. On two well-known datasets, the model delivers classification accuracies as high as 99%. Analysis and cross-dataset validation show that the information contained in the training sequences is crucial for prediction and determination of stability in PPIs.
Keywords :
cellular biophysics; molecular biophysics; molecular configurations; proteins; support vector machines; LDR; SVM; k-NN; k-nearest neighbor; linear dimensionality reduction; living cells; minimotifs; nonobligate complexes; protein sequences; short-linear motifs; stable protein-protein complex classification; support vector machine; Accuracy; Amino acids; Predictive models; Proteins; Support vector machines; Training; classification; linear dimensionality reduction; protein-protein interaction; short-linear motifs;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2014 IEEE Conference on
Conference_Location :
Honolulu, HI
DOI :
10.1109/CIBCB.2014.6845508