DocumentCode :
830088
Title :
Essential latent knowledge for protein-protein interactions: analysis by an unsupervised learning approach
Author :
Mamitsuka, Hiroshi
Author_Institution :
Inst. for Chem. Res., Kyoto Univ., Japan
Volume :
2
Issue :
2
fYear :
2005
Firstpage :
119
Lastpage :
130
Abstract :
Protein-protein interactions play a number of central roles in many cellular functions, including DNA replication, transcription and translation, signal transduction, and metabolic pathways. A recent increase in the number of protein-protein interactions has made predicting unknown protein-protein interactions important for the understanding of living cells. However, the protein-protein interactions experimentally obtained so far are often incomplete and contradictory and, consequently, existing computational prediction methods have integrated evidence (latent knowledge of proteins) from different and more reliable sources. Analyzing the relationships between proteins and the latent knowledge is important to understanding the cellular processes. For this analysis, we propose a new probabilistic model for protein-protein interactions by considering the latent knowledge of proteins. We further present an efficient learning algorithm for this model, based on an EM algorithm. Experimental results have shown that in a supervised test setting, the proposed method outperformed five other competing methods by a statistically significant factor in all cases. Using the probability parameters of a trained model, we have further shown the latent knowledge that is essential to predicting protein-protein interactions. Overall, our experimental results confirm that our proposed model is especially effective for analyzing protein-protein interactions from a viewpoint of the latent knowledge of proteins.
Keywords :
DNA; biology computing; cellular biophysics; molecular biophysics; prediction theory; probability; proteins; unsupervised learning; DNA replication; DNA transcription; DNA translation; cellular functions; computational prediction methods; essential latent knowledge; metabolic pathways; probability parameters; protein-protein interactions; signal transduction; unsupervised learning; Clustering algorithms; DNA; Data mining; Prediction methods; Predictive models; Protein engineering; Sequences; Signal analysis; Transaction databases; Unsupervised learning; Biology and genetics; data mining; machine learning; mining methods and algorithms.; Algorithms; Artificial Intelligence; Cluster Analysis; Gene Expression Profiling; Pattern Recognition, Automated; Protein Interaction Mapping; Proteome; Signal Transduction;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2005.23
Filename :
1438349
Link To Document :
بازگشت