DocumentCode
3491034
Title
Evaluating the denoising techniques in protein-protein interaction prediction
Author
Wang, Yong-Cui ; Ren, Xian-Wen ; Zhang, Chun-Hua ; Deng, Nai-Yang ; Xiang-Sun Zhang
Author_Institution
Key Lab. of Adaptation & Evolution of Plateau Biota, Chinese Acad. of Sci., Xining, China
fYear
2011
fDate
2-4 Sept. 2011
Firstpage
78
Lastpage
83
Abstract
The past decades witnessed extensive efforts to study the relationships among proteins. Particularly, sequence-based protein-protein interactions (PPIs) prediction is fundamentally important in speeding up the process of mapping interactomes of organisms. The composition vectors are usually constructed to encode proteins as real-value vectors, which is feeding to a machine learning framework. However, the composition vector value might be highly correlated to the distribution of amino acids, i.e., amino acids which are frequently observed in nature tend to have a large value of composition vector. Thus formulation to estimate the noise may be needed during representations. Here, we introduce two kinds of denoising composition vectors, which are efficient in construction of phylogenetic trees, to eliminate the noise. When validating these two denoising composition vectors on Escherichia coli (E.coli) and Saccharomyces cerevisiae (S.cerevisiae) randomly and artificial negative datasets, respectively, the predictive performance is not improved, and even worse than non-denoised prediction. These results suggest that, the denoising formulation efficient in phylogenetic trees construction can not improve the PPIs prediction, that is, what is noise is dependent on the applications.
Keywords
biological techniques; biology computing; microorganisms; molecular biophysics; proteins; Escherichia coli; Saccharomyces cerevisiae; artificial negative datasets; denoising composition vectors; denoising techniques; phylogenetic trees; protein-protein interaction prediction; Amino acids; Noise; Noise reduction; Phylogeny; Proteins; Support vector machines; Tin;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems Biology (ISB), 2011 IEEE International Conference on
Conference_Location
Zhuhai
Print_ISBN
978-1-4577-1661-4
Electronic_ISBN
978-1-4577-1665-2
Type
conf
DOI
10.1109/ISB.2011.6033124
Filename
6033124
Link To Document