Title :
Data mining in protein interactomics
Author :
Chen, Jake Y. ; Sivachenko, Andrey Y.
Author_Institution :
Indiana Univ., Indianapolis, IN, USA
Abstract :
In this article, protein interactomics, an emerging field that studies the total collection of proteins and intracellular protein interactions in an organism, i.e., the study of protein interactomes is introduced. Protein interactomics is concerned with all the expressed proteins in a given tissue or cell type and how proteins physically interact with, or bind to, one another in the protein interaction network. Protein interactomes can provide information about protein functional links and protein functional context not apparent from either protein sequence analysis or protein expression analysis. By studying protein interactomics, biologists can compile biological pathway models to understand functional roles of previously uncharacterized proteins and biological processes in different developmental and environmental conditions. The paper discussed new biological discovery opportunities by presenting six specific data mining challenges in protein interactomics - data generation, data representation, data cleansing, data integration, data analysis/visualization, and knowledge curation.
Keywords :
biological tissues; biology computing; cellular biophysics; data analysis; data mining; data structures; data visualisation; molecular biophysics; proteins; biological pathway models; data analysis/visualization; data cleansing; data generation; data integration; data mining; data representation; intracellular protein interactions; knowledge curation; organism; protein binding; protein expression; protein functional context; protein functional links; protein interactomics; protein sequence; tissue; Bioinformatics; Biological processes; Biological system modeling; Biology computing; Cells (biology); Data mining; Genomics; Information analysis; Protein engineering; Sequences; Algorithms; Artificial Intelligence; Computational Biology; Computer Simulation; Database Management Systems; Gene Expression Profiling; Information Storage and Retrieval; Models, Biological; Protein Interaction Mapping; Proteome; Proteomics; Software; Systems Integration; User-Computer Interface;
Journal_Title :
Engineering in Medicine and Biology Magazine, IEEE
DOI :
10.1109/MEMB.2005.1436466