DocumentCode
3215508
Title
Identifying structural repeats in proteins using graph centrality measures
Author
Jain, Ruchi ; Yalamanchili, Hari Krishna ; Parekh, Nita
Author_Institution
Center for Comput. Natural Sci. & Bioinf., IIIT, Hyderabad, India
fYear
2009
fDate
9-11 Dec. 2009
Firstpage
110
Lastpage
115
Abstract
Here we apply the graph-theoretic concept of betweenness centrality to a class of protein repeats, e.g., Armadillo (ARM) and HEAT. The betweenness of a node measures how often that node lies on the shortest paths between all pairs of nodes i, j in the network, and thus quantifies the contribution of each node to the network. These repeats are not easily detectable at the sequence level because of low conservation between independent repeated units; for example, HEAT repeats are known to have less than 13% identity. Their identification at the structure level typically involves self structure-structure comparison, which can be computationally very intensive. Our analysis of a set of proteins from the ARM and HEAT repeat families shows that the repeating units within the repeat regions exhibit similar connectivity patterns. Since it is generally accepted that, in many networks, the larger the degree of a node, the larger the chance that many of the shortest paths will pass through it, computing vertex betweenness provides a simple and elegant approach for identifying tandem structural repeats in proteins.
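The betweenness measure described in the abstract can be illustrated with a minimal sketch. It assumes an unweighted, undirected graph given as an adjacency dictionary and uses Brandes' algorithm, which is a standard way to compute vertex betweenness; the record does not say which algorithm the authors used. The function name `betweenness` and the five-node chain `toy` are hypothetical and are not data from the paper.

```python
from collections import deque

def betweenness(adj):
    """Vertex betweenness for an unweighted, undirected graph (Brandes' algorithm).

    adj maps each node to a list of its neighbours (hypothetical input format).
    """
    score = {v: 0.0 for v in adj}
    for s in adj:
        # Breadth-first search from s: count shortest paths and record predecessors.
        order = []
        preds = {v: [] for v in adj}
        sigma = {v: 0 for v in adj}   # number of shortest paths from s to v
        dist = {v: -1 for v in adj}   # -1 marks "not reached yet"
        sigma[s] = 1
        dist[s] = 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Accumulate pair dependencies, farthest nodes first.
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1.0 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Every unordered pair (i, j) was visited from both ends, so halve the totals.
    return {v: b / 2.0 for v, b in score.items()}

# Toy example (not from the paper): a five-node chain 0-1-2-3-4.
toy = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
scores = betweenness(toy)
print(scores)
```

In this chain the middle node lies on the most shortest paths and the two end nodes lie on none. Applying the same function to a protein contact network would require a rule for deciding which residues are in contact, which this record does not specify.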
Keywords
biology computing; data analysis; graph theory; molecular configurations; proteins; Armadillo; HEAT; graph centrality measure; protein structural repeat identification; structure-structure comparison; Amino acids; Bioinformatics; Computer networks; Databases; Particle measurements; Pattern analysis; Radar; Solvents; Betweenness; Protein contact network; Protein structural repeats
fLanguage
English
Publisher
IEEE
Conference_Titel
Nature & Biologically Inspired Computing, 2009. NaBIC 2009. World Congress on
Conference_Location
Coimbatore
Print_ISBN
978-1-4244-5053-4
Type
conf
DOI
10.1109/NABIC.2009.5393609
Filename
5393609