DocumentCode
3104092
Title
Phylogenetic Predictions on Grids
Author
Katariya, Priyank Raj ; Vadhiyar, Sathish S.
Author_Institution
Supercomput. Educ. & Res. Centre, Indian Inst. of Sci., Bangalore, India
fYear
2009
fDate
9-11 Dec. 2009
Firstpage
58
Lastpage
65
Abstract
A phylogenetic or evolutionary tree is constructed from a set of species or DNA sequences and depicts the relatedness between the sequences. Predictions of future sequences in a phylogenetic tree are important for a variety of applications including drug discovery, pharmaceutical research and disease control. In this work, we predict future DNA sequences in a phylogenetic tree using cellular automata. Cellular automata are used for modeling neighbor-dependent mutations from an ancestor to a progeny in a branch of the phylogenetic tree. Since the number of possible ways of transformations from an ancestor to a progeny is huge, we use computational grids and middleware techniques to explore the large number of cellular automata rules used for the mutations. We use the popular and recurring neighbor-based transitions or mutations to predict the progeny sequences in the phylogenetic tree. We performed predictions for three types of sequences, namely, triose phosphate isomerase, pyruvate kinase, and polyketide synthase sequences, by obtaining cellular automata rules on a grid consisting of 29 machines in 4 clusters located in 4 countries, and compared the predictions of the sequences using our method with predictions by random methods. We found that in all cases, our method gave about 40% better predictions than the random methods.
Keywords
DNA; biology computing; cellular automata; evolution (biological); evolutionary computation; genetics; grid computing; middleware; trees (mathematics); DNA sequences; cellular automata; computational grids; disease control; drug discovery; evolutionary tree; middleware techniques; pharmaceutical research; phylogenetic predictions; phylogenetic tree; polyketide synthase sequences; pyruvate kinase; triose phosphate isomerase; Automatic control; DNA; Diseases; Drugs; Genetic mutations; Grid computing; Middleware; Pharmaceuticals; Phylogeny; Sequences; DNA sequences; Grids; master-worker; phylogeny; predictions;
fLanguage
English
Publisher
ieee
Conference_Titel
e-Science, 2009. e-Science '09. Fifth IEEE International Conference on
Conference_Location
Oxford
Print_ISBN
978-0-7695-3877-8
Type
conf
DOI
10.1109/e-Science.2009.17
Filename
5380886
Link To Document