Title :
Application re-structuring and data management on a grid environment: a case study for bioinformatics
Author :
Ciriello, Giovanni ; Comin, Matteo ; Guerra, Concettina
Author_Institution :
Dept. of Inf. Eng., Padova Univ.
Abstract :
This paper describes a distributed implementation of PROuST, a method for protein structure comparison, that involves a major restructuring of the application for an efficient grid immersion. PROuST consists of several components that perform different tasks at different stages. Given a target protein, an index-based search retrieves from a database a list of proteins that are good candidates for similarity, then a dynamic programming algorithm aligns the target protein with each candidate protein. The same geometric properties of secondary structure elements of proteins are used by different components of PROuST. Thus, an important issue of the distributed implementation is data transfer vs. data recomputation tradeoffs. Our implementation avoids recomputation by re-using the hash table data as much as possible, once they are accessed. The algorithmic changes to the application allow to reduce the number of data accesses to storage elements and consequently the execution time. In addition this paper discusses data replication strategies on a grid environment to optimize the data transfer time
Keywords :
biology computing; database indexing; document handling; grid computing; medical information systems; proteins; query formulation; PROuST; application restructuring; bioinformatics; data management; data transfer; grid immersion; index-based search; protein structure comparison; Bioinformatics; Biomedical computing; Computer aided software engineering; Distributed computing; Dynamic programming; Educational institutions; Grid computing; Heuristic algorithms; Information retrieval; Protein engineering;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
Conference_Location :
Rhodes Island
Print_ISBN :
1-4244-0054-6
DOI :
10.1109/IPDPS.2006.1639539