DocumentCode :
1681600
Title :
Parallel, scalable, memory-efficient backtracking for combinatoria modeling of large-scale biological systems
Author :
Park, Byung-Hoon ; Schmidt, Matthew ; Thomas, Kevin ; Karpinets, Tatiana ; Samatova, Nagiza F.
Author_Institution :
Comput. Sci. & Math. Div., Oak Ridge Nat. Lab., Oak Ridge, TN
fYear :
2008
Firstpage :
1
Lastpage :
8
Abstract :
Data-driven modeling of biological systems such as protein- protein interaction networks is data-intensive and combinatorially challenging. Backtracking can constrain a combinatorial search space. Yet, its recursive nature, exacerbated by data-intensity, limits its applicability for large-scale systems. Parallel, scalable, and memory-efficient backtracking is a promising approach. Parallel backtracking suffers from unbalanced loads. Load rebalancing via synchronization and data movement is prohibitively expensive. Balancing these discrepancies, while minimizing end-to-end execution time and memory requirements, is desirable. This paper introduces such a framework. Its scalability and efficiency, demonstrated on the maximal clique enumeration problem, are attributed to the proposed: (a) representation of search tree decomposition to enable parallelization; (b) depth-first parallel search to minimize memory requirement; (c) least stringent synchronization to minimize data movement; and (d) on-demand work stealing with stack splitting to minimize processors´ idle time. The applications of this framework to real biological problems related to bioethanol production are discussed.
Keywords :
biology computing; combinatorial mathematics; parallel processing; proteins; resource allocation; storage management; tree searching; bioethanol production; combinatorial modeling; combinatorial search space; data movement; data-driven modeling; depth-first parallel search; end-to-end execution time; large-scale biological systems; least stringent synchronization; load rebalancing; memory requirement minimization; memory-efficient backtracking; on-demand work stealing; parallel backtracking; protein-protein interaction network; scalable backtracking; search tree decomposition; stack splitting; Biological system modeling; Biological systems; Biology computing; Computational modeling; Computer science; Laboratories; Large-scale systems; Mathematical model; Predictive models; Space exploration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
Conference_Location :
Miami, FL
ISSN :
1530-2075
Print_ISBN :
978-1-4244-1693-6
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2008.4536180
Filename :
4536180
Link To Document :
بازگشت