DocumentCode
3114087
Title
A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories
Author
Tu, Tiankai ; Rendleman, Charles A. ; Borhani, David W. ; Dror, Ron O. ; Gullingsrud, Justin ; Jensen, Morten Ø ; Klepeis, John L. ; Maragakis, Paul ; Miller, Patrick ; Stafford, Kate A. ; Shaw, David E.
Author_Institution
D.E. Shaw Res., New York, NY, USA
fYear
2008
fDate
15-21 Nov. 2008
Firstpage
1
Lastpage
12
Abstract
As parallel algorithms and architectures drive the longest molecular dynamics (MD) simulations towards the millisecond scale, traditional sequential post-simulation data analysis methods are becoming increasingly untenable. Inspired by the programming interface of Google´s MapReduce, we have built a new parallel analysis framework called HiMach, which allows users to write trajectory analysis programs sequentially, and carries out the parallel execution of the programs automatically. We introduce (1) a new MD trajectory data analysis model that is amenable to parallel processing, (2) a new interface for defining trajectories to be analyzed, (3) a novel method to make use of an existing sequential analysis tool called VMD, and (4) an extension to the original MapReduce model to support multiple rounds of analysis. Performance evaluations on up to 512 cores demonstrate the efficiency and scalability of the HiMach framework on a Linux cluster.
Keywords
data analysis; digital simulation; molecular dynamics method; parallel algorithms; parallel architectures; physics computing; Google MapReduce; HiMach; MD trajectory data analysis model; MapReduce model; longest molecular dynamics; parallel algorithm; parallel analysis framework; parallel architecture; parallel processing; programming interface; scalable parallel framework; sequential analysis tool; sequential post-simulation data analysis; tera-scale molecular dynamics simulation; trajectory analysis programs; Analytical models; Automatic programming; Data analysis; Linux; Parallel algorithms; Parallel processing; Parallel programming; Scalability; Sequential analysis; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for
Conference_Location
Austin, TX
Print_ISBN
978-1-4244-2834-2
Electronic_ISBN
978-1-4244-2835-9
Type
conf
DOI
10.1109/SC.2008.5214715
Filename
5214715
Link To Document