• DocumentCode
    3114087
  • Title

    A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories

  • Author

    Tu, Tiankai ; Rendleman, Charles A. ; Borhani, David W. ; Dror, Ron O. ; Gullingsrud, Justin ; Jensen, Morten Ø ; Klepeis, John L. ; Maragakis, Paul ; Miller, Patrick ; Stafford, Kate A. ; Shaw, David E.

  • Author_Institution
    D.E. Shaw Res., New York, NY, USA
  • fYear
    2008
  • fDate
    15-21 Nov. 2008
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    As parallel algorithms and architectures drive the longest molecular dynamics (MD) simulations towards the millisecond scale, traditional sequential post-simulation data analysis methods are becoming increasingly untenable. Inspired by the programming interface of Google´s MapReduce, we have built a new parallel analysis framework called HiMach, which allows users to write trajectory analysis programs sequentially, and carries out the parallel execution of the programs automatically. We introduce (1) a new MD trajectory data analysis model that is amenable to parallel processing, (2) a new interface for defining trajectories to be analyzed, (3) a novel method to make use of an existing sequential analysis tool called VMD, and (4) an extension to the original MapReduce model to support multiple rounds of analysis. Performance evaluations on up to 512 cores demonstrate the efficiency and scalability of the HiMach framework on a Linux cluster.
  • Keywords
    data analysis; digital simulation; molecular dynamics method; parallel algorithms; parallel architectures; physics computing; Google MapReduce; HiMach; MD trajectory data analysis model; MapReduce model; longest molecular dynamics; parallel algorithm; parallel analysis framework; parallel architecture; parallel processing; programming interface; scalable parallel framework; sequential analysis tool; sequential post-simulation data analysis; tera-scale molecular dynamics simulation; trajectory analysis programs; Analytical models; Automatic programming; Data analysis; Linux; Parallel algorithms; Parallel processing; Parallel programming; Scalability; Sequential analysis; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for
  • Conference_Location
    Austin, TX
  • Print_ISBN
    978-1-4244-2834-2
  • Electronic_ISBN
    978-1-4244-2835-9
  • Type

    conf

  • DOI
    10.1109/SC.2008.5214715
  • Filename
    5214715