• DocumentCode
    238951
  • Title

    Sensitivity Analysis for Time Dependent Problems: Optimal Checkpoint-Recompute HPC Workflows

  • Author

    Carey, Varis ; Abbasi, Hasan ; Rodero, Ivan ; Kolla, Hemanth

  • Author_Institution
    Inst. of Comput. Eng. & Sci., Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2014
  • fDate
    16-16 Nov. 2014
  • Firstpage
    20
  • Lastpage
    30
  • Abstract
    Sensitivity analysis (SA) is a fundamental tool of uncertainty quantification(UQ). Adjoint-based SA is the optimal approach in many large-scale applications, such as the direct numerical simulation (DNS) of combustion. However, one of the challenges of the adjoint workflow for time-dependent applications is the storage and I/O requirements for the application state. During the time-reversal portion of the workflow, forward state is required in last-in-first-out order. The resulting requirements for storage at exascale are enormous. To mitigate this requirement, application state is regenerated from checkpoints over short windows of application time. This approach drastically reduces the total volume of stored data, allows the caching of state in the regeneration window in memory and on local SSDs, may accelerate the application execution by reducing output frequency, and reduces the power overhead from I/O. We explore variations to this workflow, applied to a proxy for the SA of turbulent combustion, by varying checkpoint number, state storage, and other regeneration options to find efficient implementations for minimizing compute time or power consumption.
  • Keywords
    parallel processing; sensitivity analysis; storage management; workflow management software; DNS; I/O requirements; adjoint-based SA; direct numerical simulation; forward state; last-in-first-out order; local SSDs; optimal checkpoint-recompute HPC workflows; output frequency reduction; power consumption; power overhead; regeneration window; sensitivity analysis; state storage; time dependent problems; time-reversal portion; turbulent combustion; uncertainty quantification; Analytical models; Checkpointing; Combustion; Computational modeling; Mathematical model; Sensitivity analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Workflows in Support of Large-Scale Science (WORKS), 2014 9th Workshop on
  • Conference_Location
    New Orleans, LA
  • Type

    conf

  • DOI
    10.1109/WORKS.2014.15
  • Filename
    7019859