An Asynchronous Two-Level Checkpointing Method to Solve Adjoint Problems on Hierarchical Memory Spaces

Authors: Datta Debanjan*; Appelhans David; Evangelinos Constantinos; Jordan Kirk
Source: Computing in Science & Engineering, 2018, 20(4): 39-55.
DOI: 10.1109/MCSE.2018.042781325

Abstract

The problem of data reversal in discretized adjoint problems is often solved using checkpointing, which trades memory usage against computation and data movement. The authors present a model for designing and implementing an asynchronous two-level checkpointing method, with parameterizable values for current and future system configurations. They also evaluate the benefits of new supercomputing hardware by implementing an asynchronous algorithm that takes advantage of the fast NVLink interconnect and Non-Volatile Memory Express (NVMe) memory. They show that the new hardware, combined with an asynchronous approach, can run bigger simulations faster than current-generation hardware.
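
To make the trade-off in the abstract concrete, the following is a minimal sketch of two-level checkpointing for data reversal. It is not the authors' algorithm: it is synchronous, and the names `step`, `reversed_states`, `slow`, `fast`, and `stride` are hypothetical, chosen only for illustration. A dictionary stands in for the large, slow memory level and a short list for the small, fast one.

```python
def step(x):
    """Hypothetical forward time step (assumption: a cheap scalar update)."""
    return 0.5 * x + 1.0


def reversed_states(x0, n_steps, stride):
    """Yield (n, x_n) for n = n_steps-1 down to 0, storing only every stride-th state."""
    # Forward sweep: keep coarse checkpoints only (stand-in for the slow level).
    slow = {}
    x = x0
    for n in range(n_steps):
        if n % stride == 0:
            slow[n] = x
        x = step(x)

    # Reverse sweep: rebuild one segment at a time in a small buffer
    # (stand-in for the fast level), then hand its states out backwards.
    for start in sorted(slow, reverse=True):
        end = min(start + stride, n_steps)
        fast = [slow[start]]
        for _ in range(start + 1, end):
            fast.append(step(fast[-1]))  # recomputation replaces storage
        for offset in range(len(fast) - 1, -1, -1):
            yield start + offset, fast[offset]


if __name__ == "__main__":
    for n, x in reversed_states(3.0, 10, 4):
        print(n, x)
```

With `stride` steps per segment, the sketch holds roughly `n_steps / stride` coarse checkpoints plus at most `stride` states in the fast buffer, at the cost of about one extra forward sweep. An asynchronous variant, as described in the abstract, would overlap the transfers between the two levels with computation; that part is omitted here.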

  • Publication date: 2018-8
  • Affiliation: IBM