• DocumentCode
    625577
  • Title

    On Closed Nesting and Checkpointing in Fault-Tolerant Distributed Transactional Memory

  • Author

    Dhoke, Aditya ; Ravindran, Binoy ; Zhang, Bo

  • Author_Institution
    ECE Dept., Virginia Tech, Falls Church, VA, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    41
  • Lastpage
    52
  • Abstract
    We consider the closed nesting and checkpointing models for transactions in fault-tolerant distributed transactional memory (DTM). The closed nested model allows inner-nested transactions to be aborted (in the event of a transactional conflict) without aborting the parent transaction, while checkpointing allows transactions to roll back to a previous execution state, potentially improving concurrency over flat nesting. We consider a quorum-based replicated model for fault-tolerant DTM, and present algorithms to support closed nesting and checkpointing. The algorithms use incremental validation to avoid communication overhead on commit, and ensure 1-copy equivalence. Our experimental studies using a Java DTM implementation of the algorithms on micro and macro benchmarks reveal the conditions under which they improve transactional throughput over flat nesting, as well as their relative advantages and disadvantages.
  • Keywords
    Java; checkpointing; concurrency control; distributed memory systems; software fault tolerance; transaction processing; 1-copy equivalence; Java DTM implementation; checkpointing model; closed nesting; communication overhead; concurrency; execution state; fault-tolerant DTM; fault-tolerant distributed transactional memory; flat nesting; inner-nested transaction; macrobenchmark; microbenchmark; parent transaction; quorum-based replicated model; transactional conflict; transactional throughput; Benchmark testing; Concurrent computing; Electronic mail; Fault tolerance; Fault tolerant systems; Protocols; distributed systems; replication; transactional memory
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
  • Conference_Location
    Boston, MA
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-6066-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2013.103
  • Filename
    6569799