DocumentCode
3092945
Title
PAI: A Lightweight Mechanism for Single-Node Memory Recovery in DSM Servers
Author
Kim, Jangwoo ; Smolens, Jared C. ; Falsafi, Babak ; Hoe, James C.
Author_Institution
Carnegie Mellon Univ., Pittsburgh
fYear
2007
fDate
17-19 Dec. 2007
Firstpage
298
Lastpage
305
Abstract
Several recent studies identify the memory system as the most frequent source of hardware failures in commercial servers. Techniques to protect the memory system from failures must continue to service memory requests, despite hardware failures. Furthermore, to support existing OS´s, the physical address space must be retained following reconfiguration. Existing techniques either suffer from a high performance overhead or require pervasive hardware changes to support transparent recovery. In this paper, we propose physical address indirection (PAI), a lightweight, hardware-based mechanism for memory system failure recovery. PAI provides a simple hardware mapping to transparently reconstruct affected data in alternate locations, while maintaining high performance and avoiding physical address changes. With full-system simulation of commercial and scientific workloads on a 16-node distributed shared memory server, we show that prior techniques have an average degraded mode performance loss of 14 % and 51 % for commercial and scientific workloads, respectively. Using PAI´s data- swap reconstruction, the same workloads have 1 % and 32 % average performance losses.
Keywords
distributed shared memory systems; system recovery; DSM servers; distributed shared memory architectures; memory system failure recovery; physical address indirection; Availability; Computer architecture; Degradation; Hardware; Laboratories; Operating systems; Performance loss; Protection; Virtual machine monitors; Web server;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Computing, 2007. PRDC 2007. 13th Pacific Rim International Symposium on
Conference_Location
Melbourne, Qld.
Print_ISBN
0-7695-3054-0
Type
conf
DOI
10.1109/PRDC.2007.37
Filename
4459674
Link To Document