DocumentCode :
779957
Title :
TRUSS: a reliable, scalable server architecture
Author :
Gold, Brian T. ; Kim, Jangwoo ; Smolens, Jared C. ; Chung, Eric S. ; Liaskovitis, Vasileios ; Nurvitadhi, Eriko ; Falsafi, Babak ; Hoe, James C. ; Nowatzyk, Andreas G.
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
25
Issue :
6
fYear :
2005
Firstpage :
51
Lastpage :
59
Abstract :
Traditional techniques that mainframes use to increase reliability -special hardware or custom software - are incompatible with commodity server requirements. The Total Reliability Using Scalable Servers (TRUSS) architecture, developed at Carnegie Mellon, aims to bring reliability to commodity servers. TRUSS features a distributed shared-memory (DSM) multiprocessor that incorporates computation and memory storage redundancy to detect and recover from any single point of transient or permanent failure. Because its underlying DSM architecture presents the familiar shared-memory programming model, TRUSS requires no changes to existing applications and only minor modifications to the operating system to support error recovery.
Keywords :
distributed shared memory systems; fault tolerance; parallel architectures; DSM multiprocessor; TRUSS architecture; Total Reliability Using Scalable Servers; commodity server requirements; distributed shared-memory; mainframes; memory storage redundancy; shared-memory programming model; Biomembranes; Control systems; Delay; Error correction; Error correction codes; Fault tolerant systems; Out of order; Process design; Protection; Redundancy; Performance Analysis and Design Aids; Reliability; Testing; and Fault-Tolerance;
fLanguage :
English
Journal_Title :
Micro, IEEE
Publisher :
ieee
ISSN :
0272-1732
Type :
jour
DOI :
10.1109/MM.2005.122
Filename :
1566557
Link To Document :
بازگشت