DocumentCode :
2541502
Title :
Reliability algorithms for network swapping systems with page migration
Author :
Mitchell, Bernhard ; Rosse, J. ; Newhall, Tia
Author_Institution :
Swarthmore Coll., PA, USA
fYear :
2004
fDate :
20-23 Sept. 2004
Firstpage :
490
Abstract :
Summary form only given. Network swapping systems allow individual cluster nodes with over-committed memory to use the idle memory of remote nodes as their backing store, and to swap pages over the network. Without reliability support a single node crash can affect programs running on other nodes by losing their remotely swapped page data. RAID-based (Patterson et al., 1988; Markatos and Dramitinos, 1996) reliability solutions promise the best alternative in terms of flexibility and performance. However, two important features of our network swapping system, Nswap (Newhall et al., 2003), make direct application of RAID-based schemes impossible. First, Nswap adapts to each node\´s local memory load, adjusting the amount of RAM space it makes available for remote swapping, which results in a variable capacity "backing store". Second, Nswap supports migration of remotely swapped pages between cluster nodes, which occurs when a node needs to reclaim some of its RAM from Nswap to use for local processing. Page migration complicates reliability if, for example, two pages in the same parity group end up on the same node. We present novel reliability algorithms that solve these problems. Our Parity algorithm uses dynamic parity group membership to match Nswap\´s dynamic nature. We show that our algorithms add minimal overhead to remote swapping.
Keywords :
RAID; random-access storage; reliability; resource allocation; storage management; workstation clusters; Nswap; RAID-based reliability solutions; RAM space; capacity backing store; cluster nodes; idle memory; memory load; network swapping systems; over-committed memory; page migration; page swapping; parity group; reliability algorithms; remote nodes; remote swapping; Clustering algorithms; Computer crashes; Conference management; Distributed computing; Educational institutions; Heuristic algorithms; Linux; Random access memory; Read-write memory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing, 2004 IEEE International Conference on
ISSN :
1552-5244
Print_ISBN :
0-7803-8694-9
Type :
conf
DOI :
10.1109/CLUSTR.2004.1392655
Filename :
1392655
Link To Document :
بازگشت