DocumentCode :
3331386
Title :
Reliable and randomized data distribution strategies for large scale storage systems
Author :
Miranda, Alberto ; Effert, Sascha ; Kang, Yangwook ; Miller, Ethan L. ; Brinkmann, Andre ; Cortes, Toni
Author_Institution :
Barcelona Supercomput. Center (BSC), Barcelona, Spain
fYear :
2011
fDate :
18-21 Dec. 2011
Firstpage :
1
Lastpage :
10
Abstract :
The ever-growing amount of data requires highly scalable storage solutions. The most flexible approach is to use storage pools that can be expanded and scaled down by adding or removing storage devices. To make this approach usable, it is necessary to provide a solution to locate data items in such a dynamic environment. This paper presents and evaluates the Random Slicing strategy, which incorporates lessons learned from table-based, rule-based, and pseudo-randomized hashing strategies and is able to provide a simple and efficient strategy that scales up to handle exascale data. Random Slicing keeps a small table with information about previous storage system insert and remove operations, drastically reducing the required amount of randomness while delivering a perfect load distribution.
Keywords :
file organisation; knowledge based systems; large scale storage systems; pseudorandomized hashing strategies; random slicing strategy; randomized data distribution strategies; rule based hashing strategies; scalable storage solutions; storage devices; storage pools; table based hashing strategies; Distribution strategy; Hard disks; Memory management; Partitioning algorithms; Redundancy; Scalability; Servers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing (HiPC), 2011 18th International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4577-1951-6
Electronic_ISBN :
978-1-4577-1949-3
Type :
conf
DOI :
10.1109/HiPC.2011.6152745
Filename :
6152745
Link To Document :
بازگشت