DocumentCode :
3016616
Title :
Supporting Load Balancing and Efficient Reorganization During System Scaling
Author :
Zhu, Feng ; Sun, Xiaowei ; Salzberg, Betty ; Hvasshovd, Svein-Olaf
Author_Institution :
Coll. of Comput. & Inf. Sci., Northeastern Univ., Boston, MA, USA
fYear :
2005
fDate :
04-08 April 2005
Abstract :
Reorganization is constantly needed to maintain load balance as distributed storage systems scale up and down. To support load balancing and efficient reorganization during system scaling, we propose a new hashing method called Prime Based Hashing (PBH) that can be used for data allocation in large distributed systems. PBH distributes objects among storage units based on residues (congruence) of hash-transformed key values modulo prime numbers. PBH provides nearly perfect load balancing: it distributes objects evenly and rebalances to preserve the even distribution as the system scales. At the same time, it facilitates cost-effective reorganization by minimizing data migration during system scaling. Locating an object in PBH is fast, using low-complexity computations that require only knowledge of the total number of storage units. We also propose a local data clustering method to couple with PBH to make reorganization more efficient. Objects are clustered according to the order of migration so that only the part of the data that needs to be migrated is scanned. In addition, we show that by storing a small amount of pre-computed information, ordering of objects for clustering can be very efficient. We demonstrate through analysis and experiments the effectiveness of our algorithms.
Keywords :
communication complexity; data analysis; distributed object management; number theory; pattern clustering; resource allocation; storage management; PBH; data clustering method; distributed storage systems scaling; load balancing; modulo prime numbers; prime based hashing; Algorithm design and analysis; Clustering algorithms; Clustering methods; Costs; Distributed computing; Educational institutions; Hardware; Information science; Load management; Sun
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN :
0-7695-2312-9
Type :
conf
DOI :
10.1109/IPDPS.2005.414
Filename :
1419869