Title :
Choosing replica placement heuristics for wide-area systems
Author :
Karlsson, Magnus ; Karamanolis, Christos
Author_Institution :
Storage Syst. Dept., HP Lab., Palo Alto, CA, USA
Abstract :
Data replication is used extensively in wide-area distributed systems to achieve low data-access latency. A large number of heuristics have been proposed to perform replica placement. Practical experience indicates that the choice of heuristic makes a big difference in terms of the cost of required infrastructure (e.g., storage capacity and network bandwidth), depending on system topology, workload and performance goals. We describe a method to help system designers choose placement heuristics that meet their performance goals for the lowest possible infrastructure cost. Existing heuristics are classified according to a number of properties. The inherent cost (lower bound) for each class of heuristics is obtained for a given system, workload and performance goals. The system designer compares different classes of heuristics on the basis of these lower bounds. Experimental results show that choosing a heuristic with the proposed methodology results in up to 7 times lower cost compared to using an "obvious" heuristic, such as caching.
Keywords :
bandwidth allocation; cache storage; replicated databases; wide area networks; caching; data replication; data-access latency; network bandwidth; storage capacity; system topology; wide-area distributed system; Bandwidth; Costs; Delay; Distributed computing; Laboratories; Linear programming; NP-hard problem; Network topology; Video on demand;
Conference_Title :
Distributed Computing Systems, 2004. Proceedings. 24th International Conference on
Print_ISBN :
0-7695-2086-3
DOI :
10.1109/ICDCS.2004.1281600