DocumentCode
2386439
Title
Distributed Popularity Based Replica Placement in Data Grid Environments
Author
Shorfuzzaman, Mohammad ; Graham, Peter ; Eskicioglu, Rasit
Author_Institution
Dept. of Comput. Sci., Univ. of Manitoba, Winnipeg, MB, Canada
fYear
2010
fDate
8-11 Dec. 2010
Firstpage
66
Lastpage
77
Abstract
Data grids support distributed data-intensive applications that need to access massive datasets stored around the world. Ensuring efficient access to such datasets is hindered by the high latencies of wide-area networks. To speed up access, files can be replicated so a user can access a nearby replica. Replication also provides improved availability, decreased bandwidth use, increased fault tolerance, and improved scalability. Since a grid environment is dynamic, resource availability, network latency, and user requests may change. To address these issues a dynamic replica placement strategy that adapts to changing behaviour is needed. In this paper, we introduce a highly distributed replica placement algorithm for hierarchical data grids. Our algorithm exploits data access histories to identify popular files and determines optimal replication locations to improve access performance by minimizing replication overhead (access and update) assuming a given traffic pattern. The problem is formulated using dynamic programming. We evaluate our algorithm using the OptorSim simulator and find that it offers shorter execution time and reduced bandwidth consumption compared to other dynamic replica placement methods.
Keywords
dynamic programming; grid computing; information retrieval; minimisation; replicated databases; OptorSim simulator; bandwidth consumption; data access; data grid environment; distributed data-intensive applications; distributed replica placement algorithm; dynamic programming; hierarchical data grids; optimal replication locations; replication overhead; traffic pattern; wide area networks; Bandwidth; Cost function; Data models; Dynamic programming; Heuristic algorithms; Peer to peer computing; Servers; data grids; distributed algorithms; dynamic programming; file popularity; replication;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2010 International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-9110-0
Electronic_ISBN
978-0-7695-4287-4
Type
conf
DOI
10.1109/PDCAT.2010.78
Filename
5704405
Link To Document