• DocumentCode
    2647038
  • Title

    Distributed Placement of Replicas in Hierarchical Data Grids with User and System QoS Constraints

  • Author

    Shorfuzzaman, Mohammad ; Graham, Peter ; Eskicioglu, Rasit

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Manitoba, Winnipeg, MB, Canada
  • fYear
    2011
  • fDate
    26-28 Oct. 2011
  • Firstpage
    177
  • Lastpage
    186
  • Abstract
    Data grids support distributed data-intensive applications that need to access massive datasets stored around the world. Ensuring efficient access to such datasets is hindered by the high latencies of wide-area networks. To speed up access, files can be replicated so a user can access a nearby replica. Much of the work on the replica placement problem in data grids has focused on average system performance and ignored quality assurance issues. In the existing work that considers QoS, a simplified replication model is often assumed, therefore, resulting solutions may not be applicable to real systems. In this paper, we introduce a more realistic model for replica placement in hierarchical Data Grids which determines the positions of a minimum number of replicas expected to satisfy certain quality requirements both from user and system perspectives. Our placement algorithm is based on a highly distributed and decentralized technique that exploits the data access history for popular data files and computes replica locations by minimizing overall replication cost (read and update) while maximizing QoS satisfaction for a given traffic pattern. The problem is formulated using dynamic programming. We assess our algorithm using OptorSim. Simulation results demonstrate the effectiveness of our replica placement technique considering various factors such as storage and workload constraints of replica servers, link capacity constraints, user QoS requirements, etc.
  • Keywords
    data analysis; distributed algorithms; dynamic programming; grid computing; information retrieval; quality assurance; quality of service; replicated databases; storage management; OptorSim; QoS satisfaction; average system performance; data access history; decentralized technique; distributed data-intensive applications; distributed placement; distributed technique; dynamic programming; hierarchical data grids; link capacity constraints; nearby replica; overall replication cost; placement algorithm; popular data files; quality assurance issues; quality requirements; realistic model; replica locations; replica placement problem; replica placement technique; replica servers; replicas; replication model; system QoS constraints; system perspective; traffic pattern; user QoS constraints; user QoS requirements; user perspective; wide-area networks; Bandwidth; Cost function; Equations; Heuristic algorithms; Mathematical model; Quality of service; Servers; data grids; distributed algorithm; dynamic programming; quality of service; replication; workload and link constraints;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), 2011 International Conference on
  • Conference_Location
    Barcelona
  • Print_ISBN
    978-1-4577-1448-1
  • Type

    conf

  • DOI
    10.1109/3PGCIC.2011.35
  • Filename
    6103156