• DocumentCode
    1984770
  • Title

    Dynamic Load Balancing in Data Grids by Global Load Estimation

  • Author

    Rupprecht, Lukas ; Reiser, Angelika ; Kemper, Alfons

  • Author_Institution
    Dept. of Comput. Sci., Tech. Univ. München, Munich, Germany
  • fYear
    2012
  • fDate
    25-29 June 2012
  • Firstpage
    243
  • Lastpage
    250
  • Abstract
    Peer-to-Peer (P2P) technology can be utilized to combine remote resources and build distributed, high-performance database systems, called data grids, which help to handle the rapidly increasing volumes of data produced by disciplines like astrophysics, biology, or geology. One major challenge for data grids is skewed query patterns, which cause load imbalances and heavily diminish performance and availability. To avoid hot spots, sophisticated load balancing techniques are required. We present a dynamic replication strategy which prevents hot spots by dynamically replicating the hot data to different locations. The main questions of such a strategy are when to copy which data to which receivers and when to delete the copies. To answer these questions, we propose a low-overhead, decentralized method which is able to deliver a highly accurate estimate of the global load and the single peer loads to all clients. We use that information in an optimization problem to determine the data to be replicated and the optimal replica receivers. A simulated performance evaluation based on a real-world scenario demonstrates the effectiveness of the approach.
  • Keywords
    data handling; grid computing; optimisation; peer-to-peer computing; query processing; replicated databases; resource allocation; software performance evaluation; P2P technology; data copying; data grids; distributed high performance database system; dynamic data replication; dynamic load balancing; dynamic replication strategy; global load estimation; low-overhead decentralized method; optimal replica receivers; optimization problem; peer-to-peer technology; remote resource utilization; simulated performance evaluation; single peer load estimation; skewed query patterns; Computer integrated manufacturing; Distributed databases; Load management; Load modeling; Optimization; Peer to peer computing; Receivers; dynamic replication; load balancing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Computing (ISPDC), 2012 11th International Symposium on
  • Conference_Location
    Munich/Garching, Bavaria
  • Print_ISBN
    978-1-4673-2599-8
  • Type

    conf

  • DOI
    10.1109/ISPDC.2012.40
  • Filename
    6341518