• DocumentCode
    2481124
  • Title

    Robust sequential resource allocation in heterogeneous distributed systems with random compute node failures

  • Author

    Shestak, Vladimir ; Chong, Edwin K P ; Maciejewski, Anthony A. ; Siegel, Howard Jay

  • Author_Institution
    InfoPrint Solutions Co., Boulder, CO, USA
  • fYear
    2009
  • fDate
    23-29 May 2009
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    The problem of finding efficient workload distribution techniques is becoming increasingly important today for heterogeneous distributed systems where the availability of compute nodes may change spontaneously over time. Therefore, the resource-allocation policy must be designed to be robust with respect to absence and re-emergence of compute nodes so that the performance of the system is maximized. Such a policy is developed in this work, and its performance is evaluated on a model of a dedicated system composed of a limited set of heterogeneous Web servers. Assuming that each HTML request results in a "reward" if completed before its hard deadline, the goal is to maximize a cumulative reward obtained in the system. A failure rate for each server is set relatively high to simulate its operation under harsh conditions. The results demonstrate that the proposed approach based on the concepts of the Derman-Lieberman-Ross theorem outperforms other policies compared in our experiments for inconsistent, processor-consistent, and task-processor-consistent types of heterogeneity.
  • Keywords
    Internet; fault tolerant computing; file servers; resource allocation; system recovery; Derman-Lieberman-Ross theorem; HTML; cumulative reward; heterogeneous Web server; heterogeneous distributed system; random compute node failure; sequential resource allocation; workload distribution technique; Active appearance model; Availability; Computer science; Distributed computing; Mathematics; Resource management; Road transportation; Robustness; Uncertainty; Web server;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
  • Conference_Location
    Rome
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-3751-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2009.5160916
  • Filename
    5160916