• DocumentCode
    2688175
  • Title

    Workload Balancing Methodology for Data-Intensive Applications with Divisible Load

  • Author

    Rosas, Claudia ; Morajko, Anna ; Jorba, Josep ; Cesar, Eduardo

  • Author_Institution
    Comput. Archit. & Oper. Syst., Univ. Autonoma de Barcelona, Barcelona, Spain
  • fYear
    2011
  • fDate
    26-29 Oct. 2011
  • Firstpage
    48
  • Lastpage
    55
  • Abstract
    Data-intensive applications are those that explore, query, analyze, and, in general, process very large data sets. Generally in High Performance Computing (HPC), the main performance problem associated to these applications is the load unbalance or inefficient resources utilization. This paper proposes a methodology for improving performance of data-intensive applications based on performing multiple data partitions prior to the execution, and ordering the data chunks according to their processing times during the application execution. As a first step, we consider that a single execution includes multiple related explorations on the same data set. Consequently, we propose to monitor the processing of each exploration and use the data gathered to dynamically tune the performance of the application. The tuning parameters included in the methodology are the partition factor of the data set, the distribution of these data chunks, and the number of processing nodes to be used by the application. The methodology has been initially tested using the well-known bioinformatics tool BLAST, obtaining encouraging results (up to a 40% of improvement).
  • Keywords
    bioinformatics; distributed processing; resource allocation; application execution; bioinformatics tool BLAST; data chunks; data set partition factor; data-intensive applications; divisible load; exploration processing monitoring; high performance computing; multiple data partitions; processing nodes; resources utilization; workload balancing methodology; Bioinformatics; Databases; Load management; Monitoring; Phase measurement; Tuning; DLT; data-intensive; load balancing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture and High Performance Computing (SBAC-PAD), 2011 23rd International Symposium on
  • Conference_Location
    Vitoria, Espirito Santo
  • ISSN
    1550-6533
  • Print_ISBN
    978-1-4577-2050-5
  • Type

    conf

  • DOI
    10.1109/SBAC-PAD.2011.15
  • Filename
    6106005