• DocumentCode
    634817
  • Title
    Optimizing communication and cooling costs in HPC data centers via intelligent job allocation
  • Author
    Kaplan, Fulya ; Meng, Jie ; Coskun, Ayse K.
  • Author_Institution
    Electr. & Comput. Eng. Dept., Boston Univ., Boston, MA, USA
  • fYear
    2013
  • fDate
    27-29 June 2013
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Nearly half of the energy in today's computing clusters is consumed by the cooling infrastructure. Cooling cost can be reduced by allowing data center temperatures to rise; however, component reliability constraints impose thermal thresholds, because failure rates depend exponentially on processor temperatures. Existing thermally aware job allocation policies optimize cooling costs by minimizing the peak inlet temperatures of the server nodes. An important constraint in high performance computing (HPC) data centers, however, is performance. Specifically, HPC data centers run multi-threaded applications with significant communication among the threads, so the performance of such applications is strongly affected by job allocation decisions. This paper proposes a novel job allocation methodology that jointly minimizes the communication cost of an HPC application and reduces the cooling energy. The proposed method also considers temperature-dependent hardware reliability as part of the optimization.
  • Keywords
    computer centres; cooling; energy conservation; parallel processing; resource allocation; temperature; HPC data centers; communication cost optimization; component reliability constraints; computing clusters; cooling cost optimization; cooling energy reduction; cost reduction; data center temperatures; high performance computing; intelligent job allocation; multithreaded applications; processor temperatures; temperature-dependent hardware reliability; thermal thresholds; thread communication; Computational modeling; Cooling; Reliability; Resource management; Servers; Temperature distribution
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Green Computing Conference (IGCC), 2013 International
  • Conference_Location
    Arlington, VA
  • Type
    conf
  • DOI
    10.1109/IGCC.2013.6604521
  • Filename
    6604521