مرکز منطقه ای اطلاع رساني علوم و فناوري - TX: Algorithmic Energy Saving for Distributed Dense Matrix Factorizations

DocumentCode :

234504

Title :

TX: Algorithmic Energy Saving for Distributed Dense Matrix Factorizations

Author :

Li Tan ; Zizhong Chen

fYear :

2014

fDate :

17-17 Nov. 2014

Firstpage :

Lastpage :

Abstract :

The pressing demands of improving energy efficiency for high performance scientific computing have motivated a large body of solutions using Dynamic Voltage and Frequency Scaling (DVFS) that strategically switch processors to low-power states, if the peak processor performance is unnecessary. Although OS level solutions have demonstrated the effectiveness of saving energy in a black-box fashion, for applications with variable execution patterns, the optimal energy efficiency can be blundered away due to defective prediction mechanism and untapped load imbalance. In this paper, we propose TX, a library level race-tohalt DVFS scheduling approach that analyzes Task Dependency Set of each task in distributed Cholesky/LU/QR factorizations to achieve substantial energy savings OS level solutions cannot fulfill. Partially giving up the generality of OS level solutions per requiring library level source modification, TX leverages algorithmic characteristics of the applications to gain greater energy savings. Experimental results on two clusters indicate that TX can save up to 17.8% more energy than state-of-the-art OS level solutions with negligible 3.5% on average performance loss.

Keywords :

energy conservation; matrix decomposition; operating systems (computers); parallel processing; power aware computing; scheduling; OS level solutions; TX; algorithmic energy saving; average performance loss; black-box fashion; defective prediction mechanism; distributed Cholesky-LU-QR factorizations; distributed dense matrix factorizations; dynamic voltage and frequency scaling; high performance scientific computing; library level race-tohalt DVFS scheduling approach; library level source modification; low-power states; optimal energy efficiency; peak processor performance; switch processors; task dependency set analysis; untapped load imbalance; variable execution patterns; Graphics processing units; Hardware; Libraries; Prediction algorithms; Runtime; Scheduling; Time-frequency analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA), 2014 5th Workshop on

Conference_Location :

New Orleans, LA

Type :

conf

DOI :

10.1109/ScalA.2014.7

Filename :

7016730

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=234504