DocumentCode :
3204607
Title :
Hardware-Based Job Queue Management for Manycore Architectures and OpenMP Environments
Author :
Lee, Junghee ; Nicopoulos, Chrysostomos ; Lee, Yongjae ; Lee, Hyung Gyu ; Kim, Jongman
fYear :
2011
fDate :
16-20 May 2011
Firstpage :
407
Lastpage :
418
Abstract :
The seemingly interminable dwindle of technology feature sizes well into the nano-scale regime has afforded computer architects with an abundance of computational resources on a single chip. The Chip Multi-Processor (CMP) paradigm is now seen as the de facto architecture for years to come. However, in order to efficiently exploit the increasing number of on-chip processing cores, it is imperative to achieve and maintain efficient utilization of the resources at run time. Uneven and skewed distribution of workloads misuses the CMP resources and may even lead to such undesired effects as traffic and temperature hotspots. While existing techniques rely mostly on software for the undertaking of load balancing duties and exploit hardware mainly for synchronization, we will demonstrate that there are wider opportunities for hardware support of load balancing in CMP systems. Based on this fact, this paper proposes IsoNet, a conflict-free dynamic load distribution engine that exploits hardware aggressively to reinforce massively parallel computation in many core settings. Moreover, the proposed architecture provides extensive fault-tolerance against both CPU faults and intra-IsoNet faults. The hardware takes charge of both (1) the management of the list of jobs to be executed, and (2) the transfer of jobs between processing elements to maintain load balance. Experimental results show that, unlike the existing popular techniques of blocking and job stealing, IsoNet is scalable with as many as 1024 processing cores.
Keywords :
fault tolerance; microprocessor chips; multiprocessing systems; resource allocation; IsoNet; OpenMP environments; chip multiprocessor paradigm; conflict-free dynamic load distribution engine; fault tolerance; hardware-based job queue management; load balancing; manycore architectures; resource utilization; Fault tolerance; Fault tolerant systems; Hardware; Instruction sets; Load management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
Conference_Location :
Anchorage, AK
ISSN :
1530-2075
Print_ISBN :
978-1-61284-372-8
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2011.47
Filename :
6012811
Link To Document :
بازگشت