DocumentCode :
1194389
Title :
Data-Driven Multithreading Using Conventional Microprocessors
Author :
Kyriacou, Costas ; Evripidou, Paraskevas ; Trancoso, Pedro
Author_Institution :
Comput. Sci. & Eng. Dept., Frederick Inst. of Technol.
Volume :
17
Issue :
10
fYear :
2006
Firstpage :
1176
Lastpage :
1188
Abstract :
This paper describes the data-driven multithreading (DDM) model and how it may be implemented using off-the-shelf microprocessors. Data-driven multithreading is a nonblocking multithreading execution model that tolerates internode latency by scheduling threads for execution based on data availability. Scheduling based on data availability can be used to exploit cache management policies that reduce significantly cache misses. Such policies include firing a thread for execution only if its data is already placed in the cache. We call this cache management policy the CacheFlow policy. The core of the DDM implementation presented is a memory mapped hardware module that is attached directly to the processor´s bus. This module is responsible for thread scheduling and is known as the thread synchronization unit (TSU). The evaluation of DDM was performed using simulation of the data-driven network of workstations (D2NOW). D2NOW is a DDM implementation built out of regular workstations augmented with the TSU. The simulation was performed for nine scientific applications, seven of which belong to the SPLASH-2 suite. The results show that DDM can tolerate well both the communication and synchronization latency. Overall, for 16 and 32-node D2NOW machines the speedup observed was 14.4 and 26.0, respectively
Keywords :
cache storage; multi-threading; processor scheduling; synchronisation; workstation clusters; CacheFlow policy; cache management; data availability; data-driven multithreading; data-driven network-of-workstation; memory mapped hardware module; off-the-shelf microprocessor; scheduling; thread synchronization unit; Application software; Computer architecture; Delay; Distributed decision making; Hardware; Microprocessors; Multithreading; Processor scheduling; Workstations; Yarn; Dataflow; cache prefetching; high performance computing.; multiprocessors; multithreading; network of workstations; nonblocking threads;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2006.136
Filename :
1687886
Link To Document :
بازگشت