DocumentCode
3216499
Title
PSS: a novel statement scheduling mechanism for a high-performance SoC architecture
Author
Chu, Slo-Li
Author_Institution
Dept. of Inf. & Comput. Eng., Chung Yuan Christian Univ., Chung-Li, Taiwan
fYear
2004
fDate
7-9 July 2004
Firstpage
690
Lastpage
697
Abstract
Continuous improvements in semiconductor fabrication density are supporting new classes of system-on-a-chip (SoC) architectures that combine extensive processing logic/processor with high-density memory. Such architectures are generally called processor-in-memory (PIM) or intelligent memory (I-RAM) and can support high-performance computing by reducing the performance gap between the processor and the memory. The PIM architecture combines various processors in a single system. These processors are characterized by their computation and memory-access capabilities. Therefore, a strategy must be developed to identify their capabilities and dispatch the most appropriate jobs to them in order to exploit them fully. Accordingly, this study presents a new automatic source-to-source parallelizing system, called SAGE, to exploit the advantages of PIM architectures. Unlike conventional iteration-based parallelizing systems, SAGE adopts statement-based analyzing approaches. It adopts a new pair-selection scheduling (PSS) mechanism to achieve better utilization and workload balance between the host and memory processors of PIM architectures. This paper also provides performance results and comparison of several benchmarks to demonstrate the capability of this new scheduling algorithm.
Keywords
memory architecture; parallel algorithms; parallel architectures; processor scheduling; random-access storage; resource allocation; system-on-chip; I-RAM architecture; PIM architecture; SAGE; SoC architecture; intelligent memory; iteration-based parallelizing systems; memory processing; pair-selection scheduling; processor-in-memory; scheduling algorithm; semiconductor fabrication; source-to-source parallelizing system; statement scheduling; system-on-chip; workload balance; Computer architecture; Continuous improvement; Delay; Fabrication; High performance computing; Logic; Memory architecture; Processor scheduling; Random access memory; System-on-a-chip;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems, 2004. ICPADS 2004. Proceedings. Tenth International Conference on
ISSN
1521-9097
Print_ISBN
0-7695-2152-5
Type
conf
DOI
10.1109/ICPADS.2004.1316156
Filename
1316156
Link To Document