PSS: a novel statement scheduling mechanism for a high-performance SoC architecture

Author

Chu, Slo-Li

Author_Institution

Dept. of Inf. & Comput. Eng., Chung Yuan Christian Univ., Chung-Li, Taiwan

fYear

2004

fDate

7-9 July 2004

Firstpage

690

Lastpage

697

Abstract

Continuous improvements in semiconductor fabrication density are supporting new classes of system-on-a-chip (SoC) architectures that combine extensive processing logic/processor with high-density memory. Such architectures are generally called processor-in-memory (PIM) or intelligent memory (I-RAM) and can support high-performance computing by reducing the performance gap between the processor and the memory. The PIM architecture combines various processors in a single system. These processors are characterized by their computation and memory-access capabilities. Therefore, a strategy must be developed to identify their capabilities and dispatch the most appropriate jobs to them in order to exploit them fully. Accordingly, this study presents a new automatic source-to-source parallelizing system, called SAGE, to exploit the advantages of PIM architectures. Unlike conventional iteration-based parallelizing systems, SAGE adopts statement-based analyzing approaches. It adopts a new pair-selection scheduling (PSS) mechanism to achieve better utilization and workload balance between the host and memory processors of PIM architectures. This paper also provides performance results and comparison of several benchmarks to demonstrate the capability of this new scheduling algorithm.

Keywords

memory architecture; parallel algorithms; parallel architectures; processor scheduling; random-access storage; resource allocation; system-on-chip; I-RAM architecture; PIM architecture; SAGE; SoC architecture; intelligent memory; iteration-based parallelizing systems; memory processing; pair-selection scheduling; processor-in-memory; scheduling algorithm; semiconductor fabrication; source-to-source parallelizing system; statement scheduling; system-on-chip; workload balance; Computer architecture; Continuous improvement; Delay; Fabrication; High performance computing; Logic; Memory architecture; Processor scheduling; Random access memory; System-on-a-chip;

fLanguage

English

Publisher

ieee

Conference_Titel

Parallel and Distributed Systems, 2004. ICPADS 2004. Proceedings. Tenth International Conference on

ISSN

1521-9097

Print_ISBN

0-7695-2152-5

Type

conf

DOI

10.1109/ICPADS.2004.1316156

Filename

1316156