DocumentCode :
2068569
Title :
LE1: A Parameterizable VLIW Chip-Multiprocessor with Hardware PThreads Support
Author :
Stevens, David ; Chouliaras, Vassilios
Author_Institution :
Dept. of Electron. & Electr. Eng., Loughborough Univ., Loughborough, UK
fYear :
2010
fDate :
5-7 July 2010
Firstpage :
122
Lastpage :
126
Abstract :
We discuss LE1, a parameterized VLIW Chip Multiprocessor (CMP) adhering to the shared memory programmers model. LE1´s novelty lies in its ability to perform dynamic thread-spawning through hardware support for PThread-like primitives in addition to its substantial architectural and microarchitectural parameterization. Dynamic (hardware) thread creation is very fast and removes the need for an executive/OS, presenting to the application programmer a ´bare-metal´ multiprocessor, capable of exploiting all forms of parallelism. The core LE1 CPU is a configurable, 8-stage pipeline VLIW engine with a proprietary Instruction Set Architecture (ISA) supporting both partial and full predication and pipelined, multi-input, multi-output (MIMO) instruction extensions. The LE1 CMP is parameterizable as to the number of processors, their issue capability, internal microarchitectural features, functional unit mix and latency and the local memory system architecture. Preliminary results indicate near-linear performance improvement when executing a threaded version of the Mandelbrot calculation on 2-way and 4-way processor configurations with a 256 KB, 4-way banked tightly-coupled memory system. Similar trends are seen when executing a threaded matrix multiplication benchmark. We present these findings along with VLSI implementations of 4-way, dual-issue and 3-way, quad issue multiprocessor configurations.
Keywords :
matrix multiplication; microprocessor chips; multi-threading; pipeline processing; shared memory systems; 4-way banked tightly coupled memory system; 8-stage pipeline VLIW engine; Mandelbrot calculation; bare metal multiprocessor; core LE1 CPU; dynamic hardware thread creation; dynamic thread spawning; hardware PThreads support; instruction set architecture; local memory system architecture; microarchitectural features; microarchitectural parameterization; multi-input multi-output instruction extension; near linear performance improvement; parameterizable VLIW chip multiprocessor; shared memory programmers model; threaded matrix multiplication; Benchmark testing; Computer architecture; Hardware; Instruction sets; Message systems; Random access memory; Chip Multiprocessor (CMP); PThreads; VLIW; configurable; extensible CPUs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
VLSI (ISVLSI), 2010 IEEE Computer Society Annual Symposium on
Conference_Location :
Lixouri, Kefalonia
Print_ISBN :
978-1-4244-7321-2
Type :
conf
DOI :
10.1109/ISVLSI.2010.107
Filename :
5571794
Link To Document :
بازگشت