Title :
SMTp: an architecture for next-generation scalable multi-threading
Author :
Chaudhuri, Mainak ; Heinrich, Mark
Author_Institution :
Comput. Syst. Lab., Cornell Univ., Ithaca, NY, USA
Abstract :
We introduce the SMTp architecture - an SMT processor augmented with a coherence protocol thread context, that together with a standard integrated memory controller can enable the design of (among other possibilities) scalable cache-coherent hardware distributed shared memory (DSM) machines from commodity nodes. We describe the minor changes needed to a conventional out-of-order multi-threaded core to realize SMTp, discussing issues related to both deadlock avoidance and performance. We then compare SMTp performance to that of various conventional DSM machines with normal SMT processors both with and without integrated memory controllers. On configurations from 1 to 32 nodes, with 1 to 4 application threads per node, we find that SMTp delivers performance comparable to, and sometimes better than, machines with more complex integrated DSM-specific memory controllers. Our results also show that the protocol thread has extremely low pipeline overhead. Given the simplicity and the flexibility of the SMTp mechanism, we argue that next-generation multi-threaded processors with integrated memory controllers should adopt this mechanism as a way of building less complex high-performance DSM multiprocessors.
Keywords :
concurrency control; distributed shared memory systems; memory architecture; memory protocols; microprocessor chips; multi-threading; pipeline processing; DSM machines; DSM-specific memory controllers; SMT processor; SMT processors; SMTp architecture; cache-coherent hardware distributed shared memory; coherence protocol thread; commodity nodes; deadlock avoidance; high-performance DSM multiprocessors; integrated memory controller; multithreaded processors; next-generation scalable multithreading; pipeline overhead; Coherence; Computer architecture; Control systems; Costs; Fabrics; Hardware; Large-scale systems; Protocols; Surface-mount technology; Yarn;
Conference_Titel :
Computer Architecture, 2004. Proceedings. 31st Annual International Symposium on
Print_ISBN :
0-7695-2143-6
DOI :
10.1109/ISCA.2004.1310769