DocumentCode :
3016728
Title :
Fast Address Translation Techniques for Distributed Shared Memory Compilers
Author :
Cantonnet, François ; El-Ghazawi, Tarek A. ; Lorenz, Pascal ; Gaber, Jaafer
Author_Institution :
Dept. of Electr. & Comput. Eng., George Washington Univ., DC, USA
fYear :
2005
fDate :
04-08 April 2005
Abstract :
The Distributed Shared Memory (DSM) model is designed to leverage the ease of programming of the shared memory paradigm, while enabling high performance by expressing locality as in the message-passing model. Experience, however, has shown that DSM programming languages, such as UPC, may be unable to deliver the expected high level of performance. Initial investigations have shown that one of the major reasons is the overhead of translating from the UPC memory model to the virtual address space of the target architecture, which can be very costly. Experimental measurements show that this overhead can increase execution time by up to three orders of magnitude. Previous work has also shown that some of this overhead can be avoided by hand-tuning, which, however, can significantly reduce UPC's ease of use. In addition, such tuning can only improve the performance of local shared accesses, not remote shared accesses. Therefore, a new technique that resembles Translation Lookaside Buffers (TLBs) is proposed here. This technique, called the Memory Model Translation Buffer (MMTB), has been implemented in the GCC-UPC compiler using two alternative strategies, full-table (FT) and reduced-table (RT). It will be shown that the MMTB strategies can lead to a performance boost of up to 700%, preserving ease of programming while performing at a level similar to hand-tuned UPC and MPI codes.
Keywords :
C language; buffer storage; distributed shared memory systems; message passing; parallel programming; program compilers; storage allocation; table lookup; DSM programming language; GCC-UPC compiler; MMTB; MPI code; TLB; distributed shared memory compiler; fast address translation technique; full-table; hand-tuned UPC; memory model translation buffer; message-passing model; reduced-table; translation look aside buffer; virtual address space; Computer languages; Electronics packaging; Hardware; ISO standards; Message passing; Parallel programming; Performance gain; Program processors; Programming profession; Time measurement;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN :
0-7695-2312-9
Type :
conf
DOI :
10.1109/IPDPS.2005.219
Filename :
1419873