Title :
Evaluation Method of Synchronization for Shared-Memory On-Chip Many-Core Processor
Author :
Song, Fenglong ; Liu, Zhiyong ; Fan, Dongrui ; Huang, He ; Yuan, Nan ; Yu, Lei ; Zhang, Junchao
Author_Institution :
Key Lab. of Comput. Syst. & Archit., Chinese Acad. of Sci., Beijing, China
Abstract :
On-chip many core architecture is an emerging and promising computation platform. High speed on-chip communication and abundant chipped resources are two outstanding advantages of this architecture, which provide an opportunity to implement efficient synchronization scheme. The practical execution efficiency of synchronization scheme is critical to this platform. However, there are few researches on systematic evaluation method of choice synchronization schemes for on-chip many core processors, and effect of dedicated hardware support in this context. So we focus on the evaluation method and criterion of synchronization scheme on the platform. Firstly, we present several criterions proper to on-chip many core architecture, that is, absolute overhead of synchronization operation, the transferring time between different synchronization operations, overhead caused by load imbalance, and the network congestion caused by synchronization operation. Secondly, we illustrate how to design microbenchmarks which one dedicated to evaluate a performance criterion respectively. Finally, we implement these microbenchmarks and synchronization schemes on an on-chip many core processor with shared level-two cache and AMD Opteron commercial chip multi-processor, respectively. And we analyze effect of dedicated hardware support. Results show that the most overhead of synchronization is caused by load imbalance and serialization on synchronization point. It also shows that synchronization scheme supported with dedicated hardware can improve its performance obviously for chipped many-core processor.
Keywords :
benchmark testing; cache storage; memory architecture; microprocessor chips; performance evaluation; shared memory systems; synchronisation; system-on-chip; AMD Opteron commercial chip multi-processor; hardware support; load imbalance; microbenchmarks design; network congestion; on-chip many-core processor architecture; performance criterion; shared level-two cache; shared-memory; synchronization evaluation method; Application software; Computer architecture; Concurrent computing; Costs; Distributed computing; Distributed processing; Hardware; Network-on-a-chip; System-on-a-chip; Yarn; evaluation; hardware-supported; many-core architecture; micro-benchmark; synchronization;
Conference_Titel :
Parallel and Distributed Processing with Applications, 2009 IEEE International Symposium on
Conference_Location :
Chengdu
Print_ISBN :
978-0-7695-3747-4
DOI :
10.1109/ISPA.2009.6