DocumentCode :
125599
Title :
Understanding the Data Traffic of Uncore in Westmere NUMA Architecture
Author :
Qiuming Luo ; Chang Kong ; Yuanyuan Zhou ; Guoqiang Liu ; Chenjian Liu
Author_Institution :
Nat. High Performance Comput. Center (NHPCC), Shenzhen Univ., Shenzhen, China
fYear :
2014
fDate :
12-14 Feb. 2014
Firstpage :
392
Lastpage :
399
Abstract :
Non-Uniform Memory Access (NUMA) has become the main stream architecture of modern servers. In processors, Uncore part plays a very important role, especially in NUMA systems, because it is used to connect Cores, Last Level Caches (LLC), on-chip multiple Memory Controllers (MCs) and highspeed interconnections. Recent study shows that Uncore congestion plays a more important role than locality. It needs more understanding of Uncore behavior to alleviate the congestion and efficiently utilize certain architecture. Our work focuses on the unbalance and congestion of data traffic happened on processor\´s Uncore part. We choose an Intel NUMA architecture named "Westmere" and use hardware performance counters to investigate several benchmarks\´ data flow in Uncore. In our experiments we find that data unbalance of Global Queue (GQ) and QuickPath Home Logical (QHL)\´s trackers is really serious, the biggest unbalance rate is more than 1000 times, new dynamic entries management algorithm is needed to improve entries\´ usage the congestion of GQ and QHL\´s trackers has different behaviors with threads number increases and also for a given memory access pattern the congestion of GQ and QHL\´s trackers grows linearly with the problem size increases.
Keywords :
multiprocessing systems; QuickPath home logical; data flow; data traffic; dynamic entries management algorithm; global queue; last level caches; nonuniform memory access; on-chip multiple memory controllers; uncore congestion; westmere NUMA architecture; Benchmark testing; Computer architecture; Hardware; Monitoring; Phasor measurement units; Program processors; Sockets; NUMA; Performance Monitoring Unit; Uncore; congestion; unbalance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2014 22nd Euromicro International Conference on
Conference_Location :
Torino
ISSN :
1066-6192
Type :
conf
DOI :
10.1109/PDP.2014.71
Filename :
6787304
Link To Document :
بازگشت