DocumentCode :
3048667
Title :
Memory performance model for loops and kernels on Power3 processors
Author :
Pfeiffer, Wayne
Author_Institution :
San Diego Supercomputer Center, La Jolla, CA, USA
fYear :
2004
fDate :
26-30 April 2004
Firstpage :
252
Abstract :
Summary form only given. A performance model for loops and kernels limited by memory access is developed that is applicable to Power3 processors. The output of the model is the time delay arising from cache and TLB misses. The input variables are the miss rates of each cache and the TLB, while the model parameters are the miss penalties of each cache and the TLB. Load misses are treated separately from store misses and typically have smaller penalties because of prefetching. The parameters have been obtained by fits to data from simple test loops measured with a hardware performance monitor. Results are presented for two types of Power3 processor running in serial as well as for one of the processor types running in parallel. For codes limited by store misses, the model fits the data very well. For codes limited by load misses, the model shows greater variability relative to the data, presumably because of the limited treatment of prefetching.
Keywords :
cache storage; operating system kernels; parallel processing; Power3 processors; cache miss rates; kernels; memory performance model; parallel processing; prefetching; Application software; Bandwidth; Condition monitoring; Delay effects; Hardware; Input variables; Kernel; Prefetching; Supercomputers; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
Type :
conf
DOI :
10.1109/IPDPS.2004.1303314
Filename :
1303314
Link To Document :
بازگشت