Title :
Accessing hardware performance counters in order to measure the influence of cache on the performance of integer sorting
Author :
Cérin, Christophe ; Fkaier, Hazem ; Jemni, Mohamed
Author_Institution :
Univ. de Picardie Jules Verne, Amiens, France
Abstract :
Hardware performance counters are available on most modern microprocessors. These counters are implemented as a small set of registers that count events related to the processor´s functions. The Perfctr toolkit is one of the most popular toolkits (for x86 processors) for monitoring these events. In this paper, it is used to discover the impact of L1 data cache misses on the overall performance of six integer sorting algorithms. Most of them are cache conscious algorithms recently introduced, or known to behave well according to previous simulations, or they are totally not explored. We demonstrate through experiments on an Athlon processor that a good balance between L1 data cache misses and retired instructions provides the fastest algorithm for sorting in practical cases. The fastest sorting algorithm is not obtained with the implementation that gives the smallest number of misses and the smallest number of instructions. The fastest algorithm in practice is thus a new flavour of merge-sort that we have developed and it beats its rival. Keywords: hardware performance counters, cache conscious and oblivious algorithms, in-core sorting algorithms, two levels memory hierarchy, parallelism at the chip level.
Keywords :
cache storage; merging; microprocessor chips; sorting; Athlon processor; Perfctr toolkit; cache; data cache misses; hardware performance counters; integer sorting; memory hierarchy; microprocessors; oblivious algorithms; Algorithm design and analysis; Concurrent computing; Counting circuits; Hardware; Microprocessors; Monitoring; Parallel processing; Performance analysis; Registers; Sorting;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2003. Proceedings. International
Print_ISBN :
0-7695-1926-1
DOI :
10.1109/IPDPS.2003.1213491