DocumentCode
332941
Title
Towards an optimal bit-reversal permutation program
Author
Carter, Larry ; Gatlin, Kang Su
Author_Institution
Dept. of Comput. Sci., California Univ., San Diego, La Jolla, CA, USA
fYear
1998
fDate
8-11 Nov 1998
Firstpage
544
Lastpage
553
Abstract
The speed of many computations is limited not by the number of arithmetic operations but by the time it takes to move and rearrange data in the increasingly complicated memory hierarchies of modern computers. Array transpose and the bit-reversal permutation, trivial operations on a RAM, present non-trivial problems when designing highly tuned scientific library functions, particularly for the Fast Fourier Transform. We prove a precise bound for RoCol, a simple pebble-type game that is relevant to implementing these permutations. We use RoCol to give lower bounds on the amount of memory traffic in a computer with four levels of memory (registers, cache, TLB, and memory), taking into account such “messy” features as block moves and set-associative caches. The insights from this analysis lead to a bit-reversal algorithm whose performance is close to the theoretical minimum. Experiments show that it performs significantly better than every program in a comprehensive study of 30 published algorithms.
Keywords
computational complexity; game theory; RoCol; arithmetic operations; bit-reversal algorithm; fast Fourier transform; memory hierarchies; optimal bit-reversal permutation program; pebble-type game; Argon; Computational modeling; Computer science; Costs; Drives; Libraries; Random access memory; Read only memory; Read-write memory; Registers
fLanguage
English
Publisher
IEEE
Conference_Titel
Foundations of Computer Science, 1998. Proceedings. 39th Annual Symposium on
Conference_Location
Palo Alto, CA
ISSN
0272-5428
Print_ISBN
0-8186-9172-7
Type
conf
DOI
10.1109/SFCS.1998.743505
Filename
743505