DocumentCode :
1011763
Title :
Dynamic data layouts for cache-conscious implementation of a class of signal transforms
Author :
Park, Neungsoo ; Prasanna, Viktor K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Konkuk Univ., Seoul, South Korea
Volume :
52
Issue :
7
fYear :
2004
fDate :
7/1/2004
Firstpage :
2120
Lastpage :
2134
Abstract :
Effective utilization of cache memories is a key factor in achieving high performance for computing large signal transforms. Nonunit stride access in the computation of large signal transforms results in poor cache performance, leading to severe degradation in the overall performance. In this paper, we develop a cache-conscious technique, called a dynamic data layout, to improve the performance of large signal transforms. In our approach, data reorganization is performed between computation stages to reduce cache misses. We develop an efficient search algorithm to determine an optimal tree with the minimum execution time among possible factorization trees based on the size of the signal transform and the data access stride. Our approach is applied to compute the fast Fourier transform (FFT) and the Walsh-Hadamard transform (WHT). Experiments were performed on Alpha 21264, MIPS R10000, UltraSPARC III, and Pentium 4. Experimental results show that our FFT and WHT achieve performance improvement of up to 3.52 times over other state-of-the-art FFT and WHT packages. The proposed optimization is portable across various platforms.
Keywords :
Hadamard transforms; cache storage; fast Fourier transforms; optimisation; signal processing; tree data structures; tree searching; Alpha 21264; FFT; MIPS R10000; Pentium 4; UltraSPARC III; WHT; Walsh-Hadamard transform; cache memories; cache-conscious implementation; data access stride; data reorganization; dynamic data layouts; efficient search algorithm; factorization trees; fast Fourier transform; large signal transforms; linear discrete signal processing; minimum execution time; optimal tree; optimization technique; Cache memory; Computer architecture; Computer science; Degradation; Fast Fourier transforms; High performance computing; Packaging; Software architecture; Subcontracting; Tree data structures;
fLanguage :
English
Journal_Title :
Signal Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1053-587X
Type :
jour
DOI :
10.1109/TSP.2004.828946
Filename :
1306661