DocumentCode
3507712
Title
Memory Parallelism Using Custom Array Mapping to Heterogeneous Storage Structures
Author
Baradaran, Nastaran ; Diniz, Pedro C.
Author_Institution
University of Southern California / Information Sciences Institute, Marina del Rey, California, USA, e-mail: nastaran@isi.edu
fYear
2006
fDate
Aug. 2006
Firstpage
1
Lastpage
6
Abstract
Configurable architectures offer the unique opportunity of customizing the storage allocation to meet specific applications¿ needs. In this paper we describe a compiler approach to map the arrays of a loop-based computation to internal memories of a configurable architecture with the objective of minimizing the overall execution time. We present an algorithm that considers the data access patterns of the arrays along the critical path of the computation as well as the available storage and memory bandwidth. We demonstrate experimental results of the application of this approach for a set of kernel codes when targeting a Field-Programmable Gate-Array (FPGA). The results reveal that our algorithm outperforms naive and custom data layouts for these kernels by an average of 33% and 15% in terms of execution time, while taking into account the available hardware resources.
Keywords
Algorithm design and analysis; Bandwidth; Computer architecture; Field programmable gate arrays; Hardware; Kernel; Parallel processing; Processor scheduling; Random access memory; Signal processing algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
Field Programmable Logic and Applications, 2006. FPL '06. International Conference on
Conference_Location
Madrid
Print_ISBN
1-4244-0312-X
Type
conf
DOI
10.1109/FPL.2006.311241
Filename
4101003
Link To Document