DocumentCode
1986047
Title
Performance and area modeling of complete FPGA designs in the presence of loop transformations
Author
Shayee, K. R Shesha ; Park, Joonseok ; Diniz, Pedro C.
Author_Institution
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
fYear
2003
fDate
9-11 April 2003
Firstpage
296
Abstract
Digital image processing algorithms are a good match for direct implementation on FPGAs as current FPGA architectures can naturally match the fine grain parallelism in these applications. Typically, these algorithms are structured as a sequence of operations, expressed in high-level programming languages as tight loop nests. The loops usually define a shifting-window region over which the algorithm applies a simple localized operator (e.g., a differential gradient, or a min/max). In this research we focus on the development of fast, yet accurate performance and area modeling of complete FPGA designs that combine analytical, empirical and behavioral estimation techniques. We model the application of a set of important program transformations for image processing algorithms, namely loop unrolling, tiling, loop interchanging, loop fission and array privatization, and explore pipelined and non-pipelined execution modes. We take into consideration the impact of various transformations, in the presence of limited I/O resources like address generators and external memory data channels, on the performance of a complete design implemented in a FPGA based architecture.
Keywords
field programmable gate arrays; gradient methods; image processing; input-output programs; logic design; performance evaluation; program control structures; reconfigurable architectures; FPGA architecture; FPGA design; I/O resource; address generator; analytical estimation; area modeling; array privatization; behavioral estimation; differential gradient; digital image processing algorithm; empirical estimation; external memory data channel; fine grain parallelism; high-level programming language; localized operator; loop transformation; operation sequence; performance modeling; program transformation; sequence structured; shifting-window region; tight loop nest; Bandwidth; Computer languages; Delay; Field programmable gate arrays; Hardware; Image processing; Information analysis; Parallel processing; Performance analysis; Read-write memory;
fLanguage
English
Publisher
ieee
Conference_Titel
Field-Programmable Custom Computing Machines, 2003. FCCM 2003. 11th Annual IEEE Symposium on
Print_ISBN
0-7695-1979-2
Type
conf
DOI
10.1109/FPGA.2003.1227278
Filename
1227278
Link To Document