• DocumentCode
    1986047
  • Title

    Performance and area modeling of complete FPGA designs in the presence of loop transformations

  • Author

    Shayee, K. R Shesha ; Park, Joonseok ; Diniz, Pedro C.

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2003
  • fDate
    9-11 April 2003
  • Firstpage
    296
  • Abstract
    Digital image processing algorithms are a good match for direct implementation on FPGAs as current FPGA architectures can naturally match the fine grain parallelism in these applications. Typically, these algorithms are structured as a sequence of operations, expressed in high-level programming languages as tight loop nests. The loops usually define a shifting-window region over which the algorithm applies a simple localized operator (e.g., a differential gradient, or a min/max). In this research we focus on the development of fast, yet accurate performance and area modeling of complete FPGA designs that combine analytical, empirical and behavioral estimation techniques. We model the application of a set of important program transformations for image processing algorithms, namely loop unrolling, tiling, loop interchanging, loop fission and array privatization, and explore pipelined and non-pipelined execution modes. We take into consideration the impact of various transformations, in the presence of limited I/O resources like address generators and external memory data channels, on the performance of a complete design implemented in a FPGA based architecture.
  • Keywords
    field programmable gate arrays; gradient methods; image processing; input-output programs; logic design; performance evaluation; program control structures; reconfigurable architectures; FPGA architecture; FPGA design; I/O resource; address generator; analytical estimation; area modeling; array privatization; behavioral estimation; differential gradient; digital image processing algorithm; empirical estimation; external memory data channel; fine grain parallelism; high-level programming language; localized operator; loop transformation; operation sequence; performance modeling; program transformation; sequence structured; shifting-window region; tight loop nest; Bandwidth; Computer languages; Delay; Field programmable gate arrays; Hardware; Image processing; Information analysis; Parallel processing; Performance analysis; Read-write memory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Custom Computing Machines, 2003. FCCM 2003. 11th Annual IEEE Symposium on
  • Print_ISBN
    0-7695-1979-2
  • Type

    conf

  • DOI
    10.1109/FPGA.2003.1227278
  • Filename
    1227278