• DocumentCode
    3507712
  • Title

    Memory Parallelism Using Custom Array Mapping to Heterogeneous Storage Structures

  • Author

    Baradaran, Nastaran ; Diniz, Pedro C.

  • Author_Institution
    University of Southern California / Information Sciences Institute, Marina del Rey, California, USA, e-mail: nastaran@isi.edu
  • fYear
    2006
  • fDate
    Aug. 2006
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Configurable architectures offer the unique opportunity of customizing the storage allocation to meet specific applications¿ needs. In this paper we describe a compiler approach to map the arrays of a loop-based computation to internal memories of a configurable architecture with the objective of minimizing the overall execution time. We present an algorithm that considers the data access patterns of the arrays along the critical path of the computation as well as the available storage and memory bandwidth. We demonstrate experimental results of the application of this approach for a set of kernel codes when targeting a Field-Programmable Gate-Array (FPGA). The results reveal that our algorithm outperforms naive and custom data layouts for these kernels by an average of 33% and 15% in terms of execution time, while taking into account the available hardware resources.
  • Keywords
    Algorithm design and analysis; Bandwidth; Computer architecture; Field programmable gate arrays; Hardware; Kernel; Parallel processing; Processor scheduling; Random access memory; Signal processing algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field Programmable Logic and Applications, 2006. FPL '06. International Conference on
  • Conference_Location
    Madrid
  • Print_ISBN
    1-4244-0312-X
  • Type

    conf

  • DOI
    10.1109/FPL.2006.311241
  • Filename
    4101003