• DocumentCode
    507386
  • Title

    Automatic memory partitioning and scheduling for throughput and power optimization

  • Author

    Cong, Jason ; Jiang, Wei ; Liu, Bin ; Zou, Yi

  • Author_Institution
    Comput. Sci. Dept., Univ. of California, Los Angeles, CA, USA
  • fYear
    2009
  • fDate
    2-5 Nov. 2009
  • Firstpage
    697
  • Lastpage
    704
  • Abstract
    Hardware acceleration is crucial in modern embedded system design to meet the explosive demands on performance and cost. Selected computation kernels for acceleration are usually captured by nest loops, which are optimized by state-of-the-art techniques like loop tiling and loop pipelining. However, memory bandwidth bottlenecks prevent designs to reach optimal throughput with respect to available parallelism. In this paper we present an automatic memory partitioning technique which can efficiently improve throughput and reduce energy consumption of pipelined loop kernels for given throughput constraints and platform requirement. Our partition scheme consists of two steps, the first step considers cycle accurate scheduling information to meet the hard constraints on memory bandwidth requirements specifically for synchronized hardware designs. Experimental results show an average 6X throughput improvement on a set of real world designs with moderate area increase (about 45% on average), given that less resource sharing opportunities exist with higher throughput in optimized designs. The second step further partitions the memory banks for reducing the dynamic power consumption of the final design. In contrast with previous approaches, our technique can statically compute memory access frequencies in polynomial time with little to none profiling. Experimental results show about 30% power reduction on the same set of benchmarks.
  • Keywords
    embedded systems; optimisation; pipeline processing; power aware computing; power consumption; program control structures; automatic memory partitioning; automatic memory scheduling; computation kernels; cycle accurate scheduling information; dynamic power consumption; embedded system design; hardware acceleration; loop pipelining; loop tiling; memory access frequencies; memory bandwidth bottlenecks; nest loops; polynomial time; power optimization; resource sharing opportunities; state-of the art techniques; synchronized hardware designs; throughput optimization; Acceleration; Bandwidth; Costs; Embedded system; Energy consumption; Explosives; Hardware; Kernel; Processor scheduling; Throughput; Behavioral Synthesis; Memory Partition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer-Aided Design - Digest of Technical Papers, 2009. ICCAD 2009. IEEE/ACM International Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    1092-3152
  • Print_ISBN
    978-1-60558-800-1
  • Electronic_ISBN
    1092-3152
  • Type

    conf

  • Filename
    5361219