• DocumentCode
    2962899
  • Title

    Software-managed automatic data sharing for Coarse-Grained Reconfigurable coprocessors

  • Author

    Mai, Toan X. ; Jongeun Lee

  • Author_Institution
    Sch. of ECE, UNIST, Ulsan, South Korea
  • fYear
    2012
  • fDate
    10-12 Dec. 2012
  • Firstpage
    277
  • Lastpage
    284
  • Abstract
    Coarse-Grained Reconfigurable Architecture (CGRA) in a hybrid system can significantly accelerate the execution of compute-intensive kernels of applications. However, the data communication overhead between the main processor (MP) and the CGRA may be huge and can negate the speed-up of the CGRA. In this paper we address the problem of reducing the data communication overhead in a hybrid system by offering a partially automatic data sharing technique using a special shared memory called Configurable Range Memory (CRM). Unlike the previous work the CRM architecture we use here is based on comparators, which gives much higher flexibility in terms of where an array can be placed within a CRM while it makes the runtime software management of a CRM much more challenging. We present an efficient runtime algorithm based on first-fit heuristic. Our experimental results demonstrate that our CRM-based system can reduce the amount of data transfer between a MP and a CGRA up to 89.5% compared to ScratchPad Memory (SPM)-based systems, while the software management overhead is only 1.20~1.34% on average (depending on CRM architecture parameters) of the kernel cycles in the MP-only execution. Overall our CRM-based system can achieve average kernel speedup of 3.47 times over the MP-only execution, which is about 20% improvement over the SPM-based system.
  • Keywords
    comparators (circuits); computational complexity; coprocessors; memory architecture; reconfigurable architectures; shared memory systems; software management; CGRA; CRM architecture parameters; MP-only execution; coarse-grained reconfigurable coprocessor architecture; comparators; compute-intensive application kernels; configurable range memory architecture; data communication overhead reduction; data transfer reduction; first-fit heuristic; hybrid system; kernel cycles; main processor; runtime algorithm; runtime software management; shared memory; software management overhead; software-managed partially-automatic data sharing; Arrays; Coprocessors; Customer relationship management; Data transfer; Kernel;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Technology (FPT), 2012 International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4673-2846-3
  • Electronic_ISBN
    978-1-4673-2844-9
  • Type

    conf

  • DOI
    10.1109/FPT.2012.6412148
  • Filename
    6412148