• DocumentCode
    2155278
  • Title

    An energy-efficient coarse-grained dynamically reconfigurable fabric for multiple-standard video decoding applications

  • Author

    Leibo Liu ; Chenchen Deng ; Dong Wang ; Min Zhu ; Shouyi Yin ; Peng Cao ; Shaojun Wei

  • Author_Institution
    Inst. of Microelectron., Tsinghua Univ., Beijing, China
  • fYear
    2013
  • fDate
    22-25 Sept. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, we introduce a coarse-grained dynamically reconfigurable fabric, named Reconfigurable Processing Unit (RPU), which is implemented on a 5.4×3.1 mm2 silicon with TSMC 65 nm LP1P8M technology. This fabric consists of 16×16 multi-functional Processing Elements (PEs) interconnected by an area-efficient Line-Switched Mesh Connect (LSMC) routing. A Hierarchical Configuration Context (HCC) organization scheme is proposed to reduce the scale of the context memory and enhance configuration efficiency. Two reconfigurable processors are then designed and fabricated to verify the proposed techniques. One processor (called REMUS_HPP) integrates two RPUs, targeting the high performance applications. REMUS_HPP could decode 1920×1080@30fps H.264 streams with 280mW under 200MHz, achieving a performance gain of 1.81x and a 14.3x energy efficiency improvement over XPP-III. The other processor (called REMUS_LPP) integrates only one RPU, targeting the low power applications. REMUS_LPP could decode 720×480@35fps H.264 streams with 24.81mW under 75MHz, achieving a 76% power reduction and a 3.96x energy efficiency improvement compared with ADRES. More importantly, RPU is not only limited to video decoding applications. It can also be used to process some other computation-intensive applications and the corresponding analysis is given in this paper as well.
  • Keywords
    data compression; decoding; video codecs; video coding; HCC organization; Hierarchical Configuration Context; LSMC routing; RPU; area efficient line switched mesh connect routing; computation intensive applications; context memory; energy efficiency improvement; energy efficient coarse grained dynamically reconfigurable fabric; enhance configuration efficiency; frequency 200 MHz; frequency 75 MHz; multifunctional processing elements; multiple standard video decoding applications; power 24.81 mW; power 280 mW; reconfigurable processing unit; reconfigurable processors; size 65 nm; Algorithm design and analysis; Computer architecture; Context; Decoding; Energy efficiency; Fabrics; Program processors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Custom Integrated Circuits Conference (CICC), 2013 IEEE
  • Conference_Location
    San Jose, CA
  • Type

    conf

  • DOI
    10.1109/CICC.2013.6658434
  • Filename
    6658434