• DocumentCode
    2802278
  • Title

    Green Multicore-SoC Software-Execution Framework with Timely-Power-Gating Scheme

  • Author

    Onouchi, M. ; Toyama, K. ; Nojiri, T. ; Sato, Mitsuhisa ; Mase, Michela ; Shirako, Jun ; Sato, Mitsuhisa ; Takada, Masumi ; Ito, Minora ; Mizuno, Hidenori ; Namiki, Mitaro ; Kimura, K. ; Kasahara, Hironori

  • Author_Institution
    Hitachi Ltd., Kokubunji, Japan
  • fYear
    2009
  • fDate
    22-25 Sept. 2009
  • Firstpage
    510
  • Lastpage
    517
  • Abstract
    We are developing a software-execution framework based on an octo-core chip multiprocessor named RP2 and an automatic multigrain-parallelizing compiler named OSCAR. The main purpose of this framework is to maintain good speed scalability and power efficiency over the number of processor cores under severe hardware restrictions for embedded use. Key to the speed scalability is reduction of a communication overhead with parallelized tasks. A data-categorization scheme enables small-overhead cache-coherency maintenance by using directives and instructions from the compiler. In this scheme, the number of cache-flushing time is minimized and parallelized tasks are quickly synchronized by using flags in local memory. As regards power efficiency, to reduce power consumption, power supply to processor cores waiting for other cores is timely and frequently cut off, even in the middle of an application, by using a timely-power- gating scheme. In this scheme, to achieve quick mode transition between "NORMAL" mode and "RESUME POWEROFF" mode, register values of the processor core are stored in core-local memory, which is active even in "RESUME POWEROFF" mode and can be accessed in one or two clock cycles. Measured speed and power of an application show good speed scalability in execution time and high power efficiency, simultaneously. In the case of a secure AAC-LC encoding program, execution speed when eight processor cores are used can be increased by 4.85 times compared to that of sequential execution. Moreover, power consumption under the same condition can be reduced by 51.0% by parallelizing and timely-power gating. The time for mode transition is less than 20 micro seconds, which is only 2.5% of the "RESUME POWER-OFF" period.
  • Keywords
    cache storage; multiprocessing programs; program compilers; security of data; software maintenance; system-on-chip; OSCAR; automatic multigrain-parallelizing compiler; cache-coherency maintenance; data-categorization scheme; green multicore-SoC software-execution; octo-core chip multiprocessor; secure AAC-LC encoding program; timely-power-gating scheme; Clocks; Energy consumption; Hardware; Multicore processing; Power measurement; Power supplies; Registers; Resumes; Scalability; Synchronization; low-power; parallelization; power-gating;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing, 2009. ICPP '09. International Conference on
  • Conference_Location
    Vienna
  • ISSN
    0190-3918
  • Print_ISBN
    978-1-4244-4961-3
  • Electronic_ISBN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2009.6
  • Filename
    5362472