• DocumentCode
    58828
  • Title

    Automatic Generation of Miniaturized Synthetic Proxies for Target Applications to Efficiently Design Multicore Processors

  • Author

    Ganesan, Kavita ; John, Lizy Kurian

  • Author_Institution
    Oracle Inc., Austin, TX, USA
  • Volume
    63
  • Issue
    4
  • fYear
    2014
  • fDate
    Apr-14
  • Firstpage
    833
  • Lastpage
    846
  • Abstract
    Prohibitive simulation time with pre-silicon design models and unavailability of proprietary target applications make microprocessor design very tedious. The framework proposed in this paper is the first attempt to automatically generate synthetic benchmark proxies for real world multithreaded applications. The framework includes metrics that characterize the behavior of the workloads in the shared caches, coherence logic, out-of-order cores, interconnection network and DRAM. The framework is evaluated by generating proxies for the workloads in the multithreaded PARSEC benchmark suite and validating their fidelity by comparing the microarchitecture dependent and independent metrics to that of the original workloads. The average error in IPC is 4.87 percent and maximum error is 10.8 percent for Raytrace in comparison to the original workloads. The average error in the power-per-cycle metric is 2.73 percent with a maximum of 5.5 percent when compared to original workloads. The representativeness of the proxies to that of the original workloads in terms of their sensitivity to design changes is evaluated by finding the correlation coefficient between the trends followed by the synthetic and the original for design changes in IPC, which is 0.92. A speedup of four to six orders of magnitude is achieved by using the synthetic proxies over the original workloads.
  • Keywords
    DRAM chips; cache storage; microprocessor chips; multi-threading; multiprocessing systems; multiprocessor interconnection networks; DRAM; Raytrace; coherence logic; interconnection network; microprocessor design; miniaturized synthetic proxies; multicore processors; multithreaded applications; out-of-order cores; presilicon design models; prohibitive simulation; shared caches; synthetic benchmark proxies; Benchmark testing; Cloning; Measurement; Memory management; Multicore processing; Parallel processing; Power demand; Computer architecture; multicore systems; synthetic benchmarks; workload cloning and power modeling;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2013.36
  • Filename
    6463386