• DocumentCode
    3601202
  • Title

    Near-Optimal Thermal Monitoring Framework for Many-Core Systems-on-Chip

  • Author

    Ranieri, Juri ; Vincenzi, Alessandro ; Chebira, Amina ; Atienza, David ; Vetterli, Martin

  • Author_Institution
    Sch. of Comput. & Commun. Sci, Ecole Polytech. Fed. de Lausanne, Lausanne, Switzerland
  • Volume
    64
  • Issue
    11
  • fYear
    2015
  • Firstpage
    3197
  • Lastpage
    3209
  • Abstract
    Chip designers place on-chip thermal sensors to measure local temperatures, thus preventing thermal runaway situations in many-core processing architectures. However, the quality of the thermal reconstruction is directly dependent on the number of placed sensors, which should be minimized, while guaranteeing full detection of all the worst case temperature gradient. In this paper, we present an entire framework for the thermal management of complex many-core architectures, such that we can precisely recover the thermal distribution from a minimal number of sensors. The proposed sensor placement algorithm is guaranteed to reduce the impact of noisy measurements on the reconstructed thermal distribution. We achieve significant improvements compared to the state of the art, in terms of both computational complexity and reconstruction precision. For example, if we consider a 64 cores systems-on-chip with 64 noisy sensors (σ2 = 4), we achieve an average reconstruction error of 1:5°C, that is less than half of what previous state-of-the-art methods achieve. We also study the practical limits of the proposed method and show that we do not need realistic workloads to learn the model and efficiently place the sensors. In fact, we show that the reconstruction error is not significantly increased if we randomly generate the power-traces of the components or if we have just a part of the correct workload.
  • Keywords
    computational complexity; multiprocessing systems; optimisation; sensor placement; system-on-chip; temperature sensors; SoC; computational complexity; many-core systems-on-chip; near-optimal thermal monitoring framework; on-chip thermal sensor; reconstruction precision; sensor placement algorithm; temperature gradient; thermal management; Computational modeling; Principal component analysis; System-on-chip; Temperature measurement; Temperature sensors; Sensor placement; thermal management; thermal monitoring;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2015.2395423
  • Filename
    7018011