• DocumentCode
    3664146
  • Title

    Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms

  • Author

    Emmanuel Agullo;Olivier Beaumont;Lionel Eyraud-Dubois;Julien Herrmann;Suraj Kumar;Loris Marchal;Samuel Thibault

  • Author_Institution
    INRIA Bordeaux - Sud-Ouest, Talence, France
  • fYear
    2015
  • fDate
    5/1/2015 12:00:00 AM
  • Firstpage
    34
  • Lastpage
    45
  • Abstract
    We consider the problem of allocating and scheduling dense linear algebra applications on fully heterogeneous platforms made of CPUs and GPUs. More specifically, we focus on the Cholesky factorization since it exhibits the main features of such problems. Indeed, the relative performance of CPU and GPU highly depends on the sub-routine: GPUs are, for instance, much more efficient at processing regular kernels such as matrix-matrix multiplications than more irregular kernels such as matrix factorization. In this context, one solution consists in relying on dynamic scheduling and resource allocation mechanisms such as the ones provided by PaRSEC or StarPU. In this paper we analyze the performance of dynamic schedulers based on both actual executions and simulations, and we investigate how adding static rules, based on an offline analysis of the problem, to their decision process can improve their performance, up to reaching some improved theoretical performance bounds which we introduce.
  • Keywords
    "Dynamic scheduling","Kernel","Processor scheduling","Runtime","Schedules","Linear algebra","Electronic mail"
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015 IEEE International
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2015.35
  • Filename
    7284288