• DocumentCode
    169075
  • Title

    Efficient Execution of Microscopy Image Analysis on CPU, GPU, and MIC Equipped Cluster Systems

  • Author

    Andrade, G. ; Ferreira, Ricardo ; Teodoro, George ; Rocha, Leonardo ; Saltz, Joel H. ; Kurc, Tahsin

  • Author_Institution
    Fed. Univ. of Minas Gerais, Belo Horizonte, Brazil
  • fYear
    2014
  • fDate
    22-24 Oct. 2014
  • Firstpage
    89
  • Lastpage
    96
  • Abstract
    High performance computing is experiencing a major paradigm shift with the introduction of accelerators, such as graphics processing units (GPUs) and Intel Xeon Phi (MIC). These processors have made available a tremendous computing power at low cost, and are transforming machines into hybrid systems equipped with CPUs and accelerators. Although these systems can deliver a very high peak performance, making full use of its resources in real-world applications is a complex problem. Most current applications deployed to these machines are still being executed in a single processor, leaving other devices underutilized. In this paper we explore a scenario in which applications are composed of hierarchical dataflow tasks which are allocated to nodes of a distributed memory machine in coarse-grain, but each of them may be composed of several finer-grain tasks which can be allocated to different devices within the node. We propose and implement novel performance aware scheduling techniques that can be used to allocate tasks to devices. We evaluate our techniques using a pathology image analysis application used to investigate brain cancer morphology, and our experimental evaluation shows that the proposed scheduling strategies significantly outperforms other efficient scheduling techniques, such as Heterogeneous Earliest Finish Time - HEFT, in cooperative executions using CPUs, GPUs, and Masc. also experimentally show that our strategies are less sensitive to inaccuracy in the scheduling input data and that the performance gains are maintained as the application scales.
  • Keywords
    brain; cancer; distributed memory systems; graphics processing units; medical image processing; parallel processing; scheduling; CPU equipped cluster systems; GPU equipped cluster systems; Intel Xeon Phi; MIC equipped cluster systems; Masc; accelerators; brain cancer morphology; distributed memory machine; graphics processing units; hierarchical dataflow tasks; high performance computing; microscopy image analysis; pathology image analysis application; performance aware scheduling techniques; task allocation; Central Processing Unit; Graphics processing units; Image analysis; Microwave integrated circuits; Performance evaluation; Processor scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture and High Performance Computing (SBAC-PAD), 2014 IEEE 26th International Symposium on
  • Conference_Location
    Jussieu
  • ISSN
    1550-6533
  • Type

    conf

  • DOI
    10.1109/SBAC-PAD.2014.15
  • Filename
    6970651