• DocumentCode
    2206261
  • Title

    Design and Implementation of GXP Make -- A Workflow System Based on Make

  • Author

    Taura, Kenjiro ; Matsuzaki, Takuya ; Miwa, Makoto ; Kamoshida, Yoshikazu ; Yokoyama, Daisaku ; Dun, Nan ; Shibata, Takeshi ; Jun, Choi Sung ; Tsujii, Jun Ichi

  • Author_Institution
    Univ. of Tokyo, Tokyo, Japan
  • fYear
    2010
  • fDate
    7-10 Dec. 2010
  • Firstpage
    214
  • Lastpage
    221
  • Abstract
    This paper describes the rational behind designing workflow systems based on the Unix make by showing a number of idioms useful for workflows comprising many tasks. It also demonstrates a specific design and implementation of such a workflow system called GXP make. GXP make supports all the features of GNU make and extends its platforms from single node systems to clusters, clouds, supercomputers, and distributed systems. Interestingly, it is achieved by a very small code base that does not modify GNU make implementation at all. While being not ideal for performance, it achieved a useful performance and scalability of dispatching one million tasks in approximately 16,000 seconds (60 tasks per second, including dependence analysis) on an 8 core Intel Nehalem node. For real applications, recognition and classification of protein-protein interactions from biomedical texts on a supercomputer with more than 8,000 cores are described.
  • Keywords
    Unix; grid computing; multiprocessing systems; scientific information systems; user interfaces; GNU make; GXP make; Unix; multicore system; scientific workflow system; Dispatching; Graphical user interfaces; Java; Libraries; Supercomputers; Syntactics; Web services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Science (e-Science), 2010 IEEE Sixth International Conference on
  • Conference_Location
    Brisbane, QLD
  • Print_ISBN
    978-1-4244-8957-2
  • Electronic_ISBN
    978-0-7695-4290-4
  • Type

    conf

  • DOI
    10.1109/eScience.2010.43
  • Filename
    5693920