• DocumentCode
    2931679
  • Title

    A Run-time System for Efficient Execution of Scientific Workflows on Distributed Environments

  • Author

    Teodoro, G. ; Tavares, T. ; Ferreira, R. ; Kurc, T. ; Meira, W. ; Guedes, D. ; Pan, T. ; Saltz, J.

  • Author_Institution
    Dept. of Comput. Sci., Univ. Fed. de Minas Gerais
  • fYear
    2006
  • fDate
    17-20 Oct. 2006
  • Firstpage
    81
  • Lastpage
    90
  • Abstract
    Scientific workflow systems have been introduced in response to the demand of researchers from several domains of science who need to process and analyze increasingly larger datasets. The design of these systems is largely based on the observation that data analysis applications can be composed as pipelines or networks of computations on data. In this paper we present a run-time support system that is designed to facilitate this type of computation in distributed computing environments. Our system is optimized for data-intensive workflows, in which efficient management and retrieval of data, coordination of data processing and data movement, and check-pointing of intermediate results are critical and challenging issues. Experimental evaluation of our system shows that linear speedups can be achieved for sophisticated applications, which are implemented as a network of multiple data processing components
  • Keywords
    checkpointing; distributed processing; natural sciences computing; checkpointing; data management; data retrieval; data-intensive workflows; distributed computing; run-time system; scientific workflow execution; Cache storage; Computer networks; Data analysis; Data processing; Distributed computing; Filters; Image edge detection; Information retrieval; Pipelines; Runtime environment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture and High Performance Computing, 2006. SBAC-PAD '06. 18TH International Symposium on
  • Conference_Location
    Ouro Preto
  • ISSN
    1550-6533
  • Print_ISBN
    0-7695-2704-3
  • Type

    conf

  • DOI
    10.1109/SBAC-PAD.2006.6
  • Filename
    4032419