• DocumentCode
    3202956
  • Title

    On Performance Modeling and Prediction in Support of Scientific Workflow Optimization

  • Author

    Wu, Qishi ; Datla, Vivek V.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Memphis, Memphis, TN, USA
  • fYear
    2011
  • fDate
    4-9 July 2011
  • Firstpage
    161
  • Lastpage
    168
  • Abstract
    The computing modules in distributed scientific workflows must be mapped to computer nodes in shared network environments for optimal workflow performance. Finding a good workflow mapping scheme critically depends on an accurate prediction of the execution time of each individual computational module in the workflow. The time prediction of a scientific computation does not have a silver bullet as it is determined collectively by several dynamic system factors including concurrent loads, memory size, CPU speed, and also by the complexity of the computational program itself. This paper investigates the problem of modeling scientific computations and predicting their execution time based on a combination of both hardware and software properties. We employ statistical learning techniques to estimate the effective computational power of a given computer node at any point of time and estimate the total number of CPU cycles needed for executing a given computational program on any input data size. We analytically derive an upper bound of the estimation error for execution time prediction given the hardware and software properties. The proposed statistical analysis-based solution to performance modeling and prediction is validated and justified by experimental results measured on the computing nodes that vary significantly in terms of the hardware specifications.
  • Keywords
    computational complexity; learning (artificial intelligence); multiprocessing systems; optimisation; scientific information systems; statistical analysis; workflow management software; CPU speed; computational program complexity; computer node; memory size; performance modeling; scientific workflow optimization; statistical learning technique; workflow mapping scheme; Benchmark testing; Complexity theory; Computational modeling; Estimation; Hardware; Predictive models; Software; Performance modeling; regression techniques; scientific computation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Services (SERVICES), 2011 IEEE World Congress on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4577-0879-4
  • Electronic_ISBN
    978-0-7695-4461-8
  • Type

    conf

  • DOI
    10.1109/SERVICES.2011.37
  • Filename
    6012708