DocumentCode :
3612129
Title :
Learning Running-time Prediction Models for Gene-Expression Analysis Workflows
Author :
Monge, David A. ; Holec, Matej ; Zelezny, Filip ; Garcia Garino, Carlos
Author_Institution :
ITIC Res. Inst., Nat. Univ. of Cuyo, Cuyo, Argentina
Volume :
13
Issue :
9
fYear :
2015
Firstpage :
3088
Lastpage :
3095
Abstract :
One of the central issues for the efficient management of Scientific workflow applications is the prediction of tasks performance. This paper proposes a novel approach for constructing performance models for tasks in data-intensive scientific workflows in an autonomous way. Ensemble Machine Learning techniques are used to produce robust combined models with high predictive accuracy. Information derived from workflow systems and the characteristics and provenance of the data are exploited to guarantee the accuracy of the models. A gene-expression analysis workflow application was used as case study over homogeneous and heterogeneous computing environments. Experimental results evidence noticeable improvements while using ensemble models in comparison with single/standalone prediction models. Ensemble learning techniques made it possible to reduce the prediction error with respect to the strategies of a single-model with values ranging from 14.47 percent to 28.36 percent for the homogeneous case, and from 8.34 percent to 17.18 percent for the heterogeneous case.
Keywords :
bioinformatics; genetics; learning (artificial intelligence); bioinformatics; data-intensive scientific workflow; ensemble machine learning technique; gene-expression analysis workflow application; heterogeneous computing environment; running-time prediction model; scientific workflow application; Adaptation models; Analytical models; Benchmark testing; Biological system modeling; Computational modeling; Predictive models; Robustness; Bioinformatics; Distributed Computing; Ensemble Learning; Performance Prediction; Workflows;
fLanguage :
English
Journal_Title :
Latin America Transactions, IEEE (Revista IEEE America Latina)
Publisher :
ieee
ISSN :
1548-0992
Type :
jour
DOI :
10.1109/TLA.2015.7350063
Filename :
7350063
Link To Document :
بازگشت