DocumentCode :
20299
Title :
Cost and Accuracy Aware Scientific Workflow Composition for Service-Oriented Environments
Author :
Chiu, Dereck ; Agrawal, Gagan
Author_Institution :
Sch. of Eng. & Comput. Sci., Washington State Univ., Vancouver, WA, USA
Volume :
6
Issue :
4
fYear :
2013
fDate :
Oct.-Dec. 2013
Firstpage :
470
Lastpage :
483
Abstract :
Large-scale scientific data analysis projects have catalyzed service-based workflow management systems. We present an approach for integrating user preferences on completion time and workflow accuracy in a workflow composition system. The relationship between workflow execution time and the accuracy of results is exploited by our workflow system. Specifically, our system is equipped with a way for users to define cost models on service completion time and error propagation (prevalent in many scientific and data analysis applications). Together with these models and an ontology for describing web service and data dependences, our system plans service-based workflows to answer high-level queries. Our system was evaluated under a real service-based environment against user constraints on time, accuracy, and network bandwidth variations. In the worst case in our experiments, we observed an average deviation of 14.3 percent below the desired time constraints, which suggests that our system is time-conservative. Within varying network bandwidth environments, we can also meet time constraints through sampling, and only a 12.4 percent deviation below time expectations are observed on average. We further show that, though negotiating with services´ error models, our system is capable of planning data reduction measures (e.g., sampling) directly within workflow plans to achieve the desired accuracy.
Keywords :
Web services; data analysis; ontologies (artificial intelligence); query processing; service-oriented architecture; workflow management software; Web service; accuracy aware scientific workflow composition system; cost aware scientific workflow composition system; cost models; data dependences; error propagation; high-level query answering; large-scale scientific data analysis projects; network bandwidth variations; ontology; planning data reduction measures; service completion time; service error models; service-based workflow management systems; service-oriented environments; user preferences; workflow execution time; Accuracy; Databases; Mathematical model; Ontologies; Registers; Time factors; Web services; Workflow management; scientific workflows; web service composition;
fLanguage :
English
Journal_Title :
Services Computing, IEEE Transactions on
Publisher :
ieee
ISSN :
1939-1374
Type :
jour
DOI :
10.1109/TSC.2012.19
Filename :
6226350
Link To Document :
بازگشت