Title :
Integration of Heterogeneous and Non-dedicated Environments for R
Author :
Vera, Gonzalo ; Suppi, Remo
Author_Institution :
Gonzalo Vera & Remo Suppi Comput. Archit. & Oper. Syst. Dept. (CAOS), Univ. Autonoma de Barcelona, Barcelona, Spain
Abstract :
Parallel computing is becoming essential for nowadays data analysis in several disciplines. In order to profit from parallel processing of experimental data, specialized skills, software tools and suitable computing resources are required. Desktop grids and volunteer-based systems have proved themselves as powerful options where distributed idle resources from heterogeneous computers are aggregated to build powerful met computers. Software solutions are required to automate and assist the process of transformation and adaptation of current and new applications to run in these environments. Finally, it is desirable, for the same tool, to provide an efficient solution to orchestrate the execution of these programs using a diversity of dynamic environments. In this paper we describe an implementation of an integrated solution for the R language which allows the transformation and execution of parallel loops in heterogeneous and non-dedicated environments. The results obtained allow us to prove the feasibility of our proposal. Furthermore, several issues that tools like this must consider to improve their performance when integrating heterogeneous systems are described.
Keywords :
Bioinformatics; Biology computing; Clouds; Computer architecture; Concurrent computing; Data analysis; Distributed computing; Grid computing; Parallel processing; Proposals; R language; heterogeneous systems; parallel loops; self-scheduling; volunteer computing;
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2010 10th IEEE/ACM International Conference on
Conference_Location :
Melbourne, Australia
Print_ISBN :
978-1-4244-6987-1
DOI :
10.1109/CCGRID.2010.102