DocumentCode :
2338661
Title :
A design approach for numerical libraries in large scale distributed systems
Author :
Emad, Nahid ; Delannoy, Olivier ; Dandouna, Makarem
Author_Institution :
PRiSM Lab., Univ. of Versailles, Versailles, France
fYear :
2010
fDate :
16-19 May 2010
Firstpage :
1
Lastpage :
9
Abstract :
Nowadays, large scale distributed systems gather thousands of nodes with hierarchical memory models. They are heterogeneous, volatile and geographically distributed. The efficient exploitation of such systems requires the conception and adaptation of appropriate numerical methods, the definition of new programming paradigms, new metrics for performance prediction, etc. The modern hybrid numerical methods are well adapted to this kind of systems. This is particularly because of their multi-level parallelism and fault tolerance property. However the programming of these methods for these architectures requires concurrent reuse of sequential and parallel code. But the currently existing numerical libraries aren´t able to exploit the multi-level parallelism offered by theses methods. A few linear algebra numerical libraries make use of object oriented approach allowing modularity and extensibility. Nevertheless, those which offer modularity,sequential and parallel code reuse are almost non-existent. In this paper, we analyze the lacks in existing libraries and propose a design based on a component approach and the strict separation between computation operations, data management and communication control of an application. We present then an application of this design using YML scientific workflow environment (http://yml.prism.uvsq.fr/) jointly with the object oriented LAKe (Linear Algebra Kernel) library. Some numerical experiments on GRID5000 platform validate our approach and show its efficiency.
Keywords :
fault tolerance; grid computing; large-scale systems; linear algebra; object-oriented programming; sequential codes; software libraries; software reusability; GRID5000 platform; data management; fault tolerance; hierarchical memory models; hybrid numerical methods; large scale distributed system; linear algebra kernel; numerical libraries; object oriented programming; parallel code; sequential code; Iterative methods; Lakes; Libraries; Linear algebra; Numerical models; Object oriented modeling; Parallel processing; code reusability; design model; hybrid methods; large scale distributed systems; numerical library; scientific workflow environments;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications (AICCSA), 2010 IEEE/ACS International Conference on
Conference_Location :
Hammamet
Print_ISBN :
978-1-4244-7716-6
Type :
conf
DOI :
10.1109/AICCSA.2010.5586951
Filename :
5586951
Link To Document :
بازگشت