Title :
Federated clusters using the transparent remote Execution (TREx) environment
Author :
Wang, Richert ; Cauich, Enrique ; Scherson, Isaac D.
Author_Institution :
Univ. of California, Irvine, Irvine, CA
Abstract :
Due to the increasing complexity of scientific models, large-scale simulation tools often require a critical amount of computational power to produce results in a reasonable amount of time. For example, multi-system wireless network simulations involve complex algorithms of traffic balancing and communication control on large geographical areas. Moreover many of these intensive applications are designed for single sequential machines and large sums of money are spent on purchasing powerful servers that can give results in a satisfactory amount of time. The aim of this paper is to introduce a general-purpose tool, dubbed transparent remote execution (TREx), which avoids resorting to expensive servers by providing a cost effective, high performance, distributed solution. TREx is a daemon that dynamically exploits idle operational in-use workstations. Based on elaborate rules of computational resource management, this daemon permits a master to scan workstations within a predefined subnetwork and share the workload among the least occupied processing elements. It also provides a clear framework for parallelization that applications can exploit. By providing a simple way of federating computational resources, such a framework could drastically reduce hardware investments.
Keywords :
IP networks; resource allocation; sequential machines; telecommunication traffic; workstation clusters; communication control; computational resource management; federated clusters; large-scale simulation tools; multi-system wireless network simulations; purchasing powerful servers; sequential machines; traffic balancing; transparent remote execution environment; Clustering algorithms; Communication system traffic control; Computational modeling; Costs; Large-scale systems; Network servers; Resource management; Traffic control; Wireless networks; Workstations;
Conference_Titel :
Parallel and Distributed Systems, 2007 International Conference on
Conference_Location :
Hsinchu
Print_ISBN :
978-1-4244-1889-3
Electronic_ISBN :
1521-9097
DOI :
10.1109/ICPADS.2007.4447831