Title :
Facilitating intermediate node discovery for decentralized offloading in High Performance Computing centers
Author :
Schmidt, Benjamin A. ; Butt, Ali R.
Author_Institution :
Dept. of Comput. Sci., Virginia Tech, Blacksburg, VA, USA
Abstract :
Modern high-performance computing applications use large-scale simulations to facilitate scientific discovery, such as studying the impact of sub-atomic interactions or searching for cures for diseases. These applications increasingly use data that is growing exponentially in size. Thus, management of data at high performance computing (HPC) centers is a critical problem, and addressing data-related issues is considered a major step toward realizing efficient resource usage. Result-data offloading is a promising technique that can improve the efficiency of HPC centers by moving application result data quickly to user-specified remote locations. This also increases overall center serviceability. However, identifying suitable remote locations for use in such decentralized offloading remains an open problem. In this paper, we explore several methods for locating intermediate nodes using peer-to-peer techniques. We facilitate node discovery at each level of the offload, and structure the discovered nodes to support efficient data transfer that can satisfy the Service Level Agreements between the HPC center and the job submission site. Our evaluation, using realistic simulations and actual measurements on the PlanetLab distributed test-bed, shows that, compared to naive random discovery, controlled routing-table-based advertisements offer an efficient and effective method for discovering appropriate resources: the method discovers 211% more nodes in total, and achieves quick discovery by finding 184% more nodes in less than 27% of the time compared to a random-broadcast-based approach. Thus, this work provides promising node discovery mechanisms that can facilitate the HPC data offloading process.
Keywords :
electronic data interchange; peer-to-peer computing; random processes; user centred design; PlanetLab distributed test-bed; center serviceability; controlled routing-table-based advertisements; data transfer; data-related issues; decentralized offloading; high performance computing centers; intermediate node discovery; large scale simulations; naive random discovery; peer-to-peer techniques; result-data offloading; scientific discovery; service level agreements; user-specified remote locations; Computational modeling; Computer applications; Diseases; Extraterrestrial measurements; High performance computing; Large-scale systems; Peer to peer computing; Resource management; Testing; Time measurement
Conference_Title :
Southeastcon, 2009. SOUTHEASTCON '09. IEEE
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-3976-8
Electronic_ISBN :
978-1-4244-3978-2
DOI :
10.1109/SECON.2009.5174092