DocumentCode :
3425845
Title :
Sustainable Adaptive Grid Supercomputing: Multiscale Simulation of Semiconductor Processing across the Pacific
Author :
Takemiya, Hiroshi ; Tanaka, Yoshio ; Sekiguchi, Satoshi ; Ogata, Shuji ; Kalia, Rajiv K. ; Nakano, Aiichiro ; Vashishta, Priya
Author_Institution :
Grid Technol. Res. Center, Nat. Inst. of Adv. Ind. Sci. & Technol.
fYear :
2006
fDate :
Nov. 2006
Firstpage :
23
Lastpage :
23
Abstract :
We propose a reservation-based sustainable adaptive grid supercomputing paradigm to enable tightly coupled computations of considerable scale (involving over 1,000 processors) and duration (over tens of continuous days) on a grid of geographically distributed parallel supercomputers. The paradigm is demonstrated for an adaptive multiscale simulation application, in which accurate but compute-intensive quantum mechanical (QM) simulations are embedded within a classical molecular dynamics (MD) simulation only when and where high fidelity is required. Key technical innovations include: 1) an embedded divide-and-conquer algorithmic framework to maximally expose data and computation localities for enhanced scalability; 2) a buffered-cluster hybridization scheme to adaptively adjust MD/QM boundaries to maintain the model accuracy; and 3) a hybrid grid remote procedure call (GridRPC) + message passing interface (MPI) grid application framework to combine flexibility (adaptive resource allocation and migration), fault tolerance (automated fault recovery), and efficiency (scalable management of large computing resources). We have achieved an automated execution of multiscale MD/QM simulation on a Grid consisting of 6 supercomputer centers in Japan and the US (in total of 150 thousand processor hours) for the dynamic simulation of implanted oxygen atoms in a silicon substrate, in which the number of processors changes dynamically on demand and resources are allocated and migrated dynamically according to both reservations and unexpected faults. The simulation results reveal a strong dependence of the oxygen penetration depth on the incident oxygen-beam position, which is useful information to further advance SIMOX (separation by implanted oxygen) technique to fabricate high speed and low power-consumption semiconductor devices
Keywords :
divide and conquer methods; fault tolerant computing; grid computing; message passing; parallel machines; remote procedure calls; resource allocation; SIMOX; divide-and-conquer algorithm; fault tolerance; message passing interface; molecular dynamics; multiscale simulation; quantum mechanical simulation; remote procedure call; resource allocation; semiconductor processing; sustainable adaptive grid supercomputing; Computational modeling; Computer interfaces; Concurrent computing; Distributed computing; Embedded computing; Grid computing; Quantum computing; Quantum mechanics; Resource management; Supercomputers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SC 2006 Conference, Proceedings of the ACM/IEEE
Conference_Location :
Tampa, FL
Print_ISBN :
0-7695-2700-0
Electronic_ISBN :
0-7695-2700-0
Type :
conf
DOI :
10.1109/SC.2006.59
Filename :
4090197
Link To Document :
بازگشت