Title :
AmpNet - a highly available cluster interconnection network
Author :
Apon, Amy ; Wilbur, Larry
Author_Institution :
Arkansas Univ., AR, USA
Abstract :
One of the most important challenges facing computing clusters in the foreseeable future is providing fault-tolerant, high-availability cluster hardware for non-stop applications. This capability is needed in addition to high throughput and low latency. This paper presents the Advanced Multiprocessor Network (AmpNet), a gigabit-speed cluster interconnect that was designed with these issues in mind. The AmpNet network interface card (NIC) uses network-shared memory as a network cache to provide a fault-tolerant, self-healing network with no data loss. Higher-level network-centric services use network-shared memory to ensure high availability and continuity of service in applications. In addition, the programmable NIC, with low-latency messaging protocols and field-upgradeable soft logic, provides a foundation for researchers who would like to develop additional cluster services and protocols for network-centric computing. This paper describes the fault-tolerant design and implementation of the AmpNet hardware architecture.
Keywords :
fault tolerant computing; protocols; workstation clusters; AmpNet; AmpNet hardware architecture; AmpNet network interface card; advanced multiprocessor network; cluster hardware; computing clusters; highly available cluster interconnection network; messaging protocols; network cache; network-shared memory; protocols; real-time distributed systems; self-healing network; Availability; Computer networks; Delay; Fault tolerance; Hardware; Logic; Multiprocessor interconnection networks; Network interfaces; Protocols; Throughput;
Conference_Title :
Parallel and Distributed Processing Symposium, 2003. Proceedings. International
Print_ISBN :
0-7695-1926-1
DOI :
10.1109/IPDPS.2003.1213370