DocumentCode :
3104143
Title :
NwsAlarm: a tool for accurately detecting resource performance degradation
Author :
Krintz, Chandra ; Wolski, Rich
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., San Diego, La Jolla, CA, USA
fYear :
2001
fDate :
2001
Firstpage :
404
Lastpage :
413
Abstract :
End-users of high-performance computing resources have come to expect that consistent levels of performance be delivered to their applications. The advancement of the computational grid enables the seamless use of a multitude of computing resources by these users. The combination of these developments has generated a need for users to monitor the end-to-end-performance available to an application. In addition, tools are needed to alert users of degradation in expected performance. We present the NwsAlarm, a Java-based utility that enables users to monitor performance levels of any resource being monitored by the Network Weather Service. The NwsAlarm is invoked by a user without special privileges with a simple click on a Web page link. More importantly the NwsAlarm allows any user of the NwsAlarm to register and set expected performance levels. When performance levels fall below these thresholds, the registered administrators are immediately notified via email. The NwsAlarm uses prediction of performance measurements to filter false alarm values. We exemplify the importance of and accuracy achieved by the NwsAlarm with real examples of performance degradation caused by routing table changes and loss of service on the Abilene, Internet-2 research network used for experimentation with evolving Grid software technology. On average, 92% fewer false alarms are raised by the NwsAlarm than if raw measurements are used
Keywords :
Internet; Java; computer network management; electronic mail; performance evaluation; Abilene; Grid software technology; Internet-2 research network; Java-based utility; Network Weather Service; NwsAlarm tool; Web page link; computational grid; email; false alarm; high-performance computing; performance measurements; resource performance degradation detection; routing table changes; Degradation; Filters; Grid computing; High performance computing; Java; Measurement; Monitoring; Performance loss; Routing; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing and the Grid, 2001. Proceedings. First IEEE/ACM International Symposium on
Conference_Location :
Brisbane, Qld.
Print_ISBN :
0-7695-1010-8
Type :
conf
DOI :
10.1109/CCGRID.2001.923220
Filename :
923220
Link To Document :
بازگشت