DocumentCode
2197067
Title
Dynamic monitoring of high-performance distributed applications
Author
Gunter, Dan ; Tierney, Brian ; Jackson, Keith ; Lee, Jason ; Stoufer, Martin
Author_Institution
Comput. Sci. Directorate, California Univ., Berkeley, CA, USA
fYear
2002
fDate
2002
Firstpage
163
Lastpage
170
Abstract
Developers and users of high-performance distributed systems often observe performance problems such as unexpectedly low throughput or high latency. Determining the source of the performance problems requires detailed end-to-end instrumentation of all components, including the applications, operating systems, hosts, and networks. However, one must be very careful to design the instrumentation to have extremely low overhead, and not affect the system being monitored. In this paper we present a very light-weight instrumentation system that can be dynamically activated to unobtrusively collect and aggregate detailed end-to-end monitoring information from distributed applications. We also show how emerging "web services" can be used to facilitate remote interaction with this system.
Keywords
Internet; data acquisition; distributed processing; monitoring; GridFTP; NetLogger Toolkit; data acquisition; dynamic monitoring; end-to-end monitoring; high-performance distributed systems; instrumentation system; web services; Computer buffers; Condition monitoring; Distributed computing; Grid computing; High performance computing; Instruments; Laboratories; Libraries; Pipelines; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Distributed Computing, 2002. HPDC-11 2002. Proceedings. 11th IEEE International Symposium on
ISSN
1082-8907
Print_ISBN
0-7695-1686-6
Type
conf
DOI
10.1109/HPDC.2002.1029915
Filename
1029915
Link To Document