DocumentCode
3748392
Title
PATHA: Performance Analysis Tool for HPC Applications
Author
Wucherl Yoo;Michelle Koo; Yi Cao;Alex Sim;Peter Nugent; Kesheng Wu
Author_Institution
Lawrence Berkeley National Laboratory, CA, USA
fYear
2015
Firstpage
1
Lastpage
8
Abstract
Large science projects rely on complex workflows to analyze terabytes or petabytes of data. These jobs are often running over thousands of CPU cores and simultaneously performing data accesses, data movements, and computation. It is difficult to identify bottlenecks or to debug the performance issues in these large workflows. To address these challenges, we have developed Performance Analysis Tool for HPC Applications (PATHA) using the state-of-art open source big data processing tools. Our framework can ingest system logs to extract key performance measures, and apply the most sophisticated statistical tools and data mining methods on the performance data. It utilizes an efficient data processing engine to allow users to interactively analyze a large amount of different types of logs and measurements. To illustrate the functionality of PATHA, we conduct a case study on the workflows from an astronomy project known as the Palomar Transient Factory (PTF). Our study processed 1.6 TB of system logs collected on the NERSC supercomputer Edison. Using PATHA, we were able to identify performance bottlenecks, which reside in three tasks of PTF workflow with the dependency on the density of celestial objects.
Keywords
"Supercomputers","Extraterrestrial measurements","Iron"
Publisher
ieee
Conference_Titel
Computing and Communications Conference (IPCCC), 2015 IEEE 34th International Performance
Electronic_ISBN
2374-9628
Type
conf
DOI
10.1109/PCCC.2015.7410313
Filename
7410313
Link To Document