Title :
ADMiRe: an algebraic data mining approach to system performance analysis
Author :
Jiang, Ning ; Villafane, Roy ; Hua, Kien A. ; Sawant, Abhijit ; Prabhakara, Kiran
Author_Institution :
Sch. of Comput. Sci., Univ. of Central Florida, Orlando, FL, USA
fDate :
7/1/2005 12:00:00 AM
Abstract :
Performance analysis of computing systems is an increasingly difficult task due to growing system complexity. Traditional tools rely on ad hoc procedures. With these, determining which of the manifold system and workload parameters to examine is often a lengthy and highly speculative process. The analysis is often incomplete and, therefore, prone to revealing faulty conclusions and not uncovering useful tuning knowledge. We address this problem by introducing a data mining approach called ADMiRe (analyzer for data mining results). In this scheme, regression analysis is first applied to performance data to discover correlations between various system and workload parameters. The results of this analysis are summarized in sets of regression rules. The user can then formulate intuitive algebraic expressions to manipulate these sets of rules to capture critical information. To demonstrate this approach, we use ADMiRe to analyze an Oracle database system running the TPC-C (Transaction Processing Performance Council) benchmark. The results generated by ADMiRe were confirmed by Oracle experts. We also show that by applying ADMiRe to Microsoft Internet Information Server performance data, we can improve system performance by 20 percent.
Keywords :
data mining; database management systems; symbol manipulation; ADMiRe algebraic data mining; Microsoft Internet Information Server performance data; Oracle database system; TPC-C benchmark; correlation discovery; intuitive algebraic expression formulation; knowledge management; regression analysis; system performance analysis; Councils; Data analysis; Data mining; Database systems; Hardware; Internet; Performance analysis; Regression analysis; System performance; Web server; Index Terms- Data mining; algorithms for data and knowledge management.; performance of systems;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2005.103