Title :
Interesting Subset Discovery and Its Application on Service Processes
Author :
Natu, Maitreya ; Palshikar, Girish Keshav
Author_Institution :
Tata Res. Dev. & Design Centre, Tata Consultancy Services Ltd., Pune, India
Abstract :
Various real-life datasets can be viewed as a set of records consisting of attributes explaining the records and set of measures evaluating the records. In this paper, we address the problem of automatically discovering interesting subsets from such a dataset, such that the discovered interesting subsets have significantly different characteristics of performance than the rest of the dataset. We present an algorithm to discover such interesting subsets. The proposed algorithm uses a generic domain-independent definition of interestingness and uses various heuristics to intelligently prune the search space in order to build a solution scalable to large size datasets. This paper presents application of the interesting subset discovery algorithm on four real-world case-studies and demonstrates the effectiveness of the interesting subset discovery algorithm in extracting insights in order to identify problem areas and provide improvement recommendations to wide variety of systems.
Keywords :
data mining; attributes; interesting subset discovery algorithm; search space; service processes; Data mining for service processes; Impact analysis; Interesting subset discovery; Subgroup discovery;
Conference_Titel :
Data Mining Workshops (ICDMW), 2010 IEEE International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-9244-2
Electronic_ISBN :
978-0-7695-4257-7
DOI :
10.1109/ICDMW.2010.98