DocumentCode :
3307729
Title :
Mining high-quality clusters in pattern-based clustering
Author :
Qian Ma ; Jingfeng Guo
Author_Institution :
Modern Educ. Technol. Manage. Center, HengShui Coll., Hengshui, China
Volume :
2
fYear :
2011
fDate :
26-28 July 2011
Firstpage :
1152
Lastpage :
1156
Abstract :
Pattern-based clustering, which capture the similarity of the patterns exhibited by objects in a subset of dimensions, has broad applications in DNA microarray data analysis, customer segmentation, e-business data analysis, etc. However, pattern-based clustering often returns a large number of highly-overlapping clusters, which makes it hard for users to identify interesting patterns from the huge mining results. Moreover, there lacks a general measurement to evaluate the quality of Clusters which pattern-based clustering obtained. In this paper, we discuss factors which cause highly-overlapping, make error analysis and pattern weighting, and propose qScore as a key evaluation parameters on quality of Clusters. A algorithm which based on qScore is presented to solve the problem of high-overlapping and get better quality clustering results.
Keywords :
data mining; pattern clustering; error analysis; high-quality cluster mining; highly-overlapping clusters; pattern similarity; pattern weighting; pattern-based clustering; qScore; Algorithm design and analysis; Analytical models; Approximation algorithms; Clustering algorithms; Data analysis; Data mining; Error analysis; Pattern similarity; Pattern-based Clustering; error analysis; highly-Overlapping clusters; qScore;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-180-9
Type :
conf
DOI :
10.1109/FSKD.2011.6019708
Filename :
6019708
Link To Document :
بازگشت