Title :
Frequent-pattern based iterative projected clustering
Author :
Yiu, Man Lung ; Mamoulis, Nikos
Author_Institution :
Dept. of Comput. Sci. & Inf. Syst., Hong Kong Univ., China
Abstract :
Irrelevant attributes add noise to high dimensional clusters and make traditional clustering techniques inappropriate. Projected clustering algorithms have been proposed to find the clusters in hidden subspaces. We realize the analogy between mining frequent itemsets and discovering the relevant subspace for a given cluster. We propose a methodology for finding projected clusters by mining frequent itemsets and present heuristics that improve its quality. Our techniques are evaluated with synthetic and real data; they are scalable and discover projected clusters accurately.
Keywords :
data mining; pattern clustering; statistical analysis; frequent itemset mining; hidden subspace; projected cluster discovery; projected clustering algorithm; real data; synthetic data; Character generation; Clustering algorithms; Computer science; Data mining; Databases; Information systems; Itemsets; Iterative algorithms; Lungs; Partitioning algorithms;
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1251009