DocumentCode
1202695
Title
Iterative projected clustering by subspace mining
Author
Yiu, Man Lung ; Mamoulis, Nikos
Author_Institution
Dept. of Comput. Sci. & Inf. Syst., Hong Kong Univ., China
Volume
17
Issue
2
fYear
2005
Firstpage
176
Lastpage
189
Abstract
Irrelevant attributes add noise to high-dimensional clusters and render traditional clustering techniques inappropriate. Recently, several algorithms that discover projected clusters and their associated subspaces have been proposed. We realize the analogy between mining frequent itemsets and discovering dense projected clusters around random points. Based on this, we propose a technique that improves the efficiency of a projected clustering algorithm (DOC). Our method is an optimized adaptation of the frequent pattern tree growth method used for mining frequent itemsets. We propose several techniques that employ the branch and bound paradigm to efficiently discover the projected clusters. An experimental study with synthetic and real data demonstrates that our technique significantly improves on the accuracy and speed of previous techniques.
Keywords
Monte Carlo methods; data mining; database management systems; pattern classification; pattern clustering; tree searching; association rule; branch and bound paradigm; database management; frequent itemset mining; iterative projected clustering; pattern classification; projected clustering algorithm; subspace mining; tree growth method; Acoustic noise; Association rules; Clustering algorithms; Clustering methods; Data mining; Image databases; Itemsets; Iterative algorithms; Lungs; Optimization methods;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2005.29
Filename
1377170
Link To Document