• DocumentCode
    1202695
  • Title

    Iterative projected clustering by subspace mining

  • Author

    Yiu, Man Lung ; Mamoulis, Nikos

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Syst., Hong Kong Univ., China
  • Volume
    17
  • Issue
    2
  • fYear
    2005
  • Firstpage
    176
  • Lastpage
    189
  • Abstract
    Irrelevant attributes add noise to high-dimensional clusters and render traditional clustering techniques inappropriate. Recently, several algorithms that discover projected clusters and their associated subspaces have been proposed. We realize the analogy between mining frequent itemsets and discovering dense projected clusters around random points. Based on this, we propose a technique that improves the efficiency of a projected clustering algorithm (DOC). Our method is an optimized adaptation of the frequent pattern tree growth method used for mining frequent itemsets. We propose several techniques that employ the branch and bound paradigm to efficiently discover the projected clusters. An experimental study with synthetic and real data demonstrates that our technique significantly improves on the accuracy and speed of previous techniques.
  • Keywords
    Monte Carlo methods; data mining; database management systems; pattern classification; pattern clustering; tree searching; association rule; branch and bound paradigm; database management; frequent itemset mining; iterative projected clustering; pattern classification; projected clustering algorithm; subspace mining; tree growth method; Acoustic noise; Association rules; Clustering algorithms; Clustering methods; Data mining; Image databases; Itemsets; Iterative algorithms; Lungs; Optimization methods;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2005.29
  • Filename
    1377170