• DocumentCode
    589211
  • Title

    Incremental Mitosis: Discovering Clusters of Arbitrary Shapes and Densities in Dynamic Data

  • Author

    Ibrahim, Roliana ; Ahmed, Nova ; Yousri, Noha A. ; Ismail, Muhammad Ali

  • Author_Institution
    Comput. & Syst. Eng., Univ. of Alexandria, Alexandria, Egypt
  • Volume
    1
  • fYear
    2012
  • fDate
    12-15 Dec. 2012
  • Firstpage
    102
  • Lastpage
    107
  • Abstract
    While finding natural clusters in high dimensional data is in itself a challenge, the dynamic nature of data adds another greater challenge. Many applications such as Data Warehouses and WWW demand the presence of efficient incremental clustering algorithms to handle their dynamic data. So far, numerous useful incremental clustering algorithms have been developed for large datasets such as incremental K-means, incremental DBSCAN, similarity histogram-based clustering (SHC) and mean shift. However, targeting clusters of different shapes and densities is yet to be efficiently tackled. In this work, an efficient incremental clustering algorithm (Incremental Mitosis) is proposed. It is based on Mitosis clustering algorithm which maximizes the relatedness of distances between patterns of the same cluster. The proposed algorithm is able to discover clusters of arbitrary shapes and densities in dynamic high dimensional data. Experimental results show that the proposed algorithm efficiently clusters the data and maintains the accuracy of Mitosis algorithm.
  • Keywords
    data handling; pattern clustering; unsupervised learning; SHC; WWW; World Wide Web; arbitrary densities; arbitrary shapes; cluster discovery; data warehouses; dynamic data handling; high dimensional data; incremental DBSCAN; incremental K-means; incremental mitosis clustering algorithm; mean shift; similarity histogram-based clustering; unsupervised learning process; Character recognition; Clustering algorithms; Dynamic range; Heuristic algorithms; Indexes; Merging; Shape; arbitrary shapes and densities; dynamic data; high dimensional data; incremental clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2012 11th International Conference on
  • Conference_Location
    Boca Raton, FL
  • Print_ISBN
    978-1-4673-4651-1
  • Type

    conf

  • DOI
    10.1109/ICMLA.2012.26
  • Filename
    6406596