• DocumentCode
    3758701
  • Title

    A novel algorithm DBCAPSIC for clustering non-numeric data

  • Author

    Jinkun Geng;Daren Ye;Ping Luo

  • Author_Institution
    School of Software, Beihang University, Beijing 100191, China
  • fYear
    2015
  • Firstpage
    295
  • Lastpage
    304
  • Abstract
    Data mining techniques are playing an important role in the analysis of mass network information and big data nowadays. The cluster analysis, as a main kind of method in data mining, draws great interest from researchers of various fields who proposed many algorithms such as k-means algorithm and its variants, density-based algorithm and its variants. However, these algorithms all have their own problems. This paper focuses on some of the problems and proposes a novel algorithm DBCAPSIC. The algorithm overcomes the k-means algorithm´s sensitivity to initial conditions and avoids common density-based algorithms´ "clustering failure" in some cases. Also, the algorithm has the linear time complexity of O{n), compared to the quadratic time complexity of common density-based clustering algorithms.
  • Keywords
    "Clustering algorithms","Algorithm design and analysis","Time complexity","Sensitivity","Partitioning algorithms","Cost function","Data mining"
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), 2015 IEEE
  • Print_ISBN
    978-1-4799-1979-6
  • Type

    conf

  • DOI
    10.1109/IAEAC.2015.7428564
  • Filename
    7428564