• DocumentCode
    3751513
  • Title

    A Short Survey on Data Clustering Algorithms

  • Author

    Ka-Chun Wong

  • Author_Institution
    Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon Tong, China
  • fYear
    2015
  • Firstpage
    64
  • Lastpage
    68
  • Abstract
    With rapidly increasing data, clustering algorithms are important tools for data analytics in modern research. They have been successfully applied to a wide range of domains, for instance, bioinformatics, speech recognition, and financial analysis. Formally speaking, given a set of data instances, a clustering algorithm is expected to divide the set of data instances into the subsets which maximize the intra-subset similarity and inter-subset dissimilarity, where a similarity measure is defined beforehand. In this work, the state-of-the-arts clustering algorithms are reviewed from design concept to methodology, Different clustering paradigms are discussed. Advanced clustering algorithms are also discussed. After that, the existing clustering evaluation metrics are reviewed. A summary with future insights is provided at the end.
  • Keywords
    "Clustering algorithms","Clustering methods","Indexes","Algorithm design and analysis","Correlation","Benchmark testing","Bioinformatics"
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Machine Intelligence (ISCMI), 2015 Second International Conference on
  • Type

    conf

  • DOI
    10.1109/ISCMI.2015.10
  • Filename
    7414675