DocumentCode
3751513
Title
A Short Survey on Data Clustering Algorithms
Author
Ka-Chun Wong
Author_Institution
Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon Tong, China
fYear
2015
Firstpage
64
Lastpage
68
Abstract
With rapidly increasing data, clustering algorithms are important tools for data analytics in modern research. They have been successfully applied to a wide range of domains, for instance, bioinformatics, speech recognition, and financial analysis. Formally speaking, given a set of data instances, a clustering algorithm is expected to divide the set of data instances into the subsets which maximize the intra-subset similarity and inter-subset dissimilarity, where a similarity measure is defined beforehand. In this work, the state-of-the-arts clustering algorithms are reviewed from design concept to methodology, Different clustering paradigms are discussed. Advanced clustering algorithms are also discussed. After that, the existing clustering evaluation metrics are reviewed. A summary with future insights is provided at the end.
Keywords
"Clustering algorithms","Clustering methods","Indexes","Algorithm design and analysis","Correlation","Benchmark testing","Bioinformatics"
Publisher
ieee
Conference_Titel
Soft Computing and Machine Intelligence (ISCMI), 2015 Second International Conference on
Type
conf
DOI
10.1109/ISCMI.2015.10
Filename
7414675
Link To Document