DocumentCode :
3310519
Title :
Clustering algorithms and validity measures
Author :
Halkidi, M. ; Batistakis, Y. ; Vazirgiannis, M.
Author_Institution :
Dept. of Inf., Athens Univ. of Econ. & Bus., Greece
fYear :
2001
fDate :
2001
Firstpage :
3
Lastpage :
22
Abstract :
Clustering aims at discovering groups and identifying interesting distributions and patterns in data sets. Researchers have extensively studied clustering since it arises in many application domains in engineering and social sciences. In the last years the availability of huge transactional and experimental data sets and the arising requirements for data mining created needs for clustering algorithms that scale and can be applied in diverse domains. The paper surveys clustering methods and approaches available in the literature in a comparative way. It also presents the basic concepts, principles and assumptions upon which the clustering algorithms are based. Another important issue is the validity of the clustering schemes resulting from applying algorithms. This is also related to the inherent features of the data set under concern. We review and compare clustering validity measures available in the literature. Furthermore, we illustrate the issues that are under-addressed by the recent algorithms and we address new research directions
Keywords :
data mining; pattern clustering; transaction processing; very large databases; clustering algorithms; clustering methods; clustering schemes; clustering validity measures; data mining; diverse domains; engineering; huge experimental data sets; interesting distributions; research directions; social sciences; validity measures; Biomedical engineering; Clustering algorithms; Clustering methods; Data mining; Databases; Engineering in medicine and biology; Informatics; Partitioning algorithms; Pattern recognition; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Scientific and Statistical Database Management, 2001. SSDBM 2001. Proceedings. Thirteenth International Conference on
Conference_Location :
Fairfax, VA
ISSN :
1099-3371
Print_ISBN :
0-7695-1218-6
Type :
conf
DOI :
10.1109/SSDM.2001.938534
Filename :
938534
Link To Document :
بازگشت