DocumentCode :
1842815
Title :
Class compactness for data clustering
Author :
Song, Yuqing
Author_Institution :
Key Lab. of Intell. Inf. Process., Chinese Acad. of Sci., Beijing, China
fYear :
2010
fDate :
4-6 Aug. 2010
Firstpage :
86
Lastpage :
91
Abstract :
In this paper we introduce a compactness based clustering algorithm. The compactness of a data class is measured by comparing the inter-subset and intra-subset distances. The class compactness of a subset is defined as the ratio of the two distances. A subset is called an isolated cluster (or icluster) if its class compactness is greater than 1. All iclusters make a containment tree. We introduce monotonic sequences of iclusters to simplify the structure of the icluster tree, based on which a clustering algorithm is designed. The algorithm has the following advantages: it is effective on data sets with clusters nonlinearly separated, of arbitrary shapes, or of different densities. The effectiveness of the algorithm is demonstrated by experiments.
Keywords :
pattern clustering; class compactness; data clustering; inter-subset distances; intra-subset distances; isolated cluster; monotonic sequences; Algorithm design and analysis; Clustering algorithms; Joining processes; Kernel; Partitioning algorithms; Pediatrics; Shape; class compactness; hierarchical clustering; icluster; monotonic sequence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration (IRI), 2010 IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-8097-5
Type :
conf
DOI :
10.1109/IRI.2010.5558958
Filename :
5558958
Link To Document :
بازگشت