DocumentCode :
2813715
Title :
A mathematical model of similarity and clustering
Author :
Sun, Fu-Shing ; Tzeng, Chun-Hung
Author_Institution :
Dept. of Comput. Sci., Ball State Univ., Muncie, IN, USA
Volume :
1
fYear :
2004
fDate :
5-7 April 2004
Firstpage :
460
Abstract :
This paper introduces an abstract model of data similarity and clustering. A similarity on a space Ω is formulated explicitly by a reflexive and symmetric binary relation, called a tolerance relation, for which we introduce three types of coverings of Ω. Given a covering U, a clustering is defined to be minimal sub-covering. To search for an optimal clustering is to minimize the number of clusters, which is intractable in general. This paper proposes a heuristic method to search for sub-optimal clusterings for a given tolerance relation.
Keywords :
data mining; data structures; pattern clustering; search problems; data clustering; data similarity; heuristic method; minimal subcovering; optimal clustering; reflexive binary relation; symmetric binary relation; tolerance relation; Clustering algorithms; Computer science; Data mining; Electronic mail; Euclidean distance; Fractals; Lattices; Mathematical model; Sun; Topology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN :
0-7695-2108-8
Type :
conf
DOI :
10.1109/ITCC.2004.1286499
Filename :
1286499
Link To Document :
بازگشت