Title :
Knowledge-based clustering: a semi-autonomous algorithm using local and global data properties
Author :
Bean, C.L. ; Kambhampati, C.
Author_Institution :
Dept. of Comput. Sci., Hull Univ., UK
Abstract :
Cluster analysis is a heuristic technique used to reveal inherent groupings in data, but most modern clustering algorithms are highly data and person dependent. This paper presents a clustering technique that minimises the need for user-defined parameters and handles both single and mixed attribute type data sets. The algorithm is based on elements of rough set theory and uses a combination of local and global data properties to obtain meaningful clustering solutions. It is self-consistent in its approach to clustering; thus ensuring the same clustering solution when applied to the same data by different users. The results from a range of real-world and synthetic data sets are used to establish its performance.
Keywords :
knowledge based systems; minimisation; pattern clustering; rough set theory; statistical analysis; cluster analysis; clustering algorithms; global data properties; heuristic technique; inherent groupings; knowledge based clustering; local data properties; minimisation; mixed attribute type data sets; rough set theory; semiautonomous algorithm; user defined parameters; Algorithm design and analysis; Clustering algorithms; Computer science; Data analysis; Data mining; Data structures; Humans; Partitioning algorithms; Proposals; Set theory;
Conference_Titel :
Fuzzy Information, 2004. Processing NAFIPS '04. IEEE Annual Meeting of the
Print_ISBN :
0-7803-8376-1
DOI :
10.1109/NAFIPS.2004.1336256