DocumentCode
420299
Title
Knowledge-based clustering: a semi-autonomous algorithm using local and global data properties
Author
Bean, C.L. ; Kambhampati, C.
Author_Institution
Dept. of Comput. Sci., Hull Univ., UK
Volume
1
fYear
2004
fDate
27-30 June 2004
Firstpage
95
Abstract
Cluster analysis is a heuristic technique used to reveal inherent groupings in data, but most modern clustering algorithms are highly data and person dependent. This paper presents a clustering technique that minimises the need for user-defined parameters and handles both single and mixed attribute type data sets. The algorithm is based on elements of rough set theory and uses a combination of local and global data properties to obtain meaningful clustering solutions. It is self-consistent in its approach to clustering; thus ensuring the same clustering solution when applied to the same data by different users. The results from a range of real-world and synthetic data sets are used to establish its performance.
Keywords
knowledge based systems; minimisation; pattern clustering; rough set theory; statistical analysis; cluster analysis; clustering algorithms; global data properties; heuristic technique; inherent groupings; knowledge based clustering; local data properties; minimisation; mixed attribute type data sets; rough set theory; semiautonomous algorithm; user defined parameters; Algorithm design and analysis; Clustering algorithms; Computer science; Data analysis; Data mining; Data structures; Humans; Partitioning algorithms; Proposals; Set theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Information, 2004. Processing NAFIPS '04. IEEE Annual Meeting of the
Print_ISBN
0-7803-8376-1
Type
conf
DOI
10.1109/NAFIPS.2004.1336256
Filename
1336256
Link To Document