DocumentCode
3104659
Title
COALA: A Novel Approach for the Extraction of an Alternate Clustering of High Quality and High Dissimilarity
Author
Bae, Eric ; Bailey, James
Author_Institution
NICTA Victoria Lab. Dept. of Comput. Sci. & Software Eng., Melbourne, Univ., Melbourne, VIC
fYear
2006
fDate
18-22 Dec. 2006
Firstpage
53
Lastpage
62
Abstract
Cluster analysis has long been a fundamental task in data mining and machine learning. However, traditional clustering methods concentrate on producing a single solution, even though multiple alternative clusterings may exist. It is thus difficult for the user to validate whether the given solution is in fact appropriate, particularly for large and complex datasets. In this paper we explore the critical requirements for systematically finding a new clustering, given that an already known clustering is available and we also propose a novel algorithm, COALA, to discover this new clustering. Our approach is driven by two important factors; dissimilarity and quality. These are especially important for finding a new clustering which is highly informative about the underlying structure of data, but is at the same time distinctively different from the provided clustering. We undertake an experimental analysis and show that our method is able to outperform existing techniques, for both synthetic and real datasets.
Keywords
pattern clustering; COALA; cluster analysis; data mining; machine learning; multiple alternative clustering; Clustering algorithms; Clustering methods; Computer science; Data mining; Laboratories; Machine learning; Merging; Proteins; Search engines; Software engineering;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location
Hong Kong
ISSN
1550-4786
Print_ISBN
0-7695-2701-7
Type
conf
DOI
10.1109/ICDM.2006.37
Filename
4053034
Link To Document