• DocumentCode
    3104659
  • Title

    COALA: A Novel Approach for the Extraction of an Alternate Clustering of High Quality and High Dissimilarity

  • Author

    Bae, Eric ; Bailey, James

  • Author_Institution
    NICTA Victoria Lab. Dept. of Comput. Sci. & Software Eng., Melbourne, Univ., Melbourne, VIC
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    53
  • Lastpage
    62
  • Abstract
    Cluster analysis has long been a fundamental task in data mining and machine learning. However, traditional clustering methods concentrate on producing a single solution, even though multiple alternative clusterings may exist. It is thus difficult for the user to validate whether the given solution is in fact appropriate, particularly for large and complex datasets. In this paper we explore the critical requirements for systematically finding a new clustering, given that an already known clustering is available and we also propose a novel algorithm, COALA, to discover this new clustering. Our approach is driven by two important factors; dissimilarity and quality. These are especially important for finding a new clustering which is highly informative about the underlying structure of data, but is at the same time distinctively different from the provided clustering. We undertake an experimental analysis and show that our method is able to outperform existing techniques, for both synthetic and real datasets.
  • Keywords
    pattern clustering; COALA; cluster analysis; data mining; machine learning; multiple alternative clustering; Clustering algorithms; Clustering methods; Computer science; Data mining; Laboratories; Machine learning; Merging; Proteins; Search engines; Software engineering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2006. ICDM '06. Sixth International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2701-7
  • Type

    conf

  • DOI
    10.1109/ICDM.2006.37
  • Filename
    4053034