• DocumentCode
    3307729
  • Title

    Mining high-quality clusters in pattern-based clustering

  • Author

    Qian Ma ; Jingfeng Guo

  • Author_Institution
    Modern Educ. Technol. Manage. Center, HengShui Coll., Hengshui, China
  • Volume
    2
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    1152
  • Lastpage
    1156
  • Abstract
    Pattern-based clustering, which capture the similarity of the patterns exhibited by objects in a subset of dimensions, has broad applications in DNA microarray data analysis, customer segmentation, e-business data analysis, etc. However, pattern-based clustering often returns a large number of highly-overlapping clusters, which makes it hard for users to identify interesting patterns from the huge mining results. Moreover, there lacks a general measurement to evaluate the quality of Clusters which pattern-based clustering obtained. In this paper, we discuss factors which cause highly-overlapping, make error analysis and pattern weighting, and propose qScore as a key evaluation parameters on quality of Clusters. A algorithm which based on qScore is presented to solve the problem of high-overlapping and get better quality clustering results.
  • Keywords
    data mining; pattern clustering; error analysis; high-quality cluster mining; highly-overlapping clusters; pattern similarity; pattern weighting; pattern-based clustering; qScore; Algorithm design and analysis; Analytical models; Approximation algorithms; Clustering algorithms; Data analysis; Data mining; Error analysis; Pattern similarity; Pattern-based Clustering; error analysis; highly-Overlapping clusters; qScore;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-61284-180-9
  • Type

    conf

  • DOI
    10.1109/FSKD.2011.6019708
  • Filename
    6019708