• DocumentCode
    399779
  • Title

    Clustering item data sets with association-taxonomy similarity

  • Author

    Yun, Ching-Huang ; Chuang, Kun-Ta ; Chen, Ming-Syan

  • Author_Institution
    Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2003
  • fDate
    19-22 Nov. 2003
  • Firstpage
    697
  • Lastpage
    700
  • Abstract
    We explore here the efficient clustering of item data. Different from those of the traditional data, the features of item data are known to be of high dimensionality and sparsity. In view of the features of item data, we devise here a novel measurement, called the association-taxonomy similarity, and utilize this measurement to perform the clustering. With this association-taxonomy similarity measurement, we develop an efficient clustering algorithm, called algorithm AT (standing for association-taxonomy), for item data. Two validation indexes based on association and taxonomy properties are also devised to assess the quality of clustering for item data. As validated by the real dataset, it is shown by our experimental results that algorithm AT devised here significantly outperforms the prior works in the clustering quality as measured by the validation indexes, indicating the usefulness of association-taxonomy similarity in item data clustering.
  • Keywords
    data mining; statistical analysis; association-taxonomy algorithm; association-taxonomy similarity; item data clustering; real dataset; validation index; Association rules; Clustering algorithms; Data analysis; Data engineering; Data mining; Electronic commerce; Itemsets; Performance evaluation; Taxonomy; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
  • Print_ISBN
    0-7695-1978-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2003.1251011
  • Filename
    1251011