DocumentCode
399779
Title
Clustering item data sets with association-taxonomy similarity
Author
Yun, Ching-Huang ; Chuang, Kun-Ta ; Chen, Ming-Syan
Author_Institution
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2003
fDate
19-22 Nov. 2003
Firstpage
697
Lastpage
700
Abstract
We explore here the efficient clustering of item data. Different from those of the traditional data, the features of item data are known to be of high dimensionality and sparsity. In view of the features of item data, we devise here a novel measurement, called the association-taxonomy similarity, and utilize this measurement to perform the clustering. With this association-taxonomy similarity measurement, we develop an efficient clustering algorithm, called algorithm AT (standing for association-taxonomy), for item data. Two validation indexes based on association and taxonomy properties are also devised to assess the quality of clustering for item data. As validated by the real dataset, it is shown by our experimental results that algorithm AT devised here significantly outperforms the prior works in the clustering quality as measured by the validation indexes, indicating the usefulness of association-taxonomy similarity in item data clustering.
Keywords
data mining; statistical analysis; association-taxonomy algorithm; association-taxonomy similarity; item data clustering; real dataset; validation index; Association rules; Clustering algorithms; Data analysis; Data engineering; Data mining; Electronic commerce; Itemsets; Performance evaluation; Taxonomy; Transaction databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN
0-7695-1978-4
Type
conf
DOI
10.1109/ICDM.2003.1251011
Filename
1251011
Link To Document