DocumentCode :
2110981
Title :
Clustering high dimensional data streams based on N-most interesting itemsets mining
Author :
Fujiang Ao ; Jing Du ; Yu Jingyi ; Fuzhi Wang ; Qiong Wang
Author_Institution :
State Key Lab. of Complex Electromagn. Environ. Effects on Electron. & Inf. Syst., Luoyang, China
fYear :
2013
fDate :
23-25 July 2013
Firstpage :
412
Lastpage :
416
Abstract :
The key for clustering high dimensional data streams is finding dense units. Traditional methods apply frequent itemsets mining for finding dense units. Since these methods are not able to differentiate the density of units in subspaces with different dimensions, it is not in favor of finding dense units in the sparse subspace or the higher-dimension subspace. In this paper, we propose an algorithm, called CBNI (Clustering high dimensional data streams Based on N-most interesting Itemsets), which finds dense units based on N-most interesting itemsets mining and can solve this problem. The experimental results show that the CBNI algorithm performs better in terms of the scalability with dimensionality, the scalability with the number of points in dataset, and the cluster purity.
Keywords :
data mining; pattern clustering; CBNI; N-most interesting itemsets mining; cluster purity; dense units; frequent itemsets mining; high dimensional data streams clustering; higher-dimension subspace; sparse subspace; Algorithm design and analysis; Clustering algorithms; Data mining; Itemsets; Knowledge discovery; Partitioning algorithms; Scalability; N-most interesting itemsets; cluster; high dimensional data streams;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2013 10th International Conference on
Conference_Location :
Shenyang
Type :
conf
DOI :
10.1109/FSKD.2013.6816232
Filename :
6816232
Link To Document :
بازگشت