DocumentCode
1571505
Title
Subspace Clustering of High Dimensional Data Streams
Author
Wang, Shuyun ; Fan, Yingjie ; Zhang, Chenghong ; Xu, Hexiang ; Hao, Xiulan ; Hu, Yunfa
Author_Institution
Dept. of Comput. & Inf. Technol., Fudan Univ., Shanghai
fYear
2008
Firstpage
165
Lastpage
170
Abstract
In this paper, SOStream, which is a novel algorithm of clustering over high dimensional online data stream is presented, it is based on subspace.-SOStream partitions the data space into grids, and maintains a superset of all dense units in an online way. A deterministic lower and upper bound of the selectivity of each maintained units are also given. With the maintained potential dense units, SOStream is capable of discovering the clusters in different subspaces over high dimensional data stream with arbitrary shape. The experimental results on real and synthetic datasets demonstrate the effectivity of the approach.
Keywords
data analysis; grid computing; pattern classification; SOStream algorithm; high dimensional data stream; subspace clustering; Clustering algorithms; Conference management; Grid computing; Information science; Information technology; Monitoring; Partitioning algorithms; Space technology; Technology management; Upper bound; Cluster; Data stream; high-dimensional; subspace;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Information Science, 2008. ICIS 08. Seventh IEEE/ACIS International Conference on
Conference_Location
Portland, OR
Print_ISBN
978-0-7695-3131-1
Type
conf
DOI
10.1109/ICIS.2008.58
Filename
4529815
Link To Document