Title :
Irregular Grid-Based Clustering over High-Dimensional Data Streams
Author :
Hou, GuiBin ; Yao, RuiXia ; Ren, Jiadong ; Hu, Changzhen
Author_Institution :
Coll. of Inf. Sci. & Eng., Yanshan Univ., Qinhuangdao, China
Abstract :
Clustering high-dimensional data stream is a difficult and important problem. Grid-based algorithms are easily influenced by the size and borders of the grid. To overcome the weakness, we propose a new Irregualr Grid-based Clustering algorithm for high-dimensional data streams, called IGDCL. This method incorporates an irregular grid structure and subspace clustering algorithm. In this paper, an irregular grid structure is generated by means of splitting each dimension into different grid cells. With new data arriving, the irregular grid structure is dynamically adjusted. We assign a fading density function for each data point to embody the evolution of data streams. The final clusters are obtained in subspaces which are formed by dimensions associated with corresponding clusters. Experimental results demonstrate that IGDCL has higher clustering quality than CluStream.
Keywords :
grid computing; media streaming; pattern clustering; CluStream; IGDCL; fading density function; grid cell; high dimensional data stream; irregular grid based clustering; irregular grid structure; Clustering algorithms; Fading; Heuristic algorithms; Noise; Partitioning algorithms; Real time systems; Signal processing algorithms; clustering; high-dimensional data stream; irregular grid;
Conference_Titel :
Pervasive Computing Signal Processing and Applications (PCSPA), 2010 First International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-8043-2
Electronic_ISBN :
978-0-7695-4180-8
DOI :
10.1109/PCSPA.2010.195