DocumentCode :
2518820
Title :
A novel approach for mining multiple data streams based on lag correlation
Author :
Zhang, Tiancheng ; Yue, Dejun ; Wang, Yanqiu ; Yu, Ge
Author_Institution :
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
fYear :
2011
fDate :
23-25 May 2011
Firstpage :
2377
Lastpage :
2382
Abstract :
Correlation analysis is a key problem for data stream analysis. In this paper, we propose a correlation analysis method for multiple dimensional data streams, which is based on the Boolean lag representation and the PCA (Principal Component Analysis). Firstly, the raw stream sequence is transformed into the Boolean sequence. By the correlation analysis of Boolean sequences, we can easily find the sequence pairs with lag correlations by means of simple bit operations. Secondly, we compute the lag time and synchronize the multiple dimensional data stream. Thirdly, the PCA method is deployed to reduce the multiple data streams, and we can reconstruct the data streams by a few principal components. The experimental evaluations show that the method has high computation performance with high accuracy.
Keywords :
Boolean functions; correlation methods; data mining; principal component analysis; Boolean lag representation; Boolean sequence; PCA; lag correlation; multiple data streams mining; principal component analysis; Algorithm design and analysis; Complexity theory; Correlation; Educational institutions; Principal component analysis; Synchronization; Transforms; Boolean; PCA; data stream; lag correlation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control and Decision Conference (CCDC), 2011 Chinese
Conference_Location :
Mianyang
Print_ISBN :
978-1-4244-8737-0
Type :
conf
DOI :
10.1109/CCDC.2011.5968606
Filename :
5968606
Link To Document :
بازگشت