Title :
Efficient Outlier Detection Algorithm for Heterogeneous Data Streams
Author :
Ren, Jiadong ; Wu, Qunhui ; Zhang, Jia ; Hu, Changzhen
Author_Institution :
Coll. of Inf. Sci. & Eng., Yanshan Univ., Qinhuangdao, China
Abstract :
Data streams outlier mining is an important and active research issue in anomaly detection. Most of the existing outlier detection algorithms can only manipulate numeric attributes or categorical attributes. In this paper, we propose an efficient outlier detection algorithm based on heterogeneous data streams, which partitions the stream in chunks. Then each chunk is clustered and the corresponding clustering results are stored in cluster references. The representation degree and the number of adjacent cluster references of each cluster reference are computed to generate the final outlier references, which include potential outliers. Experimental results show that our approach has higher detection precision and better scalability.
Keywords :
data mining; pattern clustering; anomaly detection; cluster references; data streams outlier mining; heterogeneous data streams; outlier detection algorithm; Cities and towns; Clustering algorithms; Computer science; Data engineering; Detection algorithms; Educational institutions; Fuzzy systems; Information science; Knowledge engineering; Scalability; cluster reference; data streams; heterogeneous attributes; outlier detection;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.548