Title :
Detection of Local Outlier over Dynamic Data Streams Using Efficient Partitioning Method
Author :
Elahi, Manzoor ; Li, Kun ; Nisar, Wasif ; Lv, Xinjie ; Wang, Hongan
Author_Institution :
Intell. Eng. Lab., Inst. of Software Chinese Acad. of Sci., Beijing, China
fDate :
March 31 2009-April 2 2009
Abstract :
Outlier detection is the process of detecting the data objects which are grossly different from or inconsistent with the remaining set of data. Some of the important applications in the field of data mining are fraud detection, customer behavior analysis, and intrusion detection. There are number of good research algorithms for detecting outliers if the entire data is available and algorithms can operate in more than single passes to achieve the required results. Among the existing methods, LOF (local outlier factor) a density based method is very efficient in detecting all forms of outliers. LOF algorithm can not be directly applied to the data stream as the large number of nearest neighbor searches, LOF computation and LRD (local reachability distances) can make it highly inefficient for data stream. In this paper we propose a cluster based partitioning algorithm which can divide the stream in safe region and candidate regions. In Second phase apply LOF algorithm over these partitions separately with some slight enhancement for LOF computation over candidate region to achieve accurate results for finding most outstanding outliers. Several experiments on different dataset confirm that our technique can find better outliers with low computational cost than the direct LOF or compared to the other enhancements proposed for LOF.
Keywords :
data mining; object detection; pattern clustering; security of data; cluster based partitioning algorithm; clustering algorithm; customer behavior analysis; data mining; data object detection; databases; density based method; dynamic data stream; intrusion detection; local outlier detection; local reachability distances; partitioning method; Clustering algorithms; Computational efficiency; Data engineering; Data mining; Event detection; Intrusion detection; Nearest neighbor searches; Object detection; Partitioning algorithms; Unsupervised learning; LOF; datastream; mining; outlier;
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
DOI :
10.1109/CSIE.2009.217