DocumentCode
2633521
Title
Detection of Local Outlier over Dynamic Data Streams Using Efficient Partitioning Method
Author
Elahi, Manzoor ; Li, Kun ; Nisar, Wasif ; Lv, Xinjie ; Wang, Hongan
Author_Institution
Intell. Eng. Lab., Inst. of Software Chinese Acad. of Sci., Beijing, China
Volume
4
fYear
2009
fDate
March 31 2009-April 2 2009
Firstpage
76
Lastpage
81
Abstract
Outlier detection is the process of detecting the data objects which are grossly different from or inconsistent with the remaining set of data. Some of the important applications in the field of data mining are fraud detection, customer behavior analysis, and intrusion detection. There are number of good research algorithms for detecting outliers if the entire data is available and algorithms can operate in more than single passes to achieve the required results. Among the existing methods, LOF (local outlier factor) a density based method is very efficient in detecting all forms of outliers. LOF algorithm can not be directly applied to the data stream as the large number of nearest neighbor searches, LOF computation and LRD (local reachability distances) can make it highly inefficient for data stream. In this paper we propose a cluster based partitioning algorithm which can divide the stream in safe region and candidate regions. In Second phase apply LOF algorithm over these partitions separately with some slight enhancement for LOF computation over candidate region to achieve accurate results for finding most outstanding outliers. Several experiments on different dataset confirm that our technique can find better outliers with low computational cost than the direct LOF or compared to the other enhancements proposed for LOF.
Keywords
data mining; object detection; pattern clustering; security of data; cluster based partitioning algorithm; clustering algorithm; customer behavior analysis; data mining; data object detection; databases; density based method; dynamic data stream; intrusion detection; local outlier detection; local reachability distances; partitioning method; Clustering algorithms; Computational efficiency; Data engineering; Data mining; Event detection; Intrusion detection; Nearest neighbor searches; Object detection; Partitioning algorithms; Unsupervised learning; LOF; datastream; mining; outlier;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location
Los Angeles, CA
Print_ISBN
978-0-7695-3507-4
Type
conf
DOI
10.1109/CSIE.2009.217
Filename
5170965
Link To Document