DocumentCode :
480154
Title :
DB-Outlier Detection Algorithm Using Divide and Conquer Approach over Dynamic DataStream
Author :
Elahi, Manzoor ; Lv, Xinjie ; Nisar, Wasif ; Khan, Imran Ali ; Qiao, Ying ; Wang, Hongan
Author_Institution :
Intell. Eng. Lab., Chinese Acad. of Sci., Beijing
Volume :
4
fYear :
2008
fDate :
12-14 Dec. 2008
Firstpage :
438
Lastpage :
443
Abstract :
Anomaly detection is currently an important and active research problem in many fields and involved in numerous applications. Most of the existing methods are based on distance measure which can produce better results as compared to other methods. But in case of datastream these methods are computationally less efficient. In order to find some specified number of neighbors for each data element make these methods inappropriate for datasteam environment, because of the huge volume of DataStream. Moreover, declaring a point as an outlier as it arrives, can often lead us to a wrong decision. In this paper we introduced an algorithm which divide incoming stream in chunks and make preliminary clusters in each chunk during the first stage. In the second stage K-nearest neighbor approach for outlier detection is applied to each cluster. In this way we can reduce the nearest neighbor searches for each data element as well as can detect better outliers with evolution of data stream. Several experiments on different datasets confirm that our technique can find suitable outliers with low computational cost than the other exiting distance based approaches for outlier detection over data stream.
Keywords :
electronic data interchange; inference mechanisms; pattern recognition; anomaly detection; data element; data stream; divide and conquer approach; dynamic DataStream; k-nearest neighbor approach; nearest neighbor search; outlier detection algorithm; Application software; Computer science; Costs; Data engineering; Data mining; Databases; Detection algorithms; Information technology; Nearest neighbor searches; Software engineering; datastream; mining; outlier;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
Type :
conf
DOI :
10.1109/CSSE.2008.1053
Filename :
4722653
Link To Document :
بازگشت