DocumentCode :
3292114
Title :
A Novel Outlier Detection Algorithm for Distributed Databases
Author :
Zhou, Jiaogen ; Zhao, Chunjiang ; Wan, You ; Huang, Wenjiang ; Yang, Baozhu ; Ge, Jixin
Author_Institution :
NERCITA, Beijing Acad. of Agric. & Forestry Sci., Beijing
Volume :
5
fYear :
2008
fDate :
18-20 Oct. 2008
Firstpage :
293
Lastpage :
297
Abstract :
Traditional outlier detection algorithms are designed to apply to centralized databases, not distributed databases. We proposed a novel outlier detection algorithm for distributed databases. Given data assigned to different network nodes of a network platform, where each node has its own memory and hard disc, and the communication between nodes driven by message, the populated data would be non-overlapping. The working way of the network system is a manager-worker mode, that is, that a node as manager is responsible for assigning tasks to worker and querying the results from worker nodes. The algorithm first detected local outliers based on distance on all nodes, and then identified local outliers collected in the central node where a globally screening operation on all local outliers was implemented to achieve really global outliers. To scale the algorithm to massive data and reduce its computing complexity, a data filtering technology was further presented. Experimental results demonstrated that the algorithm effectively and efficiently handled on real and artificial data.
Keywords :
computational complexity; distributed databases; centralized databases; computational complexity; data filtering technology; distributed databases; manager-worker mode; outlier detection algorithm; Algorithm design and analysis; Application software; Classification algorithms; Clustering algorithms; Detection algorithms; Distributed computing; Distributed databases; Filtering; Nearest neighbor searches; Statistical distributions; data mining; distributed database; outlier detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Jinan Shandong
Print_ISBN :
978-0-7695-3305-6
Type :
conf
DOI :
10.1109/FSKD.2008.422
Filename :
4666540
Link To Document :
بازگشت