Title :
Outlier detection based on k-neighborhood MST
Author :
Qingsheng Zhu ; Xiaogang Fan ; Ji Feng
Author_Institution :
Chongqing Key Lab. of Software Theor. & Technol. Coll. of Comput. Sci., Chongqing Univ., Chongqing, China
Abstract :
Outlier detection is an important task in data mining. It is mainly used for finding strange mechanism or potential danger. This paper presents an outlier detection algorithm based on k-neighborhood minimum spanning tree(MST). This algorithm is applicable to data sets of any arbitrary shape and density and can effectively detect local outliers and local outlying clusters. Taking density and directional factor into consideration, this algorithm proposes a new dissimilarity measure based on k-neighborhood. Then a minimum spanning tree (MST) is built based on this k-neighborhood dissimilarity measure. Finally, the tree is progressively constrained to cutting so that the outliers can be found. Compared with algorithm LOF, COF, KNN and INFLO, the result proves the effectiveness and excellence of this new algorithm.
Keywords :
data mining; trees (mathematics); data mining; dissimilarity measure; k-neighborhood MST; minimum spanning tree; outlier detection; Clustering algorithms; Data mining; Density measurement; Euclidean distance; Partitioning algorithms; Shape; Time complexity; Dissimilarity; MST; Outlier detection; Outlying cluster; k-neighborhood;
Conference_Titel :
Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
DOI :
10.1109/IRI.2014.7051960