DocumentCode :
3731474
Title :
Outlier Mining Based on Variance of Angle Technology Research in High-Dimensional Data
Author :
Liu Wenting;Pan Ruikai
Author_Institution :
Coll. of Comput. &
fYear :
2015
Firstpage :
598
Lastpage :
603
Abstract :
Outlier mining in high dimensional data is currently one of the hot areas of data mining. The existing outlier mining methods are based on the distance in the full-dimensional Euclidean space. In high-dimensional data, these methods are bound to deteriorate due to the notorious "dimension disaster" which leads to distance measure can not express the original physical meaning and the low computational efficiency. This paper improves the method of angle-based outlier factor outlier and proposes the method of variance of angle-based outlier factor outlier. It introduces the related theories to guarantee the reliability of the method. The empirical experiments on synthetic data sets show that the method is efficient and scalable to large high-dimensional data sets.
Keywords :
"Data mining","Approximation algorithms","Correlation","Data models","Estimation","Electronic mail","Big data"
Publisher :
ieee
Conference_Titel :
Intelligent Systems and Knowledge Engineering (ISKE), 2015 10th International Conference on
Type :
conf
DOI :
10.1109/ISKE.2015.64
Filename :
7383111
Link To Document :
بازگشت