DocumentCode :
2064637
Title :
A validity index for outlier detection
Author :
Yousri, Noha A.
Author_Institution :
Comput. & Syst. Eng., Alexandria Univ., Alexandria, Egypt
fYear :
2010
fDate :
Nov. 29 2010-Dec. 1 2010
Firstpage :
325
Lastpage :
329
Abstract :
Defining a boundary between inliers and outliers is a major challenge in unsupervised outlier detection. In the absence of labeled data, the true outliers set cannot be evaluated. This lays the burden on both the choice of an efficient outlier detection criterion, and parameter selection. While numerous unsupervised outlier detection criteria, with different parameters, have been proposed, an unsupervised evaluation of outliers is still missing. This work introduces a theoretical basis, and proposes a validity index, to evaluate the quality of outliers. This is not a trivial problem when nothing is known about the structure and density of the data. The proposed index considers the outlierness quality, the deviation between characteristics of outliers and inliers, and the data distortion. Low and high dimensional data sets are used to evaluate the proposed index.
Keywords :
data analysis; pattern classification; pattern clustering; unsupervised learning; data distortion; outlierness quality; parameter selection; true outliers set; unsupervised evaluation; unsupervised outlier detection criteria; validity index; Accuracy; Conferences; Density measurement; Distortion measurement; Indexes; Intelligent systems; Pattern recognition; outlier analysis; validity index;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
Conference_Location :
Cairo
Print_ISBN :
978-1-4244-8134-7
Type :
conf
DOI :
10.1109/ISDA.2010.5687245
Filename :
5687245
Link To Document :
بازگشت