Title :
An outlier mining algorithm based on characteristic attribute subspace
Author :
Liu, Aiqin ; Zhang, He
Author_Institution :
Sch. of Comput. Sci. & Technol., Tai-Yuan Univ. of Sci. & Technol., Taiyuan, China
Abstract :
The traditional outlier mining methods are affected by man-made factors and mined outliers can not be analyzed further. In this paper, an outlier mining algorithm based on characteristic attribute subspace is presented. Firstly, the concept of attribute entropy is introduced to calculate corresponding attribute abnormal degree, characteristic attribute subspace and attribute weight. Secondly, subspace outlier factor is computed, and then outliers are found. The method does not depend on beforehand parameters or thresholds and can explain the meaning of the outliers clearly. In the end, experimental results validate the feasibility and effectiveness of the algorithm by utilizing UCI and high-dimensional star spectrum data.
Keywords :
data mining; entropy; attribute abnormal degree; attribute entropy; attribute weight; characteristic attribute subspace; man-made factor; outlier mining algorithm; subspace outlier factor; Accuracy; Seals; Attribute entropy; Characteristic attribute; Outlier; Subspace;
Conference_Titel :
Bio-Inspired Computing: Theories and Applications (BIC-TA), 2010 IEEE Fifth International Conference on
Conference_Location :
Changsha
Print_ISBN :
978-1-4244-6437-1
DOI :
10.1109/BICTA.2010.5645348