Title of article :
Finding key knowledge attribute subspace of outliers in high-dimensional dataset
Author/Authors :
Huang, Biao and Yang, Peng
Issue Information :
Journal with serial number, year 2011
Pages :
6
From page :
10147
To page :
10152
Abstract :
Outlier detection has important applications in many fields where the data are high-dimensional. However, finding the intentional knowledge of outliers becomes inefficient, and even infeasible, in high-dimensional space. In this paper, we introduce the concept of rough sets and use it as the model of an outlier detection and analysis system to realize outlying reduction. Furthermore, by defining outlying partition similarity, we can mine outliers in the key knowledge attribute subspace rather than in the full-dimensional attribute space of the dataset. An effective method for finding the key knowledge attribute subspace is proposed. It first finds all outliers in the full attribute space and then calculates the KAS for the corresponding projection of each outlier. Finally, the key knowledge attribute subspace is identified by the value of the outlying partition similarity. Experimental results show that the method can be used efficiently to identify outliers in high-dimensional datasets.
Keywords :
Outlier detection, Attribute subspace, High-dimensional dataset, Data mining
Journal title :
Expert Systems with Applications
Serial Year :
2011
Record number :
2349850