مرکز منطقه ای اطلاع رساني علوم و فناوري - Finding key knowledge attribute subspace of outliers in high-dimensional dataset

Title of article :

Finding key knowledge attribute subspace of outliers in high-dimensional dataset

Author/Authors :

Huang، نويسنده , , Biao and Yang، نويسنده , , Peng، نويسنده ,

Issue Information :

روزنامه با شماره پیاپی سال 2011

Pages :

From page :

10147

To page :

10152

Abstract :

Outlier detection has important applications in many fields in which the data can contain high dimensions. However, finding the intentional knowledge of outliers will become inefficient and even infeasible in high dimensional space. In this paper, we introduced the concept of rough set and used it as the model of outlier detection and analysis system to realize outlying reduction. Furthermore, by defining outlying partition similarity, we can mine the outliers in the key knowledge attribute subspace rather than in the full dimensional attribute space of dataset. An effective method for finding the key knowledge attribute subspace was proposed. It first finds all outliers in the full attribute space and then, calculates KAS for corresponding projection of each outlier. Finally, the key knowledge attribute subspace can be identified by the value of outlying partition similarity. The experimental results show that our method can be efficiently used in high dimensional dataset to identify outlier.

Keywords :

outlier detection , Attribute subspace , High-dimensional dataset , DATA MINING

Journal title :

Expert Systems with Applications

Serial Year :

2011

Journal title :

Expert Systems with Applications

Record number :

2349850

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=2349850