مرکز منطقه ای اطلاع رساني علوم و فناوري - OutRank: ranking outliers in high dimensional data

DocumentCode :

3335101

Title :

OutRank: ranking outliers in high dimensional data

Author :

Müller, Emmanuel ; Assent, Ira ; Steinhausen, Uwe ; Seidl, Thomas

Author_Institution :

Data Manage. & data exploration group, RWTH Aachen Univ., Aachen

fYear :

2008

fDate :

7-12 April 2008

Firstpage :

600

Lastpage :

603

Abstract :

Outlier detection is an important data mining task for consistency checks, fraud detection, etc. Binary decision making on whether or not an object is an outlier is not appropriate in many applications and moreover hard to parametrize. Thus, recently, methods for outlier ranking have been proposed. Determining the degree of deviation, they do not require setting a decision boundary between outliers and the remaining data. High dimensional and heterogeneous (continuous and categorical attributes) data, however, pose a problem for most outlier ranking algorithms. In this work, we propose our OutRank approach for ranking outliers in heterogeneous high dimensional data. We introduce a consistent model for different attribute types. Our novel scoring functions transform the analyzed structure of the data to a meaningful ranking. Promising results in preliminary experiments show the potential for successful outlier ranking in high dimensional data.

Keywords :

data mining; OutRank; binary decision making; consistency checks; data mining; deviation degree; fraud detection; heterogeneous data; high dimensional data; outlier detection; outlier ranking; scoring functions; Clustering algorithms; Data mining; Databases; Decision making; Density measurement; Disaster management; Object detection; Principal component analysis; Surveillance;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on

Conference_Location :

Cancun

Print_ISBN :

978-1-4244-2161-9

Electronic_ISBN :

978-1-4244-2162-6

Type :

conf

DOI :

10.1109/ICDEW.2008.4498387

Filename :

4498387

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3335101