DocumentCode :
2555040
Title :
Research on transform of outliers based on density
Author :
Xin, Ge ; Enjie, Ding
Author_Institution :
Sch. of Comput. Sci. & Technol., China Univ. of Min. & Technol., Xuzhou
fYear :
2008
fDate :
2-4 July 2008
Firstpage :
790
Lastpage :
793
Abstract :
This paper proposed a data transform method based on error-adjusted density of micro-datasets, so as to distinguish the characteristics of outliers efficiently and improve the accuracy of prediction models. It divided the large multi-dimensional data sets into many grid cells, and in each cell assigned each data point to its closest micro-dataset using a nearest neighbor algorithm, data points were represented by calculating the error-adjusted density estimation in each micro-dataset. Thereby, the processed data could embody the information of the area which they belonged to and show the data variation characteristics rightly.
Keywords :
data mining; error statistics; estimation theory; pattern classification; very large databases; data mining; data transform method; grid cell; large multidimensional data set; microdataset error-adjusted density estimation; nearest neighbor classification algorithm; outlier transform; prediction model; Accuracy; Estimation error; Nearest neighbor searches; Predictive models; Data Transform; Density Estimation; Micro-Dataset; Outlier;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control and Decision Conference, 2008. CCDC 2008. Chinese
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-1733-9
Electronic_ISBN :
978-1-4244-1734-6
Type :
conf
DOI :
10.1109/CCDC.2008.4597421
Filename :
4597421
Link To Document :
بازگشت