DocumentCode
408318
Title
A new reparation method for incomplete data in the context of supervised learning
Author
Magnani, Matteo ; Montesi, Danilo
Author_Institution
Dept. of Comput. Sci., Bologna Univ., Italy
Volume
1
fYear
2004
fDate
5-7 April 2004
Firstpage
471
Abstract
Real-world data is often incomplete. There exist many statistical methods to deal with missing items. However, they assume data distributions which are difficult to justify in the context of supervised learning. In this paper we propose a new method of repairing incomplete data. This technique is a variation of a general strategy, here called local imputation. It repairs incomplete records, only when this is reasonable. It is able to identify wrong tuples. It is more general than other similar methods, because of a parametric similarity function. Finally, it also works with noisy data sets.
Keywords
data handling; data integrity; data mining; learning (artificial intelligence); statistical analysis; data distributions; incomplete data repairing; incomplete records repairing; local imputation; noisy data sets; real-world data; reparation method; similarity function; statistical methods; supervised learning; wrong tuple identification; Computer science; Data analysis; Databases; Filling; Informatics; Lips; Mathematics; Statistical analysis; Supervised learning; Technological innovation;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN
0-7695-2108-8
Type
conf
DOI
10.1109/ITCC.2004.1286501
Filename
1286501
Link To Document