DocumentCode
2785552
Title
Cleaning Training-Datasets with Noise-Aware Algorithms
Author
Escalante, H. Jair
Author_Institution
Dept. of Comput. Sci., Instituto Nacional de Astrofisica Optica y Electronica, Puebla
fYear
2006
fDate
Sept. 2006
Firstpage
151
Lastpage
158
Abstract
We introduce a novel learning algorithm for noise elimination. Our algorithm is based on the re-measurement idea for the correction of erroneous observations and is able to discriminate between noisy and noiseless observations by using kernel methods. We apply our noise-aware algorithms to several domains including: astronomy, face recognition and ten machine learning benchmark datasets. Experimental results adding noise and useful anomalies to the data show that our algorithm improves data quality, without having to eliminate any observation from the original dataset
Keywords
data integrity; learning (artificial intelligence); astronomy; data quality; face recognition; kernel methods; learning algorithm; machine learning benchmark datasets; noise elimination; noise-aware algorithms; training dataset cleaning; Cleaning; Computer science; Error correction; Face recognition; Humans; Investments; Kernel; Machine learning; Machine learning algorithms; Optical noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science, 2006. ENC '06. Seventh Mexican International Conference on
Conference_Location
San Luis Potosi
ISSN
1550-4069
Print_ISBN
0-7695-2666-7
Type
conf
DOI
10.1109/ENC.2006.7
Filename
4020874
Link To Document