• DocumentCode
    2785552
  • Title

    Cleaning Training-Datasets with Noise-Aware Algorithms

  • Author

    Escalante, H. Jair

  • Author_Institution
    Dept. of Comput. Sci., Instituto Nacional de Astrofisica Optica y Electronica, Puebla
  • fYear
    2006
  • fDate
    Sept. 2006
  • Firstpage
    151
  • Lastpage
    158
  • Abstract
    We introduce a novel learning algorithm for noise elimination. Our algorithm is based on the re-measurement idea for the correction of erroneous observations and is able to discriminate between noisy and noiseless observations by using kernel methods. We apply our noise-aware algorithms to several domains including: astronomy, face recognition and ten machine learning benchmark datasets. Experimental results adding noise and useful anomalies to the data show that our algorithm improves data quality, without having to eliminate any observation from the original dataset
  • Keywords
    data integrity; learning (artificial intelligence); astronomy; data quality; face recognition; kernel methods; learning algorithm; machine learning benchmark datasets; noise elimination; noise-aware algorithms; training dataset cleaning; Cleaning; Computer science; Error correction; Face recognition; Humans; Investments; Kernel; Machine learning; Machine learning algorithms; Optical noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science, 2006. ENC '06. Seventh Mexican International Conference on
  • Conference_Location
    San Luis Potosi
  • ISSN
    1550-4069
  • Print_ISBN
    0-7695-2666-7
  • Type

    conf

  • DOI
    10.1109/ENC.2006.7
  • Filename
    4020874