• DocumentCode
    408318
  • Title

    A new reparation method for incomplete data in the context of supervised learning

  • Author

    Magnani, Matteo ; Montesi, Danilo

  • Author_Institution
    Dept. of Comput. Sci., Bologna Univ., Italy
  • Volume
    1
  • fYear
    2004
  • fDate
    5-7 April 2004
  • Firstpage
    471
  • Abstract
    Real-world data is often incomplete. There exist many statistical methods to deal with missing items. However, they assume data distributions which are difficult to justify in the context of supervised learning. In this paper we propose a new method of repairing incomplete data. This technique is a variation of a general strategy, here called local imputation. It repairs incomplete records, only when this is reasonable. It is able to identify wrong tuples. It is more general than other similar methods, because of a parametric similarity function. Finally, it also works with noisy data sets.
  • Keywords
    data handling; data integrity; data mining; learning (artificial intelligence); statistical analysis; data distributions; incomplete data repairing; incomplete records repairing; local imputation; noisy data sets; real-world data; reparation method; similarity function; statistical methods; supervised learning; wrong tuple identification; Computer science; Data analysis; Databases; Filling; Informatics; Lips; Mathematics; Statistical analysis; Supervised learning; Technological innovation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
  • Print_ISBN
    0-7695-2108-8
  • Type

    conf

  • DOI
    10.1109/ITCC.2004.1286501
  • Filename
    1286501