• DocumentCode
    798139
  • Title
    Adaptive name matching in information integration
  • Author
    Bilenko, Mikhail; Mooney, Raymond; Cohen, William; Ravikumar, Pradeep; Fienberg, Stephen
  • Author_Institution
    Dept. of Comput. Sci., Texas Univ., Austin, TX, USA
  • Volume
    18
  • Issue
    5
  • fYear
    2003
  • Firstpage
    16
  • Lastpage
    23
  • Abstract
    Identifying approximately duplicate database records that refer to the same entity is essential for information integration. The authors compare and describe methods for combining and learning textual similarity measures for name matching.
  • Keywords
    Internet; distributed databases; learning (artificial intelligence); string matching; text analysis; Web pages; adaptive name matching; duplicate database records; heterogeneous information sources; information integration; machine learning; string similarity measures; textual similarity measures; Character recognition; Costs; Couplings; Data mining; Databases; Object detection; Optical character recognition software; Optical recording; Uncertainty
  • fLanguage
    English
  • Journal_Title
    Intelligent Systems, IEEE
  • Publisher
    IEEE
  • ISSN
    1541-1672
  • Type
    jour
  • DOI
    10.1109/MIS.2003.1234765
  • Filename
    1234765