• DocumentCode
    3105656
  • Title

    Entity Resolution with Markov Logic

  • Author

    Singla, Parag ; Domingos, Pedro

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Washington, Seattle, WA
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    572
  • Lastpage
    582
  • Abstract
    Entity resolution is the problem of determining which records in a database refer to the same entities, and is a crucial and expensive step in the data mining process. Interest in it has grown rapidly, and many approaches have been proposed. However, they tend to address only isolated aspects of the problem, and are often ad hoc. This paper proposes a well-founded, integrated solution to the entity resolution problem based on Markov logic. Markov logic combines first-order logic and probabilistic graphical models by attaching weights to first-order formulas, and viewing them as templates for features of Markov networks. We show how a number of previous approaches can be formulated and seamlessly combined in Markov logic, and how the resulting learning and inference problems can be solved efficiently. Experiments on two citation databases show the utility of this approach, and evaluate the contribution of the different components.
  • Keywords
    Markov processes; data mining; database management systems; entity-relationship modelling; formal logic; graph theory; inference mechanisms; learning (artificial intelligence); probability; Markov logic; Markov networks; citation databases; data mining; entity resolution; first-order logic; inference problems; learning problems; probabilistic graphical models; Computer science; Couplings; Data engineering; Data mining; Graphical models; Joining processes; Logistics; Markov random fields; Probabilistic logic; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2006. ICDM '06. Sixth International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2701-7
  • Type

    conf

  • DOI
    10.1109/ICDM.2006.65
  • Filename
    4053083