• DocumentCode
    3106947
  • Title

    Object Identification with Constraints

  • Author

    Rendle, Steffen ; Schmidt-Thieme, Lars

  • Author_Institution
    Dept. of Comput. Sci., Freiburg Univ., Freiburg
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    1026
  • Lastpage
    1031
  • Abstract
    Object identification aims at identifying different representations of the same object based on noisy attributes such as descriptions of the same product in different online shops or references to the same paper in different publications. Numerous solutions have been proposed for solving this task, almost all of them based on similarity functions of a pair of objects. Although today the similarity functions are learned from a set of labeled training data, the structural information given by the labeled data is not used. By formulating a generic model for object identification we show how almost any proposed identification model can easily be extended for satisfying structural constraints. Therefore we propose a model that uses structural information given as pairwise constraints to guide collective decisions about object identification in addition to a learned similarity measure. We show with empirical experiments on public and on real-life data that combining both structural information and attribute-based similarity enormously increases the overall performance for object identification tasks.
  • Keywords
    object-oriented databases; pattern clustering; database; object identification; semi-supervised clustering; similarity functions; structural information; Computer science; Couplings; Data mining; Databases; Manufacturing; Merging; Object detection; Predictive models; Testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2006. ICDM '06. Sixth International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2701-7
  • Type

    conf

  • DOI
    10.1109/ICDM.2006.117
  • Filename
    4053147