DocumentCode
3106947
Title
Object Identification with Constraints
Author
Rendle, Steffen ; Schmidt-Thieme, Lars
Author_Institution
Dept. of Comput. Sci., Freiburg Univ., Freiburg
fYear
2006
fDate
18-22 Dec. 2006
Firstpage
1026
Lastpage
1031
Abstract
Object identification aims at identifying different representations of the same object based on noisy attributes such as descriptions of the same product in different online shops or references to the same paper in different publications. Numerous solutions have been proposed for solving this task, almost all of them based on similarity functions of a pair of objects. Although today the similarity functions are learned from a set of labeled training data, the structural information given by the labeled data is not used. By formulating a generic model for object identification we show how almost any proposed identification model can easily be extended for satisfying structural constraints. Therefore we propose a model that uses structural information given as pairwise constraints to guide collective decisions about object identification in addition to a learned similarity measure. We show with empirical experiments on public and on real-life data that combining both structural information and attribute-based similarity enormously increases the overall performance for object identification tasks.
Keywords
object-oriented databases; pattern clustering; database; object identification; semi-supervised clustering; similarity functions; structural information; Computer science; Couplings; Data mining; Databases; Manufacturing; Merging; Object detection; Predictive models; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location
Hong Kong
ISSN
1550-4786
Print_ISBN
0-7695-2701-7
Type
conf
DOI
10.1109/ICDM.2006.117
Filename
4053147
Link To Document