DocumentCode :
3106947
Title :
Object Identification with Constraints
Author :
Rendle, Steffen ; Schmidt-Thieme, Lars
Author_Institution :
Dept. of Comput. Sci., Freiburg Univ., Freiburg
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
1026
Lastpage :
1031
Abstract :
Object identification aims at identifying different representations of the same object based on noisy attributes such as descriptions of the same product in different online shops or references to the same paper in different publications. Numerous solutions have been proposed for solving this task, almost all of them based on similarity functions of a pair of objects. Although today the similarity functions are learned from a set of labeled training data, the structural information given by the labeled data is not used. By formulating a generic model for object identification we show how almost any proposed identification model can easily be extended for satisfying structural constraints. Therefore we propose a model that uses structural information given as pairwise constraints to guide collective decisions about object identification in addition to a learned similarity measure. We show with empirical experiments on public and on real-life data that combining both structural information and attribute-based similarity enormously increases the overall performance for object identification tasks.
Keywords :
object-oriented databases; pattern clustering; database; object identification; semi-supervised clustering; similarity functions; structural information; Computer science; Couplings; Data mining; Databases; Manufacturing; Merging; Object detection; Predictive models; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
ISSN :
1550-4786
Print_ISBN :
0-7695-2701-7
Type :
conf
DOI :
10.1109/ICDM.2006.117
Filename :
4053147
Link To Document :
بازگشت