DocumentCode :
2710999
Title :
Active Learning of Equivalence Relations by Minimizing the Expected Loss Using Constraint Inference
Author :
Rendle, Steffen ; Schmidt-Thieme, Lars
Author_Institution :
Machine Learning Lab., Inst. for Comput. Sci. Univ. of Hildesheim, Hildesheim
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
1001
Lastpage :
1006
Abstract :
Selecting promising queries is the key to effective active learning. In this paper, we investigate selection techniques for the task of learning an equivalence relation where the queries are about pairs of objects. As the target relation satisfies the axioms of transitivity, from one queried pair additional constraints can be inferred. We derive both the upper and lower bound on the number of queries needed to converge to the optimal solution. Besides restricting the set of possible solutions, constraints can be used as training data for learning a similarity measure. For selecting queries that result in a large number of meaningful constraints, we present an approximative optimal selection technique that greedily minimizes the expected loss in each round of active learning. This technique makes use of inference of expected constraints. Besides the theoretical results, an extensive evaluation for the application of record linkage shows empirically that the proposed selection method leads to both interesting and a high number of constraints.
Keywords :
learning (artificial intelligence); active learning; constraint inference; equivalence relations; queries; Computer science; Constraint theory; Couplings; Data mining; Machine learning; Merging; Sampling methods; Social network services; Training data; Uncertainty; Active Learning; Equivalence Relation; Record Linkage;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3502-9
Type :
conf
DOI :
10.1109/ICDM.2008.41
Filename :
4781215
Link To Document :
بازگشت