DocumentCode
75092
Title
Overcoming Asymmetry in Entity Graphs
Author
Taesung Lee ; Young-rok Cha ; Seung-won Hwang
Author_Institution
Dept. of Comput. Sci. & Eng., POSTECH, Pohang, South Korea
Volume
26
Issue
12
fYear
2014
fDate
Dec. 1 2014
Firstpage
3051
Lastpage
3063
Abstract
This paper studies the problem of mining named entity translations by aligning comparable corpora. Current state-of-the-art approaches mine a translation pair by aligning an entity graph in one language to another based on node similarity or propagated similarity of related entities. However, they, building on the assumption of “symmetry”, quickly deteriorate on “weakly” comparable corpora with some asymmetry. In this paper, we pursue two directions for overcoming relation and entity asymmetry respectively. The first approach starts from weakly comparable corpora (for high recall) then ensures precision by selective propagation only to entities of symmetric relations. The second approach starts from parallel corpora (for high precision) then enhances recall by extending the translation matrix based on node similarity and contextual similarity. Our experimental results on English-Chinese corpora show that both approaches are effective and complementary. Our combined approach outperforms the best-performing baseline in terms of F1-score by up to 0.28.
Keywords
data mining; entity-relationship modelling; graph theory; knowledge engineering; English-Chinese corpora; F1-score; contextual similarity; entity graphs; knowledge engineering methodologies; node similarity; parallel corpora; translation matrix; Context modeling; Electronic publishing; Graph theory; Internet; Semantics; Knowledge modeling; entity translation; knowledge engineering methodologies;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2014.2316799
Filename
6787004
Link To Document