Title :
Reducing semantic drift in bootstrapping for entity relation extraction
Author :
Chen Sijia ; Li Yan ; Chen Guang
Author_Institution :
Sch. of Inf. & Commun. Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
This paper presents a novel bootstrapping algorithm for entity relation extraction. Shortest dependency patterns connecting entity pairs in sentences are captured initially and in turn applied to extract new binary relationships. The patterns are evaluated through correlation detection. In addition, we effectively prevent semantic drift by co-training with trigger words. Experiments for slot filling on the Knowledge Base Population (KBP) newspaper corpora show that our enhanced bootstrapping system achieves an 11% F1-score improvement over traditional bootstrapping algorithm.
Keywords :
entity-relationship modelling; learning (artificial intelligence); semantic networks; text analysis; F1-score improvement; KBP newspaper corpora; binary relationships; bootstrapping algorithm; bootstrapping system; correlation detection; entity pairs; entity relation extraction; knowledge base population newspaper corpora; semantic drift; sentences; shortest dependency patterns; slot filling; trigger words cotraining; Context; Correlation; Data mining; Information retrieval; Natural language processing; Semantics; Training; bootstrapping; dependency pattern; relation extraction; semantic drift; trigger word;
Conference_Titel :
Mechatronic Sciences, Electric Engineering and Computer (MEC), Proceedings 2013 International Conference on
Conference_Location :
Shengyang
Print_ISBN :
978-1-4799-2564-3
DOI :
10.1109/MEC.2013.6885371