DocumentCode
660917
Title
Open Information Extraction via Contextual Sentence Decomposition
Author
Bast, Hannah ; Haussmann, Elmar
Author_Institution
Dept. of Comput. Sci., Univ. of Freiburg, Freiburg, Germany
fYear
2013
fDate
16-18 Sept. 2013
Firstpage
154
Lastpage
159
Abstract
We show how contextual sentence decomposition (CSD), a technique originally developed for high-precision semantic search, can be used for open information extraction (OIE). Intuitively, CSD decomposes a sentence into the parts that semantically "belong together". By identifying the (implicit or explicit) verb in each such part, we obtain facts like in OIE. We compare our system, called CSD-IE, to three state-of-the-art OIE systems: ReVerb, OLLIE, and ClausIE. We consider the following aspects: accuracy (does the extracted triple express a meaningful fact, which is also expressed in the original sentence), minimality (can the extracted triple be further decomposed into smaller meaningful triples), coverage (percentage of text contained in at least one extracted triple), and number of facts extracted. We show how CSD-IE clearly outperforms ReVerb and OLLIE in terms of coverage and recall, but at comparable accuracy and minimality, and how CSD-IE achieves precision and recall comparable to ClausIE, but at significantly better minimality.
Keywords
information retrieval; CSD technique; CSD-IE system; ClausIE system; OLLIE system; ReVerb system; accuracy aspect; contextual sentence decomposition technique; coverage aspect; explicit verb; high-precision semantic search; implicit verb; minimality aspect; open information extraction; recall aspect; Accuracy; Context; Data mining; Educational institutions; Information retrieval; Semantics; Thyristors; contextual sentence decomposition; open information extraction; semantic search;
fLanguage
English
Publisher
ieee
Conference_Titel
Semantic Computing (ICSC), 2013 IEEE Seventh International Conference on
Conference_Location
Irvine, CA
Type
conf
DOI
10.1109/ICSC.2013.36
Filename
6693511
Link To Document