DocumentCode
2830948
Title
Proximity Window Context Method for Term Extraction in Ontology Learning from Text
Author
Abramowicz, Witold ; Wisniewski, Marek
Author_Institution
Poznan Univ. of Econ., Poznan
fYear
2008
fDate
1-5 Sept. 2008
Firstpage
215
Lastpage
219
Abstract
The ontology learning from text cycle consists of the consecutive phases of term, synonym, concept, taxonomy and relation extraction. The paper touches the problems of a low efficiency in the current term extraction methods which are handled by a combination of statistic (frequency-based) and linguistic approaches. We present a novel method to extract terms that uses only shallow linguistic information. It is proposed to explore a different set of linguistic layers and support a classic POS n-gram model with additional context information based on proximity window features. The method is evaluated on two substantially different corpora to produce better results than the classic measures, including standard n-gram models and frequency-based approaches.
Keywords
computational linguistics; learning (artificial intelligence); ontologies (artificial intelligence); text analysis; classic POS n-gram model; linguistic approach; ontology learning; proximity window context method; statistic approach; term extraction; text cycle; Context modeling; Data mining; Databases; Expert systems; Frequency measurement; Information analysis; Ontologies; Phase measurement; Statistics; Taxonomy; Ontology learning; POS n-gram model; term extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Application, 2008. DEXA '08. 19th International Workshop on
Conference_Location
Turin
ISSN
1529-4188
Print_ISBN
978-0-7695-3299-8
Type
conf
DOI
10.1109/DEXA.2008.133
Filename
4624718
Link To Document