• DocumentCode
    2830948
  • Title

    Proximity Window Context Method for Term Extraction in Ontology Learning from Text

  • Author

    Abramowicz, Witold ; Wisniewski, Marek

  • Author_Institution
    Poznan Univ. of Econ., Poznan
  • fYear
    2008
  • fDate
    1-5 Sept. 2008
  • Firstpage
    215
  • Lastpage
    219
  • Abstract
    The ontology learning from text cycle consists of the consecutive phases of term, synonym, concept, taxonomy and relation extraction. The paper touches the problems of a low efficiency in the current term extraction methods which are handled by a combination of statistic (frequency-based) and linguistic approaches. We present a novel method to extract terms that uses only shallow linguistic information. It is proposed to explore a different set of linguistic layers and support a classic POS n-gram model with additional context information based on proximity window features. The method is evaluated on two substantially different corpora to produce better results than the classic measures, including standard n-gram models and frequency-based approaches.
  • Keywords
    computational linguistics; learning (artificial intelligence); ontologies (artificial intelligence); text analysis; classic POS n-gram model; linguistic approach; ontology learning; proximity window context method; statistic approach; term extraction; text cycle; Context modeling; Data mining; Databases; Expert systems; Frequency measurement; Information analysis; Ontologies; Phase measurement; Statistics; Taxonomy; Ontology learning; POS n-gram model; term extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Application, 2008. DEXA '08. 19th International Workshop on
  • Conference_Location
    Turin
  • ISSN
    1529-4188
  • Print_ISBN
    978-0-7695-3299-8
  • Type

    conf

  • DOI
    10.1109/DEXA.2008.133
  • Filename
    4624718