• DocumentCode
    2347461
  • Title

    A linguistic light approach to multilingualism in lexical layers for ontologies

  • Author

    Troussov, Alexander ; Judge, John ; Sogrin, Mikhail ; Akrout, Amine ; Davis, Brian ; Handschuh, Siegfried

  • fYear
    2008
  • fDate
    20-22 Oct. 2008
  • Firstpage
    375
  • Lastpage
    379
  • Abstract
    Semantic Web ontologies are being increasingly used in modern text analytics applications and ontology-based information extraction (OBIE) as a means to provide a semantic backbone either for modelling the internal conceptual data structures of the text analytics (TA) engine or to model the knowledge base to drive the analysis of unstructured information in raw text and subsequent Knowledge acquisition and population. creating and targeting language resources (LR)s from a TA to an ontology can be time consuming and costly.The authors describe a user-friendly method for ontology engineers to augment an ontologies with a lexical layer which provides a flexible framework to identify term mentions of ontology concepts in raw text. In this paper we explore multilinguality in these lexical layers using the same framework. We discuss a number of potential issues for the ldquolinguistic lightrdquo lexical extensions for ontologies (LEON) approach when looking at languages more morphologically rich and which have more complex linguistic constraints than English. We show how the LEON approach can cope with these phenomena once the morphological normaliser used in the lexical analysis process is able to generalise sufficiently well for the language concerned.
  • Keywords
    information retrieval; knowledge acquisition; linguistics; ontologies (artificial intelligence); semantic Web; internal conceptual data structures; language resources; lexical layers; ontology-based information extraction; semantic Web ontologies; subsequent knowledge acquisition; text analytics applications; Data engineering; Data mining; Data structures; Engines; Information analysis; Knowledge acquisition; Natural languages; Ontologies; Semantic Web; Spine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
  • Conference_Location
    Wisia
  • Print_ISBN
    978-83-60810-14-9
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2008.4747268
  • Filename
    4747268