• DocumentCode
    2727988
  • Title

    Discovering Subsumption Hierarchies of Ontology Concepts from Text Corpora

  • Author

    Zavitsanos, E. ; Paliouras, G. ; Vouros, G.A.

  • Author_Institution
    Inst. of Informatics & Telecommun., Aghia Paraskevi
  • fYear
    2007
  • fDate
    2-5 Nov. 2007
  • Firstpage
    402
  • Lastpage
    408
  • Abstract
    This paper proposes a method for learning ontologies given a corpus of text documents. The method identifies concepts in documents and organizes them into a subsumption hierarchy, without presupposing the existence of a seed ontology. The method uncovers latent topics in terms of which document text is being generated. These topics form the concepts of the new ontology. This is done in a language neutral way, using probabilistic space reduction techniques over the original term space of the corpus. Given multiple sets of concepts (latent topics) being discovered, the proposed method constructs a subsumption hierarchy by performing conditional independence tests among pairs of latent topics, given a third one. The paper provides experimental results over the GENIA corpus from the domain of biomedicine.
  • Keywords
    data mining; learning (artificial intelligence); ontologies (artificial intelligence); text analysis; latent topics; ontologies learning; ontology concepts; probabilistic space reduction; seed ontology; subsumption hierarchies dicovery; text corpora; text documents; Biomedical engineering; Humans; Informatics; Linear discriminant analysis; Machine learning; Ontologies; Performance evaluation; Shape; Systems engineering and theory; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence, IEEE/WIC/ACM International Conference on
  • Conference_Location
    Fremont, CA
  • Print_ISBN
    978-0-7695-3026-0
  • Type

    conf

  • DOI
    10.1109/WI.2007.55
  • Filename
    4427123