• DocumentCode
    839946
  • Title

    Probabilistic Topic Models for Learning Terminological Ontologies

  • Author

    Wei Wang ; Barnaghi, Payam ; Bargiela, Andrzej

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Nottingham, Semanyih, Malaysia
  • Volume
    22
  • Issue
    7
  • fYear
    2010
  • fDate
    7/1/2010 12:00:00 AM
  • Firstpage
    1028
  • Lastpage
    1040
  • Abstract
    Probabilistic topic models were originally developed and utilized for document modeling and topic extraction in Information Retrieval. In this paper, we describe a new approach for automatic learning of terminological ontologies from text corpus based on such models. In our approach, topic models are used as efficient dimension reduction techniques, which are able to capture semantic relationships between word-topic and topic-document interpreted in terms of probability distributions. We propose two algorithms for learning terminological ontologies using the principle of topic relationship and exploiting information theory with the probabilistic topic models learned. Experiments with different model parameters were conducted and learned ontology statements were evaluated by the domain experts. We have also compared the results of our method with two existing concept hierarchy learning methods on the same data set. The study shows that our method outperforms other methods in terms of recall and precision measures. The precision level of the learned ontology is sufficient for it to be deployed for the purpose of browsing, navigation, and information search and retrieval in digital libraries.
  • Keywords
    digital libraries; document handling; information retrieval; knowledge acquisition; ontologies (artificial intelligence); statistical distributions; automatic learning; digital libraries; dimension reduction techniques; document modeling; information retrieval; information search; probabilistic topic models; probability distributions; terminological ontologies learning; text corpus; topic extraction; topic-document interpretation; Knowledge acquisition; ontology; ontology learning; probabilistic topic models.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2009.122
  • Filename
    4912206