• DocumentCode
    3046211
  • Title

    Combining Named Entities with WordNet and Using Query-Oriented Spreading Activation for Semantic Text Search

  • Author

    Ngo, Vuong M. ; Cao, Tru H. ; Le, Tuan M V

  • Author_Institution
    Fac. of Comput. Sci. & Eng., Ho Chi Minh City Univ. of Technol., Ho Chi Minh City, Vietnam
  • fYear
    2010
  • fDate
    1-4 Nov. 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Purely keyword-based text search is not satisfactory because named entities and WordNet words are also important elements to define the content of a document or a query in which they occur. Named entities have ontological features, namely, their aliases, classes, and identifiers. Words in WordNet also have ontological features, namely, their synonyms, hypernyms, hyponyms, and senses. Those features of concepts may be hidden from their textual appearance. Besides, there are related concepts that do not appear in a query, but can bring out the meaning of the query if they are added. We propose an ontology-based generalized Vector Space Model to semantic text search. It exploits ontological features of named entities and WordNet words, and develops a query-oriented spreading activation algorithm to expand queries. In addition, it combines and utilizes advantages of different ontologies for semantic annotation and searching. Experiments on a benchmark dataset show that, in terms of the MAP measure, our model is 42.5% better than the purely keyword-based model, and 32.3% and 15.9% respectively better than the ones using only WordNet or named entities.
  • Keywords
    ontologies (artificial intelligence); query processing; text analysis; WordNet; generalized vector space model; hypernyms feature; hyponyms feature; keyword-based text search; named entity; ontology; query-oriented spreading activation; semantic annotation; semantic search; semantic text search; senses feature; synonyms feature; Cities and towns; Earthquakes; Ontologies; Search engines; Semantics; Testing; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2010 IEEE RIVF International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4244-8074-6
  • Type

    conf

  • DOI
    10.1109/RIVF.2010.5633401
  • Filename
    5633401