• DocumentCode
    1787404
  • Title

    Post-analysis of Keyword-Based Search Results Using Entity Mining, Linked Data, and Link Analysis at Query Time

  • Author

    Fafalios, Pavlos ; Tzitzikas, Yannis

  • Author_Institution
    Inst. of Comput. Sci., FORTH-ICS, Heraklion, Greece
  • fYear
    2014
  • fDate
    16-18 June 2014
  • Firstpage
    36
  • Lastpage
    43
  • Abstract
    The integration of the classical Web (of documents) with the emerging Web of Data is a challenging vision. In this paper we focus on an integration approach during searching which aims at enriching the responses of non-semantic search systems (e.g. professional search systems, web search engines) with semantic information, i.e. Linked Open Data (LOD), and exploiting the outcome for providing an overview of the search space and allowing the users (apart from restricting it) to explore the related LOD. We use named entities (e.g. persons, locations, etc.) as the "glue" for automatically connecting search hits with LOD. We consider a scenario where this entity-based integration is performed at query time with no human effort, and no a-priori indexing, which is beneficial in terms of configurability and freshness. To realize this scenario one has to tackle various challenges. One spiny issue is that the number of identified entities can be high, the same is true for the semantic information about these entities that can be fetched from the available LOD (i.e. their properties and associations with other entities). To this end, in this paper we propose a Link Analysis-based method which is used for (a) ranking (and thus selecting to show) the more important semantic information related to the search results, (b) deriving and showing top-K semantic graphs. In the sequel, we report the results of a survey regarding the marine domain with promising results, and comparative results that illustrate the effectiveness of the proposed (Page Rank-based) ranking scheme. Finally, we report experimental results regarding efficiency showing that the proposed functionality can be offered even at query time.
  • Keywords
    Internet; data mining; information analysis; query processing; LOD; Page Rank-based ranking scheme; Web of Data; entity mining; keyword-based search results; link analysis; linked open data; named entities; nonsemantic search systems; query time; search space; semantic information; top-K semantic graphs; Engines; Knowledge based systems; Resource description framework; Search problems; Semantics; Web pages; Web search; entity mining; link analysis; linked data; results post-analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing (ICSC), 2014 IEEE International Conference on
  • Conference_Location
    Newport Beach, CA
  • Print_ISBN
    978-1-4799-4002-8
  • Type

    conf

  • DOI
    10.1109/ICSC.2014.11
  • Filename
    6881999