• DocumentCode
    2456637
  • Title

    Improving Web Page Retrieval Using Search Context from Clicked Domain Names

  • Author

    Li, Rongmei

  • Author_Institution
    Sch. of Electr., Math., & Comput. Sci., Univ. of Twente, Enschede, Netherlands
  • fYear
    2009
  • fDate
    Aug. 31 2009-Sept. 4 2009
  • Firstpage
    393
  • Lastpage
    397
  • Abstract
    Search context is a crucial factor that helps to understand a user's information need in ad-hoc Web page retrieval. A query log of a search engine contains rich information on issued queries and their corresponding clicked Web pages. The clicked data implies its relevance to the query and can be used to define the topical context. However, the log is usually not completely available due to privacy concerns. In this paper, we derive clicked pages from clicked domains and use the surrounding query context to enhance retrieval performance. One strategy is to promote clicked pages directly in the initial retrieval result. Another strategy is to expand the original query using selected terms from the clicked pages. Our experimental results on the TREC GOV2 data and a query log of a major search engine show that both strategies can boost retrieval performance compared to the standard language model and the pseudo relevance feedback (PRF) model. Their good performance on early precision allows us to apply PRF further to obtain even more accurate results that are comparable to the performance of true relevance feedback.
  • Keywords
    Internet; query formulation; relevance feedback; search engines; TREC GOV2 data; ad-hoc Web page retrieval; clicked Web pages; clicked domain names; pseudo relevance feedback model; query log; search context; search engine; standard language model; user information; Context modeling; Data mining; Degradation; Feedback; Information retrieval; Intrusion detection; Privacy; Search engines; Uniform resource locators; Web pages; Language Models; Parsimonious Language Models; Pseudo Relevance Feedback; Query Expansion; Web Information Retrieval;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Database and Expert Systems Application, 2009. DEXA '09. 20th International Workshop on
  • Conference_Location
    Linz
  • ISSN
    1529-4188
  • Print_ISBN
    978-0-7695-3763-4
  • Type

    conf

  • DOI
    10.1109/DEXA.2009.59
  • Filename
    5337090