• DocumentCode
    1787778
  • Title

    Diversification of web search results using post-retrieval clustering

  • Author

    Kumar, Sudhakar ; Jain, S.K. ; Sharma, R.M.

  • Author_Institution
    Dept. of Comput. Eng., Nat. Inst. of Technol., Kurukshetra, India
  • fYear
    2014
  • fDate
    26-28 Sept. 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Diversification of results in web search engines is an active research area. Information retrieval techniques focus mainly on the relevance of the retrieved documents, but they often fail to satisfy every user. In this work, we present coverage-based diversification using post-retrieval clustering. We model clusters for a query based on features of the web pages, so that web pages with similar features fall in one cluster and web pages with dissimilar features fall in different clusters. A query retrieves a relevant and diverse result set if its results cover as many clusters as possible.
  • Keywords
    information retrieval; search engines; Web pages; Web search diversification; Web search engine; coverage based diversification; post-retrieval clustering; Algorithm design and analysis; Clustering algorithms; Feature extraction; Vectors; Web search; Clustering; Cosine Similarity; Relevance; Result diversification
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Communication Technology (ICCCT), 2014 International Conference on
  • Conference_Location
    Allahabad
  • Print_ISBN
    978-1-4799-6757-5
  • Type

    conf

  • DOI
    10.1109/ICCCT.2014.7001460
  • Filename
    7001460
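
The abstract's idea (cluster the retrieved pages by feature similarity, then pick results that cover as many clusters as possible) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: pages are represented as bag-of-words term counts, clusters are formed by a simple leader-style pass with a cosine-similarity threshold, and results are selected round-robin across clusters in the original relevance order. The names `cosine`, `cluster`, `diversify`, and `threshold` are hypothetical.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster(vectors, threshold=0.5):
    """Leader-style clustering (an assumed stand-in for the paper's method).

    Each result joins the first cluster whose leader (first member) is at
    least `threshold` similar; otherwise it starts a new cluster.
    """
    clusters = []  # lists of result indices, each kept in relevance order
    for i, vec in enumerate(vectors):
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def diversify(ranked_docs, k, threshold=0.5):
    """Return indices of up to k results, covering clusters round-robin.

    `ranked_docs` is a list of page texts already sorted by relevance.
    """
    vectors = [Counter(doc.lower().split()) for doc in ranked_docs]
    clusters = cluster(vectors, threshold)
    picked, depth = [], 0
    # Take the best remaining result from every cluster before going deeper.
    while len(picked) < k and any(depth < len(c) for c in clusters):
        for members in clusters:
            if depth < len(members) and len(picked) < k:
                picked.append(members[depth])
        depth += 1
    return picked
```

For an ambiguous query such as "jaguar", calling `diversify` with `k=3` on a relevance-ordered list dominated by car pages would return one page per cluster (car, animal, operating system) before any second car page, which is the coverage behaviour the abstract describes.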