• DocumentCode
    1909596
  • Title

    A Clustering Approach to Improving Pseudo-Relevance Feedback: Improving Retrieval Effetiveness by Removing Noisy Documents

  • Author

    Changchun Li ; Jun-yi Wang

  • Author_Institution
    Coll. of Comput. Sci., Inner Mongolia Univ., Huhhot, China
  • fYear
    2012
  • fDate
    14-16 Dec. 2012
  • Firstpage
    35
  • Lastpage
    38
  • Abstract
    Pseudo relevance feedback is an effective technique for improving retrieval results, which assumes a small number of top-ranked documents in the initial retrieval results are relevant and selects from these documents related terms to the query to improve the query representation through query expansion. However, these documents are often a mixture of relevant and irrelevant documents. The relevance feedback is quite effective and performs significantly better than pseudo-relevance feedback, which needs the user explicitly provides information on relevant documents to a query. This paper makes a case for the use of query-specific density clustering in IR on the grounds of improved retrieval effectiveness in a fully automatic manner and without relevance information provided by human and the experimental results show that significant improvements can be obtained on several collections when our new model FWN (Feedback Without Noise) is used.
  • Keywords
    document handling; pattern clustering; query processing; relevance feedback; FWN model; clustering approach; feedback without noise model; noisy document removal; pseudorelevance feedback; query-specific density clustering; retrieval effectiveness; Density Clustering; Information Retrieval; Language Model; Query Expansion; relevance feedback;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Engineering (ISISE), 2012 International Symposium on
  • Conference_Location
    Shanghai
  • ISSN
    2160-1283
  • Print_ISBN
    978-1-4673-5680-0
  • Type

    conf

  • DOI
    10.1109/ISISE.2012.17
  • Filename
    6495293