DocumentCode
1909596
Title
A Clustering Approach to Improving Pseudo-Relevance Feedback: Improving Retrieval Effetiveness by Removing Noisy Documents
Author
Changchun Li ; Jun-yi Wang
Author_Institution
Coll. of Comput. Sci., Inner Mongolia Univ., Huhhot, China
fYear
2012
fDate
14-16 Dec. 2012
Firstpage
35
Lastpage
38
Abstract
Pseudo relevance feedback is an effective technique for improving retrieval results, which assumes a small number of top-ranked documents in the initial retrieval results are relevant and selects from these documents related terms to the query to improve the query representation through query expansion. However, these documents are often a mixture of relevant and irrelevant documents. The relevance feedback is quite effective and performs significantly better than pseudo-relevance feedback, which needs the user explicitly provides information on relevant documents to a query. This paper makes a case for the use of query-specific density clustering in IR on the grounds of improved retrieval effectiveness in a fully automatic manner and without relevance information provided by human and the experimental results show that significant improvements can be obtained on several collections when our new model FWN (Feedback Without Noise) is used.
Keywords
document handling; pattern clustering; query processing; relevance feedback; FWN model; clustering approach; feedback without noise model; noisy document removal; pseudorelevance feedback; query-specific density clustering; retrieval effectiveness; Density Clustering; Information Retrieval; Language Model; Query Expansion; relevance feedback;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science and Engineering (ISISE), 2012 International Symposium on
Conference_Location
Shanghai
ISSN
2160-1283
Print_ISBN
978-1-4673-5680-0
Type
conf
DOI
10.1109/ISISE.2012.17
Filename
6495293
Link To Document