Title of article
High performance query expansion using adaptive co-training
Author/Authors
Jimmy Xiangji Huang، نويسنده , , Jun Miao، نويسنده , , Ben He، نويسنده ,
Issue Information
دوماهنامه با شماره پیاپی سال 2013
Pages
13
From page
441
To page
453
Abstract
The quality of feedback documents is crucial to the effectiveness of query expansion (QE) in ad hoc retrieval. Recently, machine learning methods have been adopted to tackle this issue by training classifiers from feedback documents. However, the lack of proper training data has prevented these methods from selecting good feedback documents. In this paper, we propose a new method, called AdapCOT, which applies co-training in an adaptive manner to select feedback documents for boosting QE’s effectiveness. Co-training is an effective technique for classification over limited training data, which is particularly suitable for selecting feedback documents. The proposed AdapCOT method makes use of a small set of training documents, and labels the feedback documents according to their quality through an iterative process. Two exclusive sets of term-based features are selected to train the classifiers. Finally, QE is performed on the labeled positive documents. Our extensive experiments show that the proposed method improves QE’s effectiveness, and outperforms strong baselines on various standard TREC collections.
Keywords
co-training , Query expansion , relevance feedback
Journal title
Information Processing and Management
Serial Year
2013
Journal title
Information Processing and Management
Record number
1229368
Link To Document