Title of article :
Query-level loss functions for information retrieval
Author/Authors :
Tao Qin، نويسنده , , Xu-Dong Zhang، نويسنده , , Ming-Feng Tsai، نويسنده , , De-Sheng Wang، نويسنده , , Tie-Yan Liu، نويسنده , , Hang Li، نويسنده ,
Issue Information :
دوماهنامه با شماره پیاپی سال 2008
Abstract :
Many machine learning technologies such as support vector machines, boosting, and neural networks have been applied to the ranking problem in information retrieval. However, since originally the methods were not developed for this task, their loss functions do not directly link to the criteria used in the evaluation of ranking. Specifically, the loss functions are defined on the level of documents or document pairs, in contrast to the fact that the evaluation criteria are defined on the level of queries. Therefore, minimizing the loss functions does not necessarily imply enhancing ranking performances. To solve this problem, we propose using query-level loss functions in learning of ranking functions. We discuss the basic properties that a query-level loss function should have and propose a query-level loss function based on the cosine similarity between a ranking list and the corresponding ground truth. We further design a coordinate descent algorithm, referred to as RankCosine, which utilizes the proposed loss function to create a generalized additive ranking model. We also discuss whether the loss functions of existing ranking algorithms can be extended to query-level. Experimental results on the datasets of TREC web track, OHSUMED, and a commercial web search engine show that with the use of the proposed query-level loss function we can significantly improve ranking accuracies. Furthermore, we found that it is difficult to extend the document-level loss functions to query-level loss functions.
Keywords :
Query-level loss function , Learning to Rank , information retrieval , RankCosine
Journal title :
Information Processing and Management
Journal title :
Information Processing and Management