• Title of article

    Query-level loss functions for information retrieval

  • Author/Authors

    Tao Qin، نويسنده , , Xu-Dong Zhang، نويسنده , , Ming-Feng Tsai، نويسنده , , De-Sheng Wang، نويسنده , , Tie-Yan Liu، نويسنده , , Hang Li، نويسنده ,

  • Issue Information
    دوماهنامه با شماره پیاپی سال 2008
  • Pages
    18
  • From page
    838
  • To page
    855
  • Abstract
    Many machine learning technologies such as support vector machines, boosting, and neural networks have been applied to the ranking problem in information retrieval. However, since originally the methods were not developed for this task, their loss functions do not directly link to the criteria used in the evaluation of ranking. Specifically, the loss functions are defined on the level of documents or document pairs, in contrast to the fact that the evaluation criteria are defined on the level of queries. Therefore, minimizing the loss functions does not necessarily imply enhancing ranking performances. To solve this problem, we propose using query-level loss functions in learning of ranking functions. We discuss the basic properties that a query-level loss function should have and propose a query-level loss function based on the cosine similarity between a ranking list and the corresponding ground truth. We further design a coordinate descent algorithm, referred to as RankCosine, which utilizes the proposed loss function to create a generalized additive ranking model. We also discuss whether the loss functions of existing ranking algorithms can be extended to query-level. Experimental results on the datasets of TREC web track, OHSUMED, and a commercial web search engine show that with the use of the proposed query-level loss function we can significantly improve ranking accuracies. Furthermore, we found that it is difficult to extend the document-level loss functions to query-level loss functions.
  • Keywords
    Query-level loss function , Learning to Rank , information retrieval , RankCosine
  • Journal title
    Information Processing and Management
  • Serial Year
    2008
  • Journal title
    Information Processing and Management
  • Record number

    1228770