• DocumentCode
    108905
  • Title
    Learning the Gain Values and Discount Factors of Discounted Cumulative Gains
  • Author
    Ke Zhou ; Hongyuan Zha ; Yi Chang ; Gui-Rong Xue
  • Author_Institution
    Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    26
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    391
  • Lastpage
    404
  • Abstract
    An evaluation metric is an essential and integral part of a ranking system. Several evaluation metrics have been proposed in information retrieval and web search; among them, Discounted Cumulative Gain (DCG) has emerged as one that is widely adopted for evaluating the performance of ranking functions used in web search. However, the two sets of parameters used in DCG, the gain values and the discount factors, are usually determined in a rather ad hoc way, and their impact has not been carefully analyzed. In this paper, we first show that DCG is generally not coherent, i.e., the outcome of comparing ranking functions using DCG depends heavily on the particular gain values and discount factors used. We then propose a novel methodology that can learn the gain values and discount factors from user preferences over rankings, modeled as a special case of learning linear utility functions. We also discuss how to extend our methods to handle tied preference pairs and how to use active learning to reduce the preference-labeling effort. Numerical simulations illustrate the effectiveness of the proposed methods. Experiments are also conducted on a side-by-side comparison data set from a commercial search engine to validate the proposed methods on real-world data.
  • Keywords
    learning (artificial intelligence); search engines; DCG; Web search; commercial search engine; discount factors; discounted cumulative gain; evaluation metric; gain values; information retrieval; linear utility functions learning; numerical simulations; preference labeling; ranking functions; ranking system; side-by-side comparison data; user preferences; Measurement; Optimization; Production; Search engines; Vectors; Web search; Discounted cumulative gains; evaluation metric; machine learning; user preference; utility function;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1041-4347
  • Type
    jour
  • DOI
    10.1109/TKDE.2012.252
  • Filename
    6399471
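
The abstract's point that DCG is "generally not coherent" can be illustrated with a small sketch. It assumes the common form DCG = sum over positions k of gain(label_k) * discount(k), with the widely used discount 1/log2(k+1). The two gain functions and the relevance labels below are illustrative choices only and are not taken from the paper. With exponential gains ranking A scores higher; with linear gains ranking B does.

```python
import math

def dcg(labels, gain, discount):
    """DCG of a ranked list of relevance labels (position 1 is the top result)."""
    return sum(gain(label) * discount(k) for k, label in enumerate(labels, start=1))

def log_discount(k):
    # Common choice of discount factor: 1 / log2(k + 1).
    return 1.0 / math.log2(k + 1)

def exp_gain(label):
    # Exponential gain values, 2^label - 1 (illustrative assumption).
    return 2 ** label - 1

def linear_gain(label):
    # Linear gain values (illustrative assumption).
    return float(label)

# Hypothetical graded relevance labels for the top three results of two rankings.
ranking_a = [3, 0, 0]   # one highly relevant document, then nothing useful
ranking_b = [2, 2, 2]   # three moderately relevant documents

a_exp = dcg(ranking_a, exp_gain, log_discount)
b_exp = dcg(ranking_b, exp_gain, log_discount)
a_lin = dcg(ranking_a, linear_gain, log_discount)
b_lin = dcg(ranking_b, linear_gain, log_discount)

print(f"exponential gains: A={a_exp:.3f}  B={b_exp:.3f}  A preferred: {a_exp > b_exp}")
print(f"linear gains:      A={a_lin:.3f}  B={b_lin:.3f}  A preferred: {a_lin > b_lin}")
```

Because DCG is linear in the gain values once the discounts are fixed (and linear in the discounts once the gains are fixed), a preference between two rankings becomes a linear constraint on those parameters, which is consistent with the abstract's framing of the problem as learning a linear utility function.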