• DocumentCode
    2771083
  • Title

    Conditional Models for Non-smooth Ranking Loss Functions

  • Author

    Dubey, Avinava ; Machchhar, Jinesh ; Bhattacharyya, Chiranjib ; Chakrabarti, Soumen

  • fYear
    2009
  • fDate
    6-9 Dec. 2009
  • Firstpage
    129
  • Lastpage
    138
  • Abstract
    Learning to rank is an important area at the interface of machine learning, information retrieval and Web search. The central challenge in optimizing various measures of ranking loss is that the objectives tend to be non-convex and discontinuous. To make such functions amenable to gradient based optimization procedures one needs to design clever bounds. In recent years, boosting, neural networks, support vector machines, and many other techniques have been applied. However, there is little work on directly modeling a conditional probability Pr(y|xq) where y is a permutation of the documents to be ranked and xq represents their feature vectors with respect to a query q. A major reason is that the space of y is huge: n! if n documents must be ranked. We first propose an intuitive and appealing expected loss minimization objective, and give an efficient shortcut to evaluate it despite the huge space of ys. Unfortunately, the optimization is non-convex, so we propose a convex approximation. We give a new, efficient Monte Carlo sampling method to compute the objective and gradient of this approximation, which can then be used in a quasi-Newton optimizer like LBFGS. Extensive experiments with the widely-used LETOR dataset show large ranking accuracy improvements beyond recent and competitive algorithms.
  • Keywords
    Monte Carlo methods; convex programming; information retrieval; learning (artificial intelligence); neural nets; support vector machines; LETOR dataset; Monte Carlo sampling method; Web search; boosting; conditional probability; convex approximation; information retrieval; machine learning; neural networks; non-convex optimization; nonsmooth ranking loss functions; quasi-Newton optimizer; support vector machines; Boosting; Design optimization; Information retrieval; Loss measurement; Machine learning; Monte Carlo methods; Neural networks; Optimization methods; Support vector machines; Web search; Conditional Models; Learning to Rank; Monte Carlo Sampling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-5242-2
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2009.49
  • Filename
    5360238