• DocumentCode
    3164984
  • Title

    Fast Pairwise Query Selection for Large-Scale Active Learning to Rank

  • Author

    Buyue Qian ; Xiang Wang ; Jun Wang ; Hongfei Li ; Nan Cao ; Weifeng Zhi ; Davidson, Ian

  • Author_Institution
    IBM T. J. Watson Res., Yorktown Heights, NY, USA
  • fYear
    2013
  • fDate
    7-10 Dec. 2013
  • Firstpage
    607
  • Lastpage
    616
  • Abstract
    Pair wise learning to rank algorithms (such as Rank SVM) teach a machine how to rank objects given a collection of ordered object pairs. However, their accuracy is highly dependent on the abundance of training data. To address this limitation and reduce annotation efforts, the framework of active pair wise learning to rank was introduced recently. However, in such a framework the number of possible query pairs increases quadratic ally with the number of instances. In this work, we present the first scalable pair wise query selection method using a layered (two-step) hashing framework. The first step relevance hashing aims to retrieve the strongly relevant or highly ranked points, and the second step uncertainty hashing is used to nominate pairs whose ranking is uncertain. The proposed framework aims to efficiently reduce the search space of pair wise queries and can be used with any pair wise learning to rank algorithm with a linear ranking function. We evaluate our approach on large-scale real problems and show it has comparable performance to exhaustive search. The experimental results demonstrate the effectiveness of our approach, and validate the efficiency of hashing in accelerating the search of massive pair wise queries.
  • Keywords
    file organisation; learning (artificial intelligence); query processing; active pair wise learning; algorithm ranking; fast pairwise query selection; large-scale active learning; layered hashing framework; scalable pair wise query selection method; search space reduction; uncertainty hashing; Accuracy; Artificial neural networks; Databases; Search problems; Training data; Uncertainty; Vectors; Active Learning; Hashing; Learning to Rank;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2013 IEEE 13th International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2013.54
  • Filename
    6729545