• DocumentCode
    3604087
  • Title

    Active Learning from Relative Comparisons

  • Author

    Sicheng Xiong ; Yuanli Pei ; Rosales, Romer ; Fern, Xiaoli Z.

  • Author_Institution
    LinkedIn, Mountain View, CA, USA
  • Volume
    27
  • Issue
    12
  • fYear
    2015
  • Firstpage
    3166
  • Lastpage
    3175
  • Abstract
    This work focuses on active learning from relative comparison information. A relative comparison specifies, for a data triplet (xi, xj, xk), that instance xi is more similar to xj than to xk. Such constraints, when available, have been shown to be useful toward learning tasks such as defining appropriate distance metrics or finding good clustering solutions. In real-world applications, acquiring constraints often involves considerable human effort, as it requires the user to manually inspect the instances. This motivates us to study how to select and query the most useful relative comparisons to achieve effective learning with minimum user effort. Given an underlying class concept that is employed by the user to provide such constraints, we present an information-theoretic criterion that selects the triplet whose answer leads to the highest expected information gain about the classes of a set of examples. Directly applying the proposed criterion requires examining O(n3) triplets with n instances, which is prohibitive even for datasets of moderate size. We show that a randomized selection strategy can be used to reduce the selection pool from O(n3) to O(n) with minimal loss in efficiency, allowing us to scale up to considerably larger problems. Experiments show that the proposed method consistently outperforms baseline policies.
  • Keywords
    computational complexity; learning (artificial intelligence); query processing; O(n3) triplets; active learning; clustering process; data triplet; distance metrics; expected information gain; information-theoretic criterion; query processing; randomized selection strategy; relative comparison information; Clustering algorithms; Complexity theory; Learning systems; Measurement; Uncertainty; Active Learning; Active learning; Relative Comparisons; relative comparisons;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2015.2462365
  • Filename
    7172547