• DocumentCode
    642
  • Title

    Finding Rare Classes: Active Learning with Generative and Discriminative Models

  • Author

    Hospedales, Timothy M. ; Gong, Shaogang ; Xiang, Tao

  • Author_Institution
    Sch. of Electron. Eng. & Comput. Sci., Queen Mary Univ. of London, London, UK
  • Volume
    25
  • Issue
    2
  • fYear
    2013
  • fDate
    Feb. 2013
  • Firstpage
    374
  • Lastpage
    386
  • Abstract
    Discovering rare categories and classifying new instances of them are important data mining issues in many fields, but fully supervised learning of a rare class classifier is prohibitively costly in labeling effort. There has therefore been increasing interest both in active discovery: to identify new classes quickly, and active learning: to train classifiers with minimal supervision. These goals occur together in practice and are intrinsically related because examples of each class are required to train a classifier. Nevertheless, very few studies have tried to optimise them together, meaning that data mining for rare classes in new domains makes inefficient use of human supervision. Developing active learning algorithms to optimise both rare class discovery and classification simultaneously is challenging because discovery and classification have conflicting requirements in query criteria. In this paper, we address these issues with two contributions: a unified active learning model to jointly discover new categories and learn to classify them by adapting query criteria online; and a classifier combination algorithm that switches generative and discriminative classifiers as learning progresses. Extensive evaluation on a batch of standard UCI and vision data sets demonstrates the superiority of this approach over existing methods.
  • Keywords
    data mining; learning (artificial intelligence); pattern classification; UCI data sets; active discovery; active learning; classifier combination algorithm; classifier training; data mining issues; discriminative classifiers; discriminative models; generative classifiers; generative models; instance classification; query criteria; rare category discovery; rare class classification; rare class classifier; rare class discovery; supervised learning; vision data sets; Adaptation models; Approximation methods; Data mining; Data models; Learning systems; Support vector machines; Uncertainty; Active learning; classification; discriminative models; generative models; imbalanced learning; rare class discovery;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2011.231
  • Filename
    6081866