• DocumentCode
    2292671
  • Title

    Weakly supervised discriminative localization and classification: a joint learning process

  • Author

    Nguyen, Minh Hoai ; Torresani, Lorenzo ; De la Torre, Fernando ; Rother, Carsten

  • Author_Institution
    Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2009
  • fDate
    Sept. 29 2009-Oct. 2 2009
  • Firstpage
    1925
  • Lastpage
    1932
  • Abstract
    Visual categorization problems, such as object classification or action recognition, are increasingly often approached using a detection strategy: a classifier function is first applied to candidate subwindows of the image or the video, and then the maximum classifier score is used for class decision. Traditionally, the subwindow classifiers are trained on a large collection of examples manually annotated with masks or bounding boxes. The reliance on time-consuming human labeling effectively limits the application of these methods to problems involving very few categories. Furthermore, the human selection of the masks introduces arbitrary biases (e.g. in terms of window size and location) which may be suboptimal for classification. In this paper we propose a novel method for learning a discriminative subwindow classifier from examples annotated with binary labels indicating the presence of an object or action of interest, but not its location. During training, our approach simultaneously localizes the instances of the positive class and learns a subwindow SVM to recognize them. We extend our method to classification of time series by presenting an algorithm that localizes the most discriminative set of temporal segments in the signal. We evaluate our approach on several datasets for object and action recognition and show that it achieves results similar and in many cases superior to those obtained with full supervision.
  • Keywords
    image classification; learning (artificial intelligence); object recognition; support vector machines; action recognition; discriminative subwindow classifier; human labeling; image classification; joint learning process; object classification; object recognition; subwindow SVM; support vector machine; time series classification; video classifier function; visual categorization problems; weakly supervised discriminative localization; Animals; Computational complexity; Computer vision; Humans; Image recognition; Image segmentation; Object detection; Robustness; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2009 IEEE 12th International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1550-5499
  • Print_ISBN
    978-1-4244-4420-5
  • Electronic_ISBN
    1550-5499
  • Type

    conf

  • DOI
    10.1109/ICCV.2009.5459426
  • Filename
    5459426