• DocumentCode
    949485
  • Title

    Scene Classification Using a Hybrid Generative/Discriminative Approach

  • Author

    Bosch, Anna ; Zisserman, Andrew ; Muoz, X.

  • Author_Institution
    Univ. de Girona, Girona
  • Volume
    30
  • Issue
    4
  • fYear
    2008
  • fDate
    4/1/2008 12:00:00 AM
  • Firstpage
    712
  • Lastpage
    727
  • Abstract
    We investigate whether dimensionality reduction using a latent generative model is beneficial for the task of weakly supervised scene classification. In detail, we are given a set of labeled images of scenes (for example, coast, forest, city, river, etc.), and our objective is to classify a new image into one of these categories. Our approach consists of first discovering latent ";topics"; using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature here applied to a bag of visual words representation for each image, and subsequently, training a multiway classifier on the topic distribution vector for each image. We compare this approach to that of representing each image by a bag of visual words vector directly and training a multiway classifier on these vectors. To this end, we introduce a novel vocabulary using dense color SIFT descriptors and then investigate the classification performance under changes in the size of the visual vocabulary, the number of latent topics learned, and the type of discriminative classifier used (k-nearest neighbor or SVM). We achieve superior classification performance to recent publications that have used a bag of visual word representation, in all cases, using the authors\´ own data sets and testing protocols. We also investigate the gain in adding spatial information. We show applications to image retrieval with relevance feedback and to scene classification in videos.
  • Keywords
    image classification; support vector machines; latent generative model; multiway classifier; probabilistic latent semantic analysis; relevance feedback; statistical text literature; supervised scene classification; topic distribution vector; visual word representation; visual words vector; Scene Classification; Spatial Information; pLSA; Algorithms; Discriminant Analysis; Image Enhancement; Image Interpretation, Computer-Assisted; Pattern Recognition, Automated; Photography; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique; Video Recording;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2007.70716
  • Filename
    4359337