• DocumentCode
    254081
  • Title

    Generalized Max Pooling

  • Author

    Murray, Naila ; Perronnin, Florent

  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    2473
  • Lastpage
    2480
  • Abstract
    State-of-the-art patch-based image representations involve a pooling operation that aggregates statistics computed from local descriptors. Standard pooling operations include sum- and max-pooling. Sum-pooling lacks discriminability because the resulting representation is strongly influenced by frequent yet often uninformative descriptors, but only weakly influenced by rare yet potentially highly-informative ones. Max-pooling equalizes the influence of frequent and rare descriptors but is only applicable to representations that rely on count statistics, such as the bag-of-visual-words (BOV)and its soft- and sparse-coding extensions. We propose a novel pooling mechanism that achieves the same effect as max-pooling but is applicable beyond the BOV and especially to the state-of-the-art Fisher Vector -- hence the name Generalized Max Pooling (GMP). It involves equalizing the similarity between each patch and the pooled representation, which is shown to be equivalent to re-weighting the per-patch statistics. We show on five public image classification benchmarks that the proposed GMP can lead to significant performance gains with respect to heuristic alternatives.
  • Keywords
    image classification; image coding; image representation; vectors; BOV; Fisher vector; bag-of-visual-words; generalized max pooling; local descriptors; max-pooling; patch-based image representations; per-patch statistics; pooled representation; public image classification benchmarks; soft-coding extension; sparse-coding extension; standard pooling operations; sum-pooling; Birds; Computer vision; Encoding; Kernel; Standards; Vectors; Visualization; classification; image representations; pooling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.317
  • Filename
    6909713