• DocumentCode
    1644268
  • Title

    The effects of segmentation and feature choice in a translation model of object recognition

  • Author

    Barnard, Kobus ; Duygulu, Pinar ; Guru, Raghavendra ; Gabbur, Prasad ; Forsyth, David

  • Author_Institution
    Dept. of Comput. Sci., Arizona Univ., Tucson, AZ, USA
  • Volume
    2
  • fYear
    2003
  • Abstract
    We work with a model of object recognition where words must be placed on image regions. This approach means that large scale experiments are relatively easy, so we can evaluate the effects of various early and midlevel vision algorithms on recognition performance. We evaluate various image segmentation algorithms by determining word prediction accuracy for images segmented in various ways and represented by various features. We take the view that good segmentations respect object boundaries, and so word prediction should be better for a better segmentation. However, it is usually very difficult in practice to obtain segmentations that do not break up objects, so most practitioners attempt to merge segments to get better putative object representations. We demonstrate that our paradigm of word prediction easily allows us to predict potentially useful segment merges, even for segments that do not look similar (for example, merging the black and white halves of a penguin is not possible with feature-based segmentation; the main cue must be "familiar configuration"). These studies focus on unsupervised learning of recognition. However, we show that word prediction can be markedly improved by providing supervised information for a relatively small number of regions together with large quantities of unsupervised information. This supervisory information allows a better and more discriminative choice of features and breaks possible symmetries.
  • Keywords
    feature extraction; image representation; image segmentation; object recognition; word processing; familiar configuration; feature-based segmentation; image feature; image region; image segmentation; object boundary; object recognition; object representation; recognition performance; segment merging; supervised information; supervisory information; translation model; unsupervised learning; vision algorithm; word prediction; Computer Society; Computer science; Computer vision; Frequency; Gaussian processes; Labeling; Object recognition; Pattern recognition; Probability distribution; Shape;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2003. Proceedings. 2003 IEEE Computer Society Conference on
  • ISSN
    1063-6919
  • Print_ISBN
    0-7695-1900-8
  • Type

    conf

  • DOI
    10.1109/CVPR.2003.1211532
  • Filename
    1211532