• DocumentCode
    2172469
  • Title

    Image parsing: unifying segmentation, detection, and recognition

  • Author

    Tu, Zhuowen ; Chen, Xiangrong ; Yuille, Alan L. ; Zhu, Song-Chun

  • Author_Institution
    California Univ., Los Angeles, CA, USA
  • fYear
    2003
  • fDate
    13-16 Oct. 2003
  • Firstpage
    18
  • Abstract
    We propose a general framework for parsing images into regions and objects. In this framework, the detection and recognition of objects proceed simultaneously with image segmentation in a competitive and cooperative manner. We illustrate our approach on natural images of complex city scenes where the objects of primary interest are faces and text. This method makes use of bottom-up proposals combined with top-down generative models using the data driven Markov chain Monte Carlo (DDMCMC) algorithm, which is guaranteed to converge to the optimal estimate asymptotically. More precisely, we define generative models for faces, text, and generic regions- e.g. shading, texture, and clutter. These models are activated by bottom-up proposals. The proposals for faces and text are learnt using a probabilistic version of AdaBoost. The DDMCMC combines reversible jump and diffusion dynamics to enable the generative models to explain the input images in a competitive and cooperative manner. Our experiments illustrate the advantages and importance of combining bottom-up and top-down models and of performing segmentation and object detection/recognition simultaneously.
  • Keywords
    Markov processes; Monte Carlo methods; face recognition; image segmentation; natural scenes; object detection; object recognition; AdaBoost; bottom-up model; data driven Markov chain Monte Carlo algorithm; diffusion dynamics; image parsing; image segmentation; natural image; object detection; object recognition; top-down model; Cities and towns; Computer vision; Face detection; Image generation; Image recognition; Image segmentation; Layout; Monte Carlo methods; Object detection; Proposals;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
  • Conference_Location
    Nice, France
  • Print_ISBN
    0-7695-1950-4
  • Type

    conf

  • DOI
    10.1109/ICCV.2003.1238309
  • Filename
    1238309