• DocumentCode
    969718
  • Title

    Building models of animals from video

  • Author

    Ramanan, D. ; Forsyth, D.A. ; Barnard, K.

  • Author_Institution
    Toyota Technol. Inst., Chicago, IL
  • Volume
    28
  • Issue
    8
  • fYear
    2006
  • Firstpage
    1319
  • Lastpage
    1334
  • Abstract
    This paper argues that tracking, object detection, and model building are all similar activities. We describe a fully automatic system that builds 2D articulated models known as pictorial structures from videos of animals. The learned model can be used to detect the animal in the original video - in this sense, the system can be viewed as a generalized tracker (one that is capable of modeling objects while tracking them). The learned model can be matched to a visual library; here, the system can be viewed as a video recognition algorithm. The learned model can also be used to detect the animal in novel images - in this case, the system can be seen as a method for learning models for object recognition. We find that we can significantly improve the pictorial structures by augmenting them with a discriminative texture model learned from a texture library. We develop a novel texture descriptor that outperforms the state-of-the-art for animal textures. We demonstrate the entire system on real video sequences of three different animals. We show that we can automatically track and identify the given animal. We use the learned models to recognize animals from two data sets; images taken by professional photographers from the Corel collection, and assorted images from the Web returned by Google. We demonstrate quite good performance on both data sets. Comparing our results with simple baselines, we show that, for the Google set, we can detect, localize, and recover part articulations from a collection demonstrably hard for object recognition
  • Keywords
    image sequences; image texture; object detection; object recognition; video signal processing; 2D articulated models; Corel collection; Google set; animal model building; animal textures; animal videos; discriminative texture model; object detection; object recognition; pictorial structures; texture descriptor; texture library; video recognition algorithm; video sequences; visual library; Animal structures; Buildings; Deformable models; Head; Leg; Libraries; Object detection; Object recognition; Shape; Video sequences; Tracking; object recognition; shape.; texture; video analysis; Algorithms; Animals; Artificial Intelligence; Computer Simulation; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Information Storage and Retrieval; Models, Anatomic; Models, Biological; Movement; Pattern Recognition, Automated; Photography; Video Recording;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2006.155
  • Filename
    1642665