• DocumentCode
    253686
  • Title

    Seeing What You´re Told: Sentence-Guided Activity Recognition in Video

  • Author

    Siddharth, N. ; Barbu, Andrei ; Siskind, Jeffrey Mark

  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    732
  • Lastpage
    739
  • Abstract
    We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, providing a medium for top-down and bottom-up integration as well as multi-modal integration between vision and language. We show how the roles played by participants (nouns), their characteristics (adjectives), the actions performed (verbs), the manner of such actions (adverbs), and changing spatial relations between participants (prepositions), in the form of whole-sentence descriptions mediated by a grammar, guides the activity-recognition process. Further, the utility and expressiveness of our framework is demonstrated by performing three separate tasks in the domain of multi-activity video: sentence-guided focus of attention, generation of sentential description, and query-based search, simply by leveraging the framework in different manners.
  • Keywords
    object recognition; video retrieval; video signal processing; bottom-up integration; event compositional structure; focusing mechanisms; language compositional structure; multimodal integration; query-based search; sentence-guided activity recognition; sentence-guided attention focus; sentential description generation; spatial relations; top-down integration; video action recognition; whole-sentence descriptions; Detectors; Feature extraction; Grammar; Hidden Markov models; Lattices; Semantics; Tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.99
  • Filename
    6909494