• DocumentCode
    3014580
  • Title

    Leveraging temporal, contextual and ordering constraints for recognizing complex activities in video

  • Author

    Laxton, Benjamin ; Lim, Jongwoo ; Kriegman, David

  • Author_Institution
    California Univ., San Diego
  • fYear
    2007
  • fDate
    17-22 June 2007
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    We present a scalable approach to recognizing and describing complex activities in video sequences. We are interested in long-term, sequential activities that may have several parallel streams of action. Our approach integrates temporal, contextual and ordering constraints with output from low-level visual detectors to recognize complex, long-term activities. We argue that a hierarchical, object-oriented design lends our solution to be scalable in that higher-level reasoning components are independent from the particular low-level detector implementation and that recognition of additional activities and actions can easily be added. Three major components to realize this design are: a dynamic Bayesian network structure for representing activities comprised of partially ordered sub-actions, an object-oriented action hierarchy for building arbitrarily complex action detectors and an approximate Viterbi-like algorithm for inferring the most likely observed sequence of actions. Additionally, this study proposes the Erlang distribution as a comprehensive model of idle time between actions and frequency of observing new actions. We show results for our approach on real video sequences containing complex activities.
  • Keywords
    Bayes methods; image sequences; object-oriented methods; video signal processing; Erlang distribution; approximate Viterbi-like algorithm; contextual constraint; dynamic Bayesian network; object-oriented action hierarchy; ordering constraint; temporal constraint; video sequence; Algorithm design and analysis; Bayesian methods; Computer science; Computerized monitoring; Detectors; Object oriented modeling; State estimation; Streaming media; Video sequences; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
  • Conference_Location
    Minneapolis, MN
  • ISSN
    1063-6919
  • Print_ISBN
    1-4244-1179-3
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2007.383074
  • Filename
    4270099