• DocumentCode
    744302
  • Title

    Temporal Localization of Actions with Actoms

  • Author

    Gaidon, A. ; Harchaoui, Zaid ; Schmid, Cordelia

  • Author_Institution
    Xerox Res. Centre Eur., Meylan, France
  • Volume
    35
  • Issue
    11
  • fYear
    2013
  • Firstpage
    2782
  • Lastpage
    2795
  • Abstract
    We address the problem of localizing actions, such as opening a door, in hours of challenging video data. We propose a model based on a sequence of atomic action units, termed "actoms," that are semantically meaningful and characteristic for the action. Our actom sequence model (ASM) represents an action as a sequence of histograms of actom-anchored visual features, which can be seen as a temporally structured extension of the bag-of-features. Training requires the annotation of actoms for action examples. At test time, actoms are localized automatically based on a nonparametric model of the distribution of actoms, which also acts as a prior on an action's temporal structure. We present experimental results on two recent benchmarks for action localization: the "Coffee and Cigarettes" and "DLSBP" datasets. We also adapt our approach to a classification-by-localization set-up and demonstrate its applicability on the challenging "Hollywood 2" dataset. We show that our ASM method outperforms the current state of the art in temporal action localization, as well as baselines that localize actions with a sliding window method.
  • Keywords
    image classification; image motion analysis; video signal processing; ASM; Coffee and Cigarettes dataset; DLSBP dataset; Hollywood 2 dataset; actom sequence model; actom-anchored visual features; atomic action units; bag-of-features; classification-by-localization; nonparametric model; sliding window method; temporal action localization; video data; Adaptation models; Hidden Markov models; Histograms; Spatiotemporal phenomena; Support vector machines; Training; Visualization; Action recognition; actoms; temporal localization; video analysis; Actigraphy; Algorithms; Artificial Intelligence; Humans; Image Interpretation, Computer-Assisted; Pattern Recognition, Automated; Subtraction Technique; Video Recording; Whole Body Imaging;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2013.65
  • Filename
    6487513