DocumentCode :
8404
Title :
Categorizing Dynamic Textures Using a Bag of Dynamical Systems
Author :
Ravichandran, Arunkumar ; Chaudhry, Rizwan ; Vidal, Rene
Author_Institution :
UCLA Vision Lab., Univ. of California, Los Angeles, Los Angeles, CA, USA
Volume :
35
Issue :
2
fYear :
2013
fDate :
Feb. 2013
Firstpage :
342
Lastpage :
353
Abstract :
We consider the problem of categorizing video sequences of dynamic textures, i.e., nonrigid dynamical objects such as fire, water, steam, flags, etc. This problem is extremely challenging because the shape and appearance of a dynamic texture continuously change as a function of time. State-of-the-art dynamic texture categorization methods have been successful at classifying videos taken from the same viewpoint and scale by using a Linear Dynamical System (LDS) to model each video, and using distances or kernels in the space of LDSs to classify the videos. However, these methods perform poorly when the video sequences are taken under a different viewpoint or scale. In this paper, we propose a novel dynamic texture categorization framework that can handle such changes. We model each video sequence with a collection of LDSs, each one describing a small spatiotemporal patch extracted from the video. This Bag-of-Systems (BoS) representation is analogous to the Bag-of-Features (BoF) representation for object recognition, except that we use LDSs as feature descriptors. This choice poses several technical challenges in adopting the traditional BoF approach. Most notably, the space of LDSs is not euclidean; hence, novel methods for clustering LDSs and computing codewords of LDSs need to be developed. We propose a framework that makes use of nonlinear dimensionality reduction and clustering techniques combined with the Martin distance for LDSs to tackle these issues. Our experiments compare the proposed BoS approach to existing dynamic texture categorization methods and show that it can be used for recognizing dynamic textures in challenging scenarios which could not be handled by existing methods.
Keywords :
feature extraction; image sequences; image texture; pattern clustering; video signal processing; BoF; BoS; LDS; Martin distance; bag-of-features representation; bag-of-systems representation; clustering techniques; codewords; dynamic texture categorization methods; feature descriptors; fire; flags; linear dynamical system; nonlinear dimensionality reduction; nonrigid dynamical objects; object recognition; spatiotemporal patch; steam; video classification; video sequence categorization; water; Feature extraction; Heuristic algorithms; Measurement; Observability; Spatiotemporal phenomena; Training; Video sequences; Dynamic textures; categorization; linear dynamical systems; Algorithms; Artificial Intelligence; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2012.83
Filename :
6178260
Link To Document :
بازگشت