• DocumentCode
    78807
  • Title

    A Scalable and Accurate Descriptor for Dynamic Textures Using Bag of System Trees

  • Author

    Mumtaz, Adeel ; Coviello, Emanuele ; Lanckriet, Gert R. G. ; Chan, Antoni B.

  • Author_Institution
    Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China
  • Volume
    37
  • Issue
    4
  • fYear
    2015
  • fDate
    April 1 2015
  • Firstpage
    697
  • Lastpage
    712
  • Abstract
    The bag-of-systems (BoS) representation is a descriptor of motion in a video, where dynamic texture (DT) codewords represent the typical motion patterns in spatio-temporal patches extracted from the video. The efficacy of the BoS descriptor depends on the richness of the codebook, which depends on the number of codewords in the codebook. However, for even modest sized codebooks, mapping videos onto the codebook results in a heavy computational load. In this paper we propose the BoS Tree, which constructs a bottom-up hierarchy of codewords that enables efficient mapping of videos to the BoS codebook. By leveraging the tree structure to efficiently index the codewords, the BoS Tree allows for fast look-ups in the codebook and enables the practical use of larger, richer codebooks. We demonstrate the effectiveness of BoS Trees on classification of four video datasets, as well as on annotation of a video dataset and a music dataset. Finally, we show that, although the fast look-ups of BoS Tree result in different descriptors than BoS for the same video, the overall distance (and kernel) matrices are highly correlated resulting in similar classification performance.
  • Keywords
    feature extraction; image classification; image motion analysis; image representation; image texture; matrix algebra; music; trees (mathematics); video coding; BoS representation; DT codewords; bag of system trees; bag-of-system representation; distance matrices; dynamic texture codewords; dynamic textures; motion patterns; music dataset annotation; spatio-temporal patch extraction; tree structure; video dataset annotation; video dataset classification; video mapping; Clustering algorithms; Heuristic algorithms; Histograms; Indexing; Training; Vectors; Vegetation; Dynamic textures; bag of systems; dynamic texture recognition; efficient indexing; large codebooks; music annotation; video annotation;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2014.2359432
  • Filename
    6905854