• DocumentCode
    3284754
  • Title

    Hierarchical video clustering

  • Author

    Petrovic, Nemanja ; Jojic, Nebojsa ; Huang, Thomas S.

  • Author_Institution
    Inst. of Beckman, Illinois Univ., IL, USA
  • fYear
    2004
  • fDate
    29 Sept.-1 Oct. 2004
  • Firstpage
    423
  • Lastpage
    426
  • Abstract
    We present a novel generative model for video that models video as mixture of transformed video scenes. The learning procedure automatically clusters video frames into video scenes and objects. The learning algorithm is based on a hierarchical, on-line EM algorithm. Fast Fourier transform (FFT) is used for rapid computations in E and M step of the EM algorithm. We use the model to: 1. perform video clustering by grouping similar (up to translation and scale) video frames into clusters; 2. robustly stabilize video by inferring translation and scale intensity for each frame. We believe that video scene modeling of this kind is essential to bridge the "semantic gap" in video understanding. We illustrate this with several excellent results, both in terms of speed and accuracy.
  • Keywords
    fast Fourier transforms; learning (artificial intelligence); pattern clustering; video databases; video signal processing; fast Fourier transform; hierarchical video clustering; learning algorithm; online EM algorithm; semantic gap; transformed video scene; Bridges; Clustering algorithms; Fast Fourier transforms; Graphical models; Indexing; Information retrieval; Layout; Random variables; Robustness; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing, 2004 IEEE 6th Workshop on
  • Print_ISBN
    0-7803-8578-0
  • Type

    conf

  • DOI
    10.1109/MMSP.2004.1436583
  • Filename
    1436583