Hierarchical video clustering

Author

Petrovic, Nemanja ; Jojic, Nebojsa ; Huang, Thomas S.

Author_Institution

Inst. of Beckman, Illinois Univ., IL, USA

fYear

2004

fDate

29 Sept.-1 Oct. 2004

Firstpage

423

Lastpage

426

Abstract

We present a novel generative model for video that models video as mixture of transformed video scenes. The learning procedure automatically clusters video frames into video scenes and objects. The learning algorithm is based on a hierarchical, on-line EM algorithm. Fast Fourier transform (FFT) is used for rapid computations in E and M step of the EM algorithm. We use the model to: 1. perform video clustering by grouping similar (up to translation and scale) video frames into clusters; 2. robustly stabilize video by inferring translation and scale intensity for each frame. We believe that video scene modeling of this kind is essential to bridge the "semantic gap" in video understanding. We illustrate this with several excellent results, both in terms of speed and accuracy.

Keywords

fast Fourier transforms; learning (artificial intelligence); pattern clustering; video databases; video signal processing; fast Fourier transform; hierarchical video clustering; learning algorithm; online EM algorithm; semantic gap; transformed video scene; Bridges; Clustering algorithms; Fast Fourier transforms; Graphical models; Indexing; Information retrieval; Layout; Random variables; Robustness; Spatial databases;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia Signal Processing, 2004 IEEE 6th Workshop on

Print_ISBN

0-7803-8578-0

Type

conf

DOI

10.1109/MMSP.2004.1436583

Filename

1436583