DocumentCode
3284754
Title
Hierarchical video clustering
Author
Petrovic, Nemanja ; Jojic, Nebojsa ; Huang, Thomas S.
Author_Institution
Inst. of Beckman, Illinois Univ., IL, USA
fYear
2004
fDate
29 Sept.-1 Oct. 2004
Firstpage
423
Lastpage
426
Abstract
We present a novel generative model for video that models video as mixture of transformed video scenes. The learning procedure automatically clusters video frames into video scenes and objects. The learning algorithm is based on a hierarchical, on-line EM algorithm. Fast Fourier transform (FFT) is used for rapid computations in E and M step of the EM algorithm. We use the model to: 1. perform video clustering by grouping similar (up to translation and scale) video frames into clusters; 2. robustly stabilize video by inferring translation and scale intensity for each frame. We believe that video scene modeling of this kind is essential to bridge the "semantic gap" in video understanding. We illustrate this with several excellent results, both in terms of speed and accuracy.
Keywords
fast Fourier transforms; learning (artificial intelligence); pattern clustering; video databases; video signal processing; fast Fourier transform; hierarchical video clustering; learning algorithm; online EM algorithm; semantic gap; transformed video scene; Bridges; Clustering algorithms; Fast Fourier transforms; Graphical models; Indexing; Information retrieval; Layout; Random variables; Robustness; Spatial databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing, 2004 IEEE 6th Workshop on
Print_ISBN
0-7803-8578-0
Type
conf
DOI
10.1109/MMSP.2004.1436583
Filename
1436583
Link To Document