DocumentCode :
1225670
Title :
Finding structure in home videos by probabilistic hierarchical clustering
Author :
Gatica-Perez, Daniel ; Loui, Alexander ; Sun, Ming-Ting
Author_Institution :
Dalle Molle Inst. for Perceptual Artificial Intelligence, Martigny, Switzerland
Volume :
13
Issue :
6
fYear :
2003
fDate :
6/1/2003 12:00:00 AM
Firstpage :
539
Lastpage :
548
Abstract :
Accessing, organizing, and manipulating home videos present technical challenges due to their unrestricted content and lack of storyline. We present a methodology to discover cluster structure in home videos, which uses video shots as the unit of organization, and is based on two concepts: (1) the development of statistical models of visual similarity, duration, and temporal adjacency of consumer video segments and (2) the reformulation of hierarchical clustering as a sequential binary Bayesian classification process. A Bayesian formulation allows for the incorporation of prior knowledge of the structure of home video and offers the advantages of a principled methodology. Gaussian mixture models are used to represent the class-conditional distributions of intra- and inter-segment visual and temporal features. The models are then used in the probabilistic clustering algorithm, where the merging order is a variation of highest confidence first, and the merging criterion is maximum a posteriori. The algorithm does not need any ad-hoc parameter determination. We present extensive results on a 10-h home-video database with ground truth which thoroughly validate the performance of our methodology with respect to cluster detection, individual shot-cluster labeling, and the effect of prior selection.
Keywords :
Bayes methods; feature extraction; image classification; image segmentation; pattern clustering; probability; statistical analysis; video databases; video signal processing; Bayesian formulation; Gaussian mixture models; class-conditional distributions; cluster detection; consumer video segments; duration; ground truth; home videos structure; home-video database; inter-segment temporal features; inter-segment visual features; intra-segment temporal features; intra-segment visual features; maximum a posteriori merging criterion; probabilistic clustering algorithm; probabilistic hierarchical clustering; sequential binary Bayesian classification; shot-cluster labeling; statistical models; temporal adjacency; video shots; video-segment feature extraction; video-segment feature selection; visual similarity; Bayesian methods; Clustering algorithms; Decision theory; Labeling; Merging; Organizing; Spatial databases; Sun; Video recording; Visual databases;
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2003.813428
Filename :
1207412
Link To Document :
بازگشت