• DocumentCode
    1225670
  • Title

    Finding structure in home videos by probabilistic hierarchical clustering

  • Author

    Gatica-Perez, Daniel ; Loui, Alexander ; Sun, Ming-Ting

  • Author_Institution
    Dalle Molle Inst. for Perceptual Artificial Intelligence, Martigny, Switzerland
  • Volume
    13
  • Issue
    6
  • fYear
    2003
  • fDate
    6/1/2003 12:00:00 AM
  • Firstpage
    539
  • Lastpage
    548
  • Abstract
    Accessing, organizing, and manipulating home videos present technical challenges due to their unrestricted content and lack of storyline. We present a methodology to discover cluster structure in home videos, which uses video shots as the unit of organization, and is based on two concepts: (1) the development of statistical models of visual similarity, duration, and temporal adjacency of consumer video segments and (2) the reformulation of hierarchical clustering as a sequential binary Bayesian classification process. A Bayesian formulation allows for the incorporation of prior knowledge of the structure of home video and offers the advantages of a principled methodology. Gaussian mixture models are used to represent the class-conditional distributions of intra- and inter-segment visual and temporal features. The models are then used in the probabilistic clustering algorithm, where the merging order is a variation of highest confidence first, and the merging criterion is maximum a posteriori. The algorithm does not need any ad-hoc parameter determination. We present extensive results on a 10-h home-video database with ground truth which thoroughly validate the performance of our methodology with respect to cluster detection, individual shot-cluster labeling, and the effect of prior selection.
  • Keywords
    Bayes methods; feature extraction; image classification; image segmentation; pattern clustering; probability; statistical analysis; video databases; video signal processing; Bayesian formulation; Gaussian mixture models; class-conditional distributions; cluster detection; consumer video segments; duration; ground truth; home videos structure; home-video database; inter-segment temporal features; inter-segment visual features; intra-segment temporal features; intra-segment visual features; maximum a posteriori merging criterion; probabilistic clustering algorithm; probabilistic hierarchical clustering; sequential binary Bayesian classification; shot-cluster labeling; statistical models; temporal adjacency; video shots; video-segment feature extraction; video-segment feature selection; visual similarity; Bayesian methods; Clustering algorithms; Decision theory; Labeling; Merging; Organizing; Spatial databases; Sun; Video recording; Visual databases;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2003.813428
  • Filename
    1207412