• DocumentCode
    80237
  • Title

    Video Tomographs and a Base Detector Selection Strategy for Improving Large-Scale Video Concept Detection

  • Author

    Sidiropoulos, Panagiotis ; Mezaris, Vasileios ; Kompatsiaris, Ioannis

  • Author_Institution
    Mullard Space Sci. Lab., Univ. Coll. London, London, UK
  • Volume
    24
  • Issue
    7
  • fYear
    2014
  • fDate
    Jul-14
  • Firstpage
    1251
  • Lastpage
    1264
  • Abstract
    In this paper, we deal with the problem of video concept detection to use the concept detection results toward a more effective concept-based video retrieval. The key novelties of this paper are as follows: 1) the use of spatio-temporal video slices (tomographs) in the same way that visual keyframes are typically used in video concept detection schemes. These spatio-temporal slices capture in a compact way motion patterns that are useful for detecting semantic concepts and are used for training a number of base detectors. The latter augment the set of keyframe-based base detectors that can be trained using different frame representations. 2) The introduction of a generic methodology, built upon a genetic algorithm, for controlling which subset of the available base detectors (consequently, which subset of the possible shot representations) should be combined for developing an optimal detector for each specific concept. This methodology is directly applicable to the learning of hundreds of diverse concepts, while diverging from the one-size-fits-all approach that is typically used in problems of this size. The proposed techniques are evaluated on the datasets of the 2011 and 2012 Semantic Indexing Task of TRECVID, each comprising several hundred hours of heterogeneous video clips and ground-truth annotations for tens of concepts that exhibit significant variation in terms of generality, complexity, and human participation. The experimental results manifest the merit of the proposed techniques.
  • Keywords
    video retrieval; video signal processing; base detector selection strategy; generic methodology; ground truth annotations; improving large-scale video concept detection; motion patterns; semantic indexing; spatio temporal slices; spatio temporal video slices; video clips; video retrieval; video tomographs; visual keyframes; Accuracy; Cameras; Computational efficiency; Detectors; Feature extraction; Streaming media; Visualization; Feature extraction; genetic algorithms; image sequence analysis; machine learning algorithms; video concept detection; video signal processing;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2014.2302554
  • Filename
    6727470