• DocumentCode
    2951103
  • Title

    Video News Shot Labeling Refinement via Shot Rhythm Models

  • Author

    Kender, John R. ; Naphade, Milind R.

  • Author_Institution
    Dept. of Comput. Sci., Columbia Univ., New York, NY
  • fYear
    2006
  • fDate
    9-12 July 2006
  • Firstpage
    37
  • Lastpage
    40
  • Abstract
    We present a three-step post-processing method for increasing the precision of video shot labels in the domain of television news. First, we demonstrate that news shot sequences can be characterized by rhythms of alternation (due to dialogue), repetition (due to persistent background settings), or both. Thus a temporal model is necessarily third-order Markov. Second, we demonstrate that the output of feature detectors derived from machine learning methods (in particular, from SVMs) can be converted into probabilities in a more effective way than two suggested existing methods. This is particularly true when detectors are errorful due to sparse training sets, as is common in this domain. Third, we demonstrate that a straightforward application of the Viterbi algorithm on a third-order FSM, constructed from observed transition probabilities and converted feature detector outputs, can refine feature label precision at little cost. We show that on a test corpus of TRECVID 2005 news videos annotated with 39 LSCOM-lite features, the mean increase in the measure of average precision (AP) was 4%, with some of the rarer and more difficult features having relative increases in AP of as much as 67%
  • Keywords
    Markov processes; feature extraction; finite state machines; indexing; learning (artificial intelligence); probability; video signal processing; FSM; LSCOM-lite feature; Viterbi algorithm; feature detector; machine learning method; probability; shot rhythm model; shot sequence; television news; temporal model; third-order Markov; three-step post-processing method; video shot label; Computer science; Computer vision; Detectors; Government; Labeling; Learning systems; Ontologies; Rhythm; TV; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2006 IEEE International Conference on
  • Conference_Location
    Toronto, Ont.
  • Print_ISBN
    1-4244-0366-7
  • Electronic_ISBN
    1-4244-0367-7
  • Type

    conf

  • DOI
    10.1109/ICME.2006.262544
  • Filename
    4036530