• DocumentCode
    433081
  • Title
    Discovering meaningful multimedia patterns with audio-visual concepts and associated text
  • Author
    Xie, L.; Kennedy, L.; Chang, S.-F.; Divakaran, A.; Sun, H.; Lin, C.-Y.
  • Author_Institution
    Dept. of Electr. Eng., Columbia Univ., USA
  • Volume
    4
  • fYear
    2004
  • fDate
    24-27 Oct. 2004
  • Firstpage
    2383
  • Abstract
    This work presents the first effort to automatically annotate the semantic meanings of temporal video patterns obtained through unsupervised discovery processes. The problem is of interest in domains where neither perceptual patterns nor semantic concepts have simple structures. The patterns in video are modeled with hierarchical hidden Markov models (HHMM), using efficient algorithms to learn the parameters, the model complexity, and the relevant features; the meanings are carried by words in the speech transcript of the video. The pattern-word association is obtained via cooccurrence analysis and statistical machine translation models. Experiments on more than 20 hours of TRECVID news videos give promising results: video patterns associated with distinct topics such as El Niño and politics are identified, and the HHMM temporal structure model compares favorably to a nontemporal clustering algorithm.
  • Keywords
    audio-visual systems; hidden Markov models; language translation; multimedia communication; pattern clustering; semantic Web; temporal logic; unsupervised learning; video signal processing; HHMM; TRECVID news video; audio-visual concept; automatic annotation; cooccurrence analysis; hierarchical hidden Markov model; multimedia pattern; semantic meaning; statistical machine translation model; temporal video pattern; unsupervised discovery process; Algorithm design and analysis; Clustering algorithms; Games; Hidden Markov models; Pattern analysis; Speech; Statistics; Sun; Supervised learning; Tagging
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image Processing, 2004. ICIP '04. 2004 International Conference on
  • ISSN
    1522-4880
  • Print_ISBN
    0-7803-8554-3
  • Type
    conf
  • DOI
    10.1109/ICIP.2004.1421580
  • Filename
    1421580