DocumentCode
433081
Title
Discovering meaningful multimedia patterns with audio-visual concepts and associated text
Author
Xie, L. ; Kennedy, L. ; Chang, S.-F. ; Divakaran, A. ; Sun, H. ; Lin, C.-Y.
Author_Institution
Dept. of Electr. Eng., Columbia Univ., USA
Volume
4
fYear
2004
fDate
24-27 Oct. 2004
Firstpage
2383
Abstract
This work presents the first effort to automatically annotate the semantic meanings of temporal video patterns obtained through unsupervised discovery. The problem is of interest in domains where neither the perceptual patterns nor the semantic concepts have simple structures. Patterns in video are modeled with hierarchical hidden Markov models (HHMMs), using efficient algorithms to learn the parameters, the model complexity, and the relevant features; the meanings are drawn from words in the speech transcript of the video. Pattern-word associations are obtained via co-occurrence analysis and statistical machine translation models. Extensive experiments on more than 20 hours of TRECVID news videos yield promising results: video patterns associated with distinct topics such as El Niño and politics are identified, and the HHMM temporal structure model compares favorably to a nontemporal clustering algorithm.
Keywords
audio-visual systems; hidden Markov models; language translation; multimedia communication; pattern clustering; semantic Web; temporal logic; unsupervised learning; video signal processing; HHMM; TRECVID news video; audio-visual concept; automatic annotation; cooccurrence analysis; hierarchical hidden Markov model; multimedia pattern; semantic meaning; statistical machine translation model; temporal video pattern; unsupervised discovery process; Algorithm design and analysis; Clustering algorithms; Games; Hidden Markov models; Pattern analysis; Speech; Statistics; Sun; Supervised learning; Tagging
fLanguage
English
Publisher
IEEE
Conference_Titel
Image Processing, 2004. ICIP '04. 2004 International Conference on
ISSN
1522-4880
Print_ISBN
0-7803-8554-3
Type
conf
DOI
10.1109/ICIP.2004.1421580
Filename
1421580