DocumentCode :
463740
Title :
Probabilistic Graphical Model for Auto-Annotation, Content-Based Retrieval, and Classification of TV Clips Containing Audio, Video, and Text
Author :
Putthividhya, D. ; Attias, H.T. ; Nagarajan, Srikantan S. ; Lee, Taewoo
Author_Institution :
Institute for Neural Comput., La Jolla, CA, USA
Volume :
2
fYear :
2007
fDate :
15-20 April 2007
Abstract :
We present a probabilistic graphical model that learns the joint statistical structures of text, audio, and video for the purpose of classification and retrieval of multimedia documents. The proposed model, which we call multi-modal LDA (MM-LDA), builds on the basic latent Dirichlet allocation (LDA) model by postulating common hidden factors, termed topics, that are shared among the 3 data modalities. These hidden topics correspond to patterns of word co-occurrences in multimedia documents and describe how text words co-occur with certain visual and acoustic features. We demonstrate the power of MM-LDA in representing TV clips containing closed-captions, video, and audio, and show promising results in 3 challenging applications: TV clip classification, retrieval, and auto-annotation.
Keywords :
content-based retrieval; multimedia systems; statistical analysis; TV clip classification; acoustic features; autoannotation probabilistic graphical model; content-based retrieval; joint statistical structures; multimedia document retrieval; multimodal latent Dirichlet allocation; Content based retrieval; Dictionaries; Explosives; Graphical models; Information retrieval; Linear discriminant analysis; Radiology; Streaming media; TV; Video sharing; Automatic video annotation; Content-based video retrieval; Multi-modal data representation; Multimedia information retrieval; Multimedia processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366354
Filename :
4217527
Link To Document :
بازگشت