DocumentCode
3167873
Title
Integration of multimodal features for video scene classification based on HMM
Author
Huang, J. ; Liu, Z. ; Wang, Y. ; Chen, Y. ; Wong, E.K.
Author_Institution
Dept. of Electr. Eng., Polytech. Univ., Brooklyn, NY, USA
fYear
1999
fDate
1999
Firstpage
53
Lastpage
58
Abstract
Along with the advances in multimedia and Internet technology, a huge amount of data, including digital video and audio, are generated daily. Tools for the efficient indexing and retrieval of such data are indispensable. With multi-modal information present in the data, effective integration is necessary and is still a challenging problem. In this paper, we present four different methods for integrating audio and visual information for video classification based on a hidden Markov model (HMM): direct concatenation, product HMM, two-stage HMM, and integration by neural network. Our results have shown significant improvements over using a single modality
Keywords
Internet; audio-visual systems; content-based retrieval; database indexing; hidden Markov models; multimedia systems; neural nets; video databases; video signal processing; Internet; audio-visual information; data retrieval; direct concatenation; hidden Markov model; indexing; multi-modal information; multimedia; multimodal feature integration; neural network; video scene classification; Bandwidth; Frequency; Games; Hidden Markov models; Internet; Layout; Speech synthesis; TV; Videoconference; Weather forecasting;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing, 1999 IEEE 3rd Workshop on
Conference_Location
Copenhagen
Print_ISBN
0-7803-5610-1
Type
conf
DOI
10.1109/MMSP.1999.793797
Filename
793797
Link To Document