• DocumentCode
    3167873
  • Title

    Integration of multimodal features for video scene classification based on HMM

  • Author

    Huang, J. ; Liu, Z. ; Wang, Y. ; Chen, Y. ; Wong, E.K.

  • Author_Institution
    Dept. of Electr. Eng., Polytech. Univ., Brooklyn, NY, USA
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    53
  • Lastpage
    58
  • Abstract
    Along with the advances in multimedia and Internet technology, a huge amount of data, including digital video and audio, are generated daily. Tools for the efficient indexing and retrieval of such data are indispensable. With multi-modal information present in the data, effective integration is necessary and is still a challenging problem. In this paper, we present four different methods for integrating audio and visual information for video classification based on a hidden Markov model (HMM): direct concatenation, product HMM, two-stage HMM, and integration by neural network. Our results have shown significant improvements over using a single modality
  • Keywords
    Internet; audio-visual systems; content-based retrieval; database indexing; hidden Markov models; multimedia systems; neural nets; video databases; video signal processing; Internet; audio-visual information; data retrieval; direct concatenation; hidden Markov model; indexing; multi-modal information; multimedia; multimodal feature integration; neural network; video scene classification; Bandwidth; Frequency; Games; Hidden Markov models; Internet; Layout; Speech synthesis; TV; Videoconference; Weather forecasting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing, 1999 IEEE 3rd Workshop on
  • Conference_Location
    Copenhagen
  • Print_ISBN
    0-7803-5610-1
  • Type

    conf

  • DOI
    10.1109/MMSP.1999.793797
  • Filename
    793797