• DocumentCode
    442649
  • Title

    News video classification based on multi-modal information fusion

  • Author

    Lie, Wen-Nung ; Su, Chen-Kang

  • Author_Institution
    Dept. of Electr. Eng., Nat. Chung Chen Univ., Taiwan
  • Volume
    1
  • fYear
    2005
  • fDate
    11-14 Sept. 2005
  • Abstract
    A multi-modal information fusion technique integrating the closed caption, anchor´s speech, and visual information for TV news video classification is presented. By recognizing closed-caption characters from video, phrases of single- and double-character are found for classification. On the other hand, content of the anchor´s speech signal is not recognized, but instead, labeled with pre-trained cluster means by using a level-building DP (dynamic programming) algorithm. Visual information, including the color and motion features, is extracted from the news footage part for classification. The above three information is individually classified by using statistical relevance factor (RF) or SVM (support vector machine) technique, amounting to 7 different classifiers. Results of multiple classifiers are then combined to get fused outputs by using a modified Bayesian technique. Experiments show that the proposed fusion system is capable of increasing the classification rate by 14% with respect to the best single-modal system. Our Bayesian fusion rule also outperforms the best product rule presented in J. Kittler, et al (1998) by 3%.
  • Keywords
    Bayes methods; dynamic programming; feature extraction; image classification; sensor fusion; support vector machines; Bayesian fusion rule; TV news; anchor speech signal; closed-caption characters; feature extraction; level-building dynamic programming; modified Bayesian technique; multimodal information fusion; multiple classifiers; statistical relevance factor; support vector machine; video classification; visual information; Bayesian methods; Character recognition; Clustering algorithms; Dynamic programming; Feature extraction; Heuristic algorithms; Speech recognition; Support vector machine classification; Support vector machines; TV;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 2005. ICIP 2005. IEEE International Conference on
  • Print_ISBN
    0-7803-9134-9
  • Type

    conf

  • DOI
    10.1109/ICIP.2005.1529975
  • Filename
    1529975