• DocumentCode
    3149513
  • Title

    Multi-modal information fusion for news story segmentation in broadcast video

  • Author

    Feng, Bailan ; Ding, Peng ; Chen, Jiansong ; Bai, Jinfeng ; Xu, Su ; Xu, Bo

  • Author_Institution
    Digital Content Technol. Res. Center, Inst. of Autom., Beijing, China
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    1417
  • Lastpage
    1420
  • Abstract
    With the fast development of high-speed network and digital video recording technologies, broadcast video has been playing a more and more important role in our daily life. In this paper, we propose a novel news story segmentation scheme which can segment broadcast video into story units with multi-modal information fusion (MMIF) strategy. Compared with traditional methods, the proposed scheme extracts a wealth of semantic-level features including anchor person, topic caption, face, silence, acoustic change, audio keywords and textual content. Parallel to this, we make use of a multi-modal information fusion strategy for news story boundary characterization by joining these visual, audio and textual cues. Encouraging experimental results on News Vision dataset demonstrate the effectiveness of the proposed scheme.
  • Keywords
    digital video broadcasting; feature extraction; video signal processing; News Vision dataset; acoustic change extraction; anchor person extraction; audio keyword extraction; broadcast video segmentation; digital video recording technology; face extraction; high-speed network; multimodal information fusion strategy; news story boundary characterization; news story segmentation; semantic-level feature extraction; silence extraction; textual content extraction; topic caption extraction; Acoustics; Detectors; Face; Feature extraction; Hidden Markov models; Support vector machines; Visualization; Anchor Person Detection; Audio Detection; Broadcast Video; News Story Segmentation; Topic Caption Detection and Track;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288156
  • Filename
    6288156