DocumentCode
3149513
Title
Multi-modal information fusion for news story segmentation in broadcast video
Author
Feng, Bailan ; Ding, Peng ; Chen, Jiansong ; Bai, Jinfeng ; Xu, Su ; Xu, Bo
Author_Institution
Digital Content Technol. Res. Center, Inst. of Autom., Beijing, China
fYear
2012
fDate
25-30 March 2012
Firstpage
1417
Lastpage
1420
Abstract
With the fast development of high-speed network and digital video recording technologies, broadcast video has been playing a more and more important role in our daily life. In this paper, we propose a novel news story segmentation scheme which can segment broadcast video into story units with multi-modal information fusion (MMIF) strategy. Compared with traditional methods, the proposed scheme extracts a wealth of semantic-level features including anchor person, topic caption, face, silence, acoustic change, audio keywords and textual content. Parallel to this, we make use of a multi-modal information fusion strategy for news story boundary characterization by joining these visual, audio and textual cues. Encouraging experimental results on News Vision dataset demonstrate the effectiveness of the proposed scheme.
Keywords
digital video broadcasting; feature extraction; video signal processing; News Vision dataset; acoustic change extraction; anchor person extraction; audio keyword extraction; broadcast video segmentation; digital video recording technology; face extraction; high-speed network; multimodal information fusion strategy; news story boundary characterization; news story segmentation; semantic-level feature extraction; silence extraction; textual content extraction; topic caption extraction; Acoustics; Detectors; Face; Feature extraction; Hidden Markov models; Support vector machines; Visualization; Anchor Person Detection; Audio Detection; Broadcast Video; News Story Segmentation; Topic Caption Detection and Track;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288156
Filename
6288156
Link To Document