Title :
Multi-modal feature integration for story boundary detection in broadcast news
Author :
Lu, Mi-mi ; Xie, Lei ; Fu, Zhong-hua ; Jiang, Dong-mei ; Zhang, Yan-Ning
Author_Institution :
Shaanxi Provincial Key Lab. of Speech & Image Inf. Process., Northwestern Polytech. Univ., Xi´´an, China
fDate :
Nov. 29 2010-Dec. 3 2010
Abstract :
This paper investigates how to integrate multi-modal features for story boundary detection in broadcast news. The detection problem is formulated as a classification task, i.e., classifying each candidate into boundary/non-boundary based on a set of features. We use a diverse collection of features from text, audio and video modalities: lexical features capturing the semantic shifts of news topics and audio/video features reflecting the editorial rules of broadcast news. We perform a comprehensive evaluation on boundary detection performance for six popular classifiers, including decision tree (DT), Bayesian network (BN), naive Bayesian (NB) classifier, multi-layer peceptron (MLP), support vector machines (SVM) and maximum entropy (ME) classifier. Results show that BN and DT can generally achieve superior performances over other classifiers and BN offers the best F1-measure. Analysis of BN and DT reveals important inter-feature dependencies and complementarities that contribute significantly to the performance gain.
Keywords :
Bayes methods; belief networks; broadcasting; decision trees; feature extraction; multilayer perceptrons; multimedia communication; pattern classification; support vector machines; text analysis; Bayesian network; audio-video feature; broadcast news; classification task; decision tree; editorial rule; lexical feature; maximum entropy classifier; multilayer peceptron; multimodal feature integration; naive Bayesian classifier; semantic shift; story boundary detection; support vector machine; Bayesian methods; Face; Feature extraction; Niobium; Semantics; Speech; Support vector machines; feature integration; multi-modal; story boundary detection; story segmentation; topic detection and tracking;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684854