• DocumentCode
    2017007
  • Title

    Multi-modal feature integration for story boundary detection in broadcast news

  • Author

    Lu, Mi-mi ; Xie, Lei ; Fu, Zhong-hua ; Jiang, Dong-mei ; Zhang, Yan-Ning

  • Author_Institution
    Shaanxi Provincial Key Lab. of Speech & Image Inf. Process., Northwestern Polytech. Univ., Xi´´an, China
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    420
  • Lastpage
    425
  • Abstract
    This paper investigates how to integrate multi-modal features for story boundary detection in broadcast news. The detection problem is formulated as a classification task, i.e., classifying each candidate into boundary/non-boundary based on a set of features. We use a diverse collection of features from text, audio and video modalities: lexical features capturing the semantic shifts of news topics and audio/video features reflecting the editorial rules of broadcast news. We perform a comprehensive evaluation on boundary detection performance for six popular classifiers, including decision tree (DT), Bayesian network (BN), naive Bayesian (NB) classifier, multi-layer peceptron (MLP), support vector machines (SVM) and maximum entropy (ME) classifier. Results show that BN and DT can generally achieve superior performances over other classifiers and BN offers the best F1-measure. Analysis of BN and DT reveals important inter-feature dependencies and complementarities that contribute significantly to the performance gain.
  • Keywords
    Bayes methods; belief networks; broadcasting; decision trees; feature extraction; multilayer perceptrons; multimedia communication; pattern classification; support vector machines; text analysis; Bayesian network; audio-video feature; broadcast news; classification task; decision tree; editorial rule; lexical feature; maximum entropy classifier; multilayer peceptron; multimodal feature integration; naive Bayesian classifier; semantic shift; story boundary detection; support vector machine; Bayesian methods; Face; Feature extraction; Niobium; Semantics; Speech; Support vector machines; feature integration; multi-modal; story boundary detection; story segmentation; topic detection and tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684854
  • Filename
    5684854