• DocumentCode
    1749855
  • Title

    New approaches to audio-visual segmentation of TV news for automatic topic retrieval

  • Author

    Iurgel, U. ; Meermeier, R. ; Eickeler, S. ; Rigoll, G.

  • Author_Institution
    Dept. of Comput. Sci., Gerhard-Mercator-University Duisburg, Germany
  • Volume
    3
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    1397
  • Abstract
    This paper presents two new real-time approaches to segmentation of TV news shows into topics. The goal of this research work is the high precision retrieval of topics from TV news. For that purpose, the detection of correct topic boundaries is of great importance. We introduce a stochastic and a rule-based topic model based on HMM. The former combines features from the visual as well as from the audio channel of the news show, whereas the latter uses the video channel only. They are compared to the detection of topics using only the audio channel, which is common for many other approaches. The paper contains the following innovations: (1) the detected segment boundaries correspond directly to topics and not to video or audio cuts, as in most other segmentation methods; (2) an advanced stochastic topic model is introduced that uses audio as well as video features; (3) the introduced HMM-based approaches both outperform the audio-based approach. One algorithm has a very good topic boundary detection rate, whereas the other minimizes the number of wrongly inserted boundaries without missing too many real boundaries
  • Keywords
    audio signal processing; content-based retrieval; feature extraction; hidden Markov models; image segmentation; multimedia databases; real-time systems; video signal processing; HMM; TV news; audio channel; audio-visual segmentation; automatic topic retrieval; real-time segmentation; rule-based topic model; stochastic model; topic boundary detection; video channel; visual features; Automatic speech recognition; Computer science; Hidden Markov models; Layout; Speech recognition; Stochastic processes; Streaming media; TV; Technological innovation; Weather forecasting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.941190
  • Filename
    941190