• DocumentCode
    417174
  • Title

    A multimedia approach for audio segmentation in TV broadcast news

  • Author

    Perez-Freire, Luis ; Garcia-Mateo, Carmen

  • Author_Institution
    ETSI Telecomunicacion, Vigo Univ., Spain
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    The paper deals with the task of audio segmentation in TV broadcast news. A multimedia approach for this purpose, by means of audio and video processing, is proposed. Thus, the segmentation system is composed by two differentiated parts: one analyzes the audio stream, and is based on the well-known Bayesian information criterion (BIC), whereas the other part extracts useful information from the video stream to improve the performance of BIC. An investigation of parameters involved in BIC formulation is also accomplished, in order to achieve the best results possible in our experimental framework: the database Transcrigal-DB. The final system provides significative improvements in both overall performance and robustness.
  • Keywords
    Bayes methods; audio signal processing; multimedia computing; speech processing; speech recognition; video signal processing; Bayesian information criterion; TV broadcast news; audio processing; audio segmentation; automatic speech recognition; multimedia approach; nonspeech fragments; speech fragments; video processing; Automatic speech recognition; Bayesian methods; Data mining; Digital multimedia broadcasting; Loudspeakers; Speech recognition; Statistics; Streaming media; TV broadcasting; Telecommunication standards;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325999
  • Filename
    1325999