• DocumentCode
    863614
  • Title

    Automatic Meeting Segmentation Using Dynamic Bayesian Networks

  • Author

    Dielmann, Alfred ; Renals, Steve

  • Author_Institution
    Centre for Speech Technol. Res., Edinburgh Univ.
  • Volume
    9
  • Issue
    1
  • fYear
    2007
  • Firstpage
    25
  • Lastpage
    36
  • Abstract
    Multiparty meetings are a ubiquitous feature of organizations, and there are considerable economic benefits that would arise from their automatic analysis and structuring. In this paper, we are concerned with the segmentation and structuring of meetings (recorded using multiple cameras and microphones) into sequences of group meeting actions such as monologue, discussion and presentation. We outline four families of multimodal features based on speaker turns, lexical transcription, prosody, and visual motion that are extracted from the raw audio and video recordings. We relate these low-level features to more complex group behaviors using a multistream modelling framework based on multi-stream dynamic Bayesian networks (DBNs). This results in an effective approach to the segmentation problem, resulting in an action error rate of 12.2%, compared with 43% using an approach based on hidden Markov models. Moreover, the multistream DBN developed here leaves scope for many further improvements and extensions
  • Keywords
    audio recording; audio signal processing; belief networks; feature extraction; multimedia computing; user interfaces; video recording; video signal processing; automatic meeting segmentation; dynamic Bayesian networks; lexical transcription; multimodal feature extraction; multiparty meetings; multistream DBN; multistream modelling framework; prosody; raw audio recording; raw video recording; speaker turns; visual motion; Audio recording; Automatic speech recognition; Bayesian methods; Cameras; Hidden Markov models; Meeting planning; Microphones; Minutes; Streaming media; Video recording; Multimodal; meeting actions; multistream;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2006.886337
  • Filename
    4032608