• DocumentCode
    3632732
  • Title

    Graphical models for multi-modal automatic video editing in meetings

  • Author

    Benedikt Hornler;Dejan Arsic;Bjorn Schuller;Gerhard Rigoll

  • Author_Institution
    Technische Universit?t M?nchen, Institute for Human-Machine-Communication, 80290 Munich, Germany
  • fYear
    2009
  • fDate
    7/1/2009 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    In this work we present a multi-modal video editing system for meetings, which uses graphical models for the segmentation and classification of the video modes. The task of video editing is about selecting the camera, that represents the meeting in the best way out of various available cameras. Therefore a new training structure for graphical models was developed. This is necessary for the learning of boundaries combined with the video mode itself. All developed and known decoding structures can be easily connected for an EM-training to our training structure. The achieved results of the system are state of the art.
  • Keywords
    "Graphical models","Videoconference","Cameras","Streaming media","Video recording","Computer displays","Pattern recognition","Decoding","Machine learning","Man machine systems"
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing, 2009 16th International Conference on
  • ISSN
    1546-1874
  • Print_ISBN
    978-1-4244-3297-4
  • Electronic_ISBN
    2165-3577
  • Type

    conf

  • DOI
    10.1109/ICDSP.2009.5201117
  • Filename
    5201117