• DocumentCode
    1396555
  • Title

    Visual voice activity detection with optical flow

  • Author

    Aubrey, Andrew J. ; Hicks, Y.A. ; Chambers, Jonathon A.

  • Author_Institution
    Geometric Computing and Computer Vision Group, School of Computer Science, Cardiff University, CF24 3AA, UK
  • Volume
    4
  • Issue
    6
  • fYear
    2010
  • fDate
    12/1/2010 12:00:00 AM
  • Firstpage
    463
  • Lastpage
    472
  • Abstract
    Current voice activity detection methods generally utilise only acoustic information. Therefore they are susceptible to false classification because of the presence of other acoustic sources such as another speaker or non-stationary noise. To address this issue, the authors propose a new method of voice activity detection using solely visual information in the form of a speaker's mouth region. Such video information is not affected by the acoustic environment. Simulations show that a high percentage correct silence detection (CSD) can be obtained with a low percentage false silence detection (FSD). Comparisons with two other visual voice activity detectors show the proposed method to be consistently more accurate, and on average yields a 4% improvement in CSD. The usefulness of the method is confirmed by applying it to a previously published audio-visual convolutive blind source separation algorithm, to increase the intelligibility of a speaker.
  • fLanguage
    English
  • Journal_Title
    Image Processing, IET
  • Publisher
    IET
  • ISSN
    1751-9659
  • Type

    jour

  • DOI
    10.1049/iet-ipr.2009.0042
  • Filename
    5659510