• DocumentCode
    3559453
  • Title

    Visual Lip Activity Detection and Speaker Detection Using Mouth Region Intensities

  • Author

    Siatras, Spyridon ; Nikolaidis, Nikos ; Krinidis, Michail ; Pitas, Ioannis

  • Author_Institution
    Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki
  • Volume
    19
  • Issue
    1
  • fYear
    2009
  • Firstpage
    133
  • Lastpage
    137
  • Abstract
    In this letter, we introduce a novel approach for lip activity detection and speaker detection, using solely visual information. The main idea in this work is to apply signal detection algorithms to a simple and easily extracted feature from the mouth region. We argue that the increased average value and standard deviation of the number of pixels with low intensities that the mouth region of a speaking person demonstrates can be used as visual cues for detecting visual speech. We then proceed in deriving a statistical algorithm that utilizes this fact for the efficient characterization of visual speech and silence in video sequences. Furthermore, we employ the lip activity detection method in order to determine the active speaker(s) in a multi-person environment.
  • Keywords
    face recognition; image recognition; object detection; statistics; mouth region intensity; multiperson environment; signal detection algorithms; speaker detection; speaking person; statistical algorithm; visual information; visual lip activity detection; visual speech detection; Speaker detection; visual speech detection;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • Conference_Location
    12/9/2008 12:00:00 AM
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2008.2009262
  • Filename
    4703545