• DocumentCode
    3429977
  • Title
    Speech Activity Detection with Lip Movement Image Signals
  • Author
    Lee, Soo-jong; Park, Jun; Kim, Eung-Kyeu

  • Author_Institution
    ETRI, Daejeon
  • fYear
    2007
  • fDate
    22-24 Aug. 2007
  • Firstpage
    403
  • Lastpage
    406
  • Abstract
    This paper describes an attempt to correlate visual lip movement information, acquired via a camera, with speech audio acquired via a microphone from a human speaker, so that audio produced by external noise is not misrecognized as speech from that speaker. Face images of the speaker are captured with a PC camera and separated into images that indicate lip movement and images that do not. The lip movement image signal data is stored in shared memory, where it is shared with the speech recognition process. The speech activity detection process, a pre-processing step of sound recognition, then analyzes this data. We combined a speech recognition processor with an image recognizer, and the interworking function operated successfully at a rate of 99.3%.
  • Keywords
    computer vision; image motion analysis; image recognition; object recognition; speech recognition; PC camera; face images; human speaker; lip movement image signals; lip movement visual information; microphone; sound recognition; speech activity detection; speech audio information; Acoustic noise; Cameras; Data analysis; Face; Humans; Microphones; Signal processing; Speech analysis; Speech enhancement
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications, Computers and Signal Processing, 2007. PacRim 2007. IEEE Pacific Rim Conference on
  • Conference_Location
    Victoria, BC
  • Print_ISBN
    978-1-4244-1189-4
  • Electronic_ISBN
    1-4244-1190-4
  • Type
    conf
  • DOI
    10.1109/PACRIM.2007.4313259
  • Filename
    4313259
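
The abstract describes an image process that publishes a lip movement signal through shared memory and a speech activity detection step that consults it before recognition. The record gives no implementation details, so the following is only a minimal sketch under stated assumptions: the one-byte flag layout, the mean-energy test, the threshold value, and the function names `write_lip_flag`, `read_lip_flag`, `frame_energy`, and `is_speech` are all hypothetical and are not taken from the paper.

```python
# Illustrative sketch only; not the authors' method.
# Assumption: the image process writes a single byte (1 = lips moving,
# 0 = lips still) into a buffer that the audio process can read.


def write_lip_flag(buf, moving):
    """Image-process side: publish the latest lip movement decision."""
    buf[0] = 1 if moving else 0


def read_lip_flag(buf):
    """Audio-process side: read the latest lip movement decision."""
    return buf[0] == 1


def frame_energy(samples):
    """Mean squared amplitude of one audio frame (0.0 for an empty frame)."""
    if not samples:
        return 0.0
    return sum(s * s for s in samples) / len(samples)


def is_speech(samples, lip_moving, energy_threshold=0.01):
    """Accept a frame as speech only if the lips are moving AND the audio
    energy exceeds a threshold (hypothetical value), so that loud external
    noise without lip movement is rejected."""
    return lip_moving and frame_energy(samples) > energy_threshold
```

Any writable byte buffer works as `buf`; for two real processes, the `buf` attribute of a `multiprocessing.shared_memory.SharedMemory` block (Python 3.8+) could be passed instead of a local `bytearray`.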