Title :
A real-time lipreading LSI for word recognition
Author :
Nakamura, Kazuhiro ; Murakami, Noriaki ; Takagi, Kazuyoshi ; Takagi, Naofumi
Author_Institution :
Center for Inf. Media Studies, Nagoya Univ., Japan
Abstract :
In the paper, we present a real-time lip-reading LSI for recognizing spoken words from lip movement. The LSI recognizes up to 8 words based on the hidden Markov model (HMM). The LSI accepts the 256×256 8-bit gray-scale images from a camera, and outputs the 3-bit symbol code of words for 43 images (corresponding to 1.53 s). We present a lip-reading algorithm optimized for hardware implementation. We have designed the lip-reading LSI and fabricated a 4.9 mm×4.9 mm chip using 0.35 μm process via VDEC Rohm. The LSI performs real-time recognition at 40 MHz operation.
Keywords :
circuit optimisation; hidden Markov models; image recognition; integrated circuit design; integrated circuit testing; large scale integration; speech recognition equipment; VDEC Rohm process; camera gray-scale images; hardware implementation; hidden Markov model; lip movement; optimized lip-reading algorithm; real-time lip-reading LSI; real-time recognition; spoken word recognition; symbol code output; Cameras; Gray-scale; Hardware design languages; Hidden Markov models; Humans; Image edge detection; Image recognition; Large scale integration; Speech recognition; Vector quantization;
Conference_Titel :
ASIC, 2002. Proceedings. 2002 IEEE Asia-Pacific Conference on
Print_ISBN :
0-7803-7363-4
DOI :
10.1109/APASIC.2002.1031592