DocumentCode :
1835448
Title :
Feature analysis for automatic speechreading
Author :
Scanlon, P. ; Reilly, R.
Author_Institution :
Dept. of Electron. & Electr. Eng., Univ. Coll. Dublin, Ireland
fYear :
2001
fDate :
2001
Firstpage :
625
Lastpage :
630
Abstract :
Audio-visual automatic speech recognition (ASR) systems use visual information to enhance ASR performance in both clean and noisy environments. This paper investigates a number of different visual feature extraction methods. It was observed that, in visual speech recognition, the visual feature vector requires a base level of detail for improved recognition. Geometric feature extraction yields lower recognition rates than pixel-based methods because it loses characteristic speech information such as protrusion. Downsampling the images reduces visual recognition scores because image detail is lost. The role of dynamic features in improving recognition was also investigated. When the dimension of the feature vector is restricted (e.g. to 50), static features alone outperform a combination of static and dynamic features. This illustrates that a certain level of detail takes priority over dynamic information in visual speech recognition. Once this base level of detail is attained, dynamic features should then be able to improve the recognition rate.
Keywords :
audio-visual systems; feature extraction; image sampling; speech recognition; video signal processing; ASR systems; audio-visual automatic speech recognition systems; automatic speechreading; clean environment; dynamic features; feature analysis; geometric feature extraction; image downsampling; noisy environment; pixel based methods; recognition rate; static features; video processing; visual feature extraction methods; visual speech recognition; Automatic speech recognition; Data mining; Feature extraction; Humans; Image recognition; Pixel; Speech analysis; Speech enhancement; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Multimedia Signal Processing, 2001 IEEE Fourth Workshop on
Conference_Location :
Cannes
Print_ISBN :
0-7803-7025-2
Type :
conf
DOI :
10.1109/MMSP.2001.962802
Filename :
962802