DocumentCode
2875240
Title
Audio-visual speech recognition in a Portuguese language based application
Author
Pera, Vitor ; Sá, Filipe ; Afonso, Pedro ; Ferreira, Ricardo
Author_Institution
Fac. of Eng., Porto Univ., Portugal
Volume
2
fYear
2003
fDate
10-12 Dec. 2003
Firstpage
688
Abstract
We present in this article experimental results obtained with an automatic speech recogniser developed for a speaker dependent and continuous speech alphanumeric recognition application based on the European Portuguese language. An audio-visual speech recognition approach was followed to design and build this system. Besides the well known complementary between the acoustic and the visual information for speech recognition purposes, the visual features are obviously immune to any acoustic disturbance, thus making the system more robust in acoustically contaminated environments. The results presented clearly show that the inclusion of a video stream, using a multi-stream decoding formalism, decreases the word error rate in approximately 56%rel over a wide range of acoustical signal-noise ratio.
Keywords
acoustic signal processing; audio signal processing; decoding; feature extraction; natural languages; speech recognition; video signal processing; Portuguese language; acoustic information; acoustical signal to noise ratio; audio visual speech recognition; automatic speech recogniser; continuous speech alphanumeric recognition; multistream decoding; speaker dependent recognition; video stream; visual information; word error rate; Audio databases; Automatic speech recognition; Decoding; Natural languages; Particle separators; Robustness; Speech recognition; Streaming media; Video compression; Visual databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Industrial Technology, 2003 IEEE International Conference on
Print_ISBN
0-7803-7852-0
Type
conf
DOI
10.1109/ICIT.2003.1290738
Filename
1290738
Link To Document