DocumentCode
3226486
Title
European and American Audio-Visual Speech Recognition, Using SVM in Portuguese Language
Author
de Andrade Bresolin, A. ; Da Silva Freitas, Diamantino Rui ; Neto, Adrião Duarte Dória ; Alsina, Pablo Javier
Author_Institution
UTFPR, Technol. Fed. Univ. of the Parana, Curitiba, Brazil
fYear
2008
fDate
25-27 March 2008
Firstpage
511
Lastpage
511
Abstract
This paper proposes an audio-visual speech recognition system for European and American Portuguese using a support vector machine (SVM). The main objective of this work is to find a model that can be used for both languages. In addition, two new methods are presented for extracting the mouth region (ROI, region of interest) and the lip contour. Two audio features and four video features are used in the experiments. These features are combined in pairs, for a total of eight tests in the speaker-dependent case. Experiments were performed at various SNRs (0-40 dB) with additive white Gaussian noise. The results show that the proposed method can be used for both languages without any adaptation.
Keywords
AWGN; audio signal processing; linguistics; speech recognition; support vector machines; video signal processing; American Portuguese language; European language; SVM; additive white Gaussian noise; audio-visual speech recognition system; support vector machine; Acoustics; Data compression; Data engineering; Mel frequency cepstral coefficient; Mouth; Natural languages; Principal component analysis; Speech recognition; Support vector machines; Testing; Image Pattern Recognition; Neural Networks; Speech Recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Compression Conference, 2008. DCC 2008
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
978-0-7695-3121-2
Type
conf
DOI
10.1109/DCC.2008.32
Filename
4483338