DocumentCode :
639775
Title :
SFAVD: Sharif Farsi audio visual database
Author :
Naraghi, Zeinab ; Jamzad, Mansour
fYear :
2013
fDate :
28-30 May 2013
Firstpage :
417
Lastpage :
421
Abstract :
With the increasing use of computers in everyday life, improved communication between machines and humans is needed. To communicate properly and to understand a human-like face rendered in a graphical environment, audio-visual applications such as lip reading, audio-visual speech recognition, and lip making must be implemented. The lack of a complete audio-visual database for these applications in Farsi led us to build a new, complete Farsi database called SFAVD. It is a unique audio-visual database that, in addition to considering the conceptual and speech structure of Farsi, accounts for the influence of speech on lip changes. The main goal of the database is a natural, human-like representation of sequences of lip movements in Farsi. SFAVD covers the most applicable words, all phones, diphones, and common syllables in sentences.
Keywords :
audio databases; audio-visual systems; face recognition; natural languages; speech recognition; visual databases; Farsi conceptual structure; Farsi speech structure; SFAVD; Sharif Farsi audio visual database; graphical environment; humankind face; lip making; lip reading; speech recognition; Face; Feature extraction; Speech; Vectors; Visual databases; Visualization; Audio Visual Database; Farsi language; lip movement animation;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Information and Knowledge Technology (IKT), 2013 5th Conference on
Conference_Location :
Shiraz
Print_ISBN :
978-1-4673-6489-8
Type :
conf
DOI :
10.1109/IKT.2013.6620103
Filename :
6620103