DocumentCode :
3441904
Title :
A fuzzy synchronization algorithm for bimodal speech signals
Author :
Pahor, V.
Author_Institution :
Dipt. di Elettrotecnica, Elettronica ed Inf., Trieste Univ., Italy
Volume :
3
fYear :
1998
fDate :
1998
Firstpage :
553
Abstract :
This paper describes a rule-based fuzzy system that estimates the relationship between acoustic and visual speech and uses this estimate for the synchronization of not aligned audio-visual signals. The relations are quantified by means of a set of rules, which associate typical mouth shapes (visual classes) to specific acoustic classes. The visual and acoustic classes are learned from training data using automatic clustering algorithms, relying on the clustering tendency of the extracted feature vectors and without performing phonetic recognition. Nevertheless, the categorical fuzzy structure of the system allows one to recognize, with some degree of uncertainty, the phonetic and visematic characteristics of the speech signal, making thus possible the integration of other sources of information in a human-like way
Keywords :
feature extraction; fuzzy set theory; pattern clustering; speech recognition; synchronisation; acoustic classes; acoustic speech; automatic clustering algorithms; bimodal speech signals; extracted feature vectors; fuzzy synchronization algorithm; lip synchronization; mouth shapes; phonetic characteristics; rule-based fuzzy system; training data; unaligned audio-visual signals; visematic characteristics; visual classes; visual speech; Automatic speech recognition; Character recognition; Clustering algorithms; Data mining; Feature extraction; Fuzzy systems; Mouth; Shape; Speech recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics, Circuits and Systems, 1998 IEEE International Conference on
Conference_Location :
Lisboa
Print_ISBN :
0-7803-5008-1
Type :
conf
DOI :
10.1109/ICECS.1998.814089
Filename :
814089
Link To Document :
بازگشت