Title :
A fuzzy synchronization algorithm for bimodal speech signals
Author_Institution :
Dipt. di Elettrotecnica, Elettronica ed Inf., Trieste Univ., Italy
Abstract :
This paper describes a rule-based fuzzy system that estimates the relationship between acoustic and visual speech and uses this estimate for the synchronization of not aligned audio-visual signals. The relations are quantified by means of a set of rules, which associate typical mouth shapes (visual classes) to specific acoustic classes. The visual and acoustic classes are learned from training data using automatic clustering algorithms, relying on the clustering tendency of the extracted feature vectors and without performing phonetic recognition. Nevertheless, the categorical fuzzy structure of the system allows one to recognize, with some degree of uncertainty, the phonetic and visematic characteristics of the speech signal, making thus possible the integration of other sources of information in a human-like way
Keywords :
feature extraction; fuzzy set theory; pattern clustering; speech recognition; synchronisation; acoustic classes; acoustic speech; automatic clustering algorithms; bimodal speech signals; extracted feature vectors; fuzzy synchronization algorithm; lip synchronization; mouth shapes; phonetic characteristics; rule-based fuzzy system; training data; unaligned audio-visual signals; visematic characteristics; visual classes; visual speech; Automatic speech recognition; Character recognition; Clustering algorithms; Data mining; Feature extraction; Fuzzy systems; Mouth; Shape; Speech recognition; Training data;
Conference_Titel :
Electronics, Circuits and Systems, 1998 IEEE International Conference on
Conference_Location :
Lisboa
Print_ISBN :
0-7803-5008-1
DOI :
10.1109/ICECS.1998.814089