Title :
Unsupervised Stream Weight Computation in a Segmentaion Task: Application to Audio-Visual Speech Recognition
Author :
Sanchez-Soto, E. ; Daoudi, Khalid ; Potamianos, Alexandros
Author_Institution :
IRIT-CNRS, Toulouse, France
Abstract :
We propose an efficient algorithm for unsupervised stream weight estimation in a segmentation task. Our method uses only the information carried by the test signal and the trained models. The work is based on results presented previously for the classification problem where it is indicated that the optimal stream weights are inversely proportional to the single stream misclassification error. We approximate this error relation by the intra- and inter-class distance ratio over the measured class distributions. This approach is then generalized to the segmentation problem by computing the distances among all the concerned classes. The proposed unsupervised estimation algorithm is evaluated on a an audio-visual speech recognition task. The obtained performances are comparable to the supervised minimum error training approach, up to a certain SNR level.
Keywords :
audio-visual systems; least mean squares methods; speech recognition; video signal processing; audio-visual speech recognition; inter-class distance ratio; intra-class distance ratio; segmentation problem; unsupervised estimation algorithm; unsupervised stream weight computation; Audio recording; Automatic speech recognition; Information resources; Lips; Signal processing; Signal processing algorithms; Speech processing; Speech recognition; Streaming media; Testing; Audio-Visual Speech Recognition; Fusion Methods; Stream Weight Estimation;
Conference_Titel :
Signal Processing and Communications, 2007. ICSPC 2007. IEEE International Conference on
Conference_Location :
Dubai
Print_ISBN :
978-1-4244-1235-8
Electronic_ISBN :
978-1-4244-1236-5
DOI :
10.1109/ICSPC.2007.4728440