Title :
Estimating acoustic-labial weights in connected speech recognition systems based on HMM
Author_Institution :
LIA, Avignon, France
Abstract :
Describes an approach for weighting the contribution of the acoustic and visual sources of information in a bimodal connected speech recognition system. We consider that a different acoustic-labial weight is attached to each recognition unit. The values of the weighting vector are optimised in order to minimise the error rate on a learning set. Experiments are performed on a two-speakers audiovisual database, composed of connected letters, with two different acoustic-labial speech recognition systems. For both speakers and both systems, the weights optimisation allows us to increase the recognition rate of our bimodal system
Keywords :
acoustic signal processing; hidden Markov models; image recognition; maximum likelihood estimation; speech recognition; acoustic-labial weights; bimodal connected speech recognition system; connected letters; error rate; learning set; two-speakers audiovisual database; Error analysis; Hidden Markov models; Information resources; Lips; Loudspeakers; Maximum likelihood estimation; Neural networks; Speech processing; Speech recognition; Visual databases;
Conference_Titel :
Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation., 1997 IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-4053-1
DOI :
10.1109/ICSMC.1997.625743