DocumentCode :
1274707
Title :
Multimodal integration-a statistical view
Author :
Wu, Lizhong ; Oviatt, Sharon L. ; Cohen, Philip R.
Author_Institution :
Dept. of Comput. Sci. & Eng., Oregon Graduate Inst. of Sci. & Technol., Portland, OR, USA
Volume :
1
Issue :
4
fYear :
1999
fDate :
12/1/1999 12:00:00 AM
Firstpage :
334
Lastpage :
341
Abstract :
We present a statistical approach to developing multimodal recognition systems and, in particular, to integrating the posterior probabilities of parallel input signals involved in the multimodal system. We first identify the primary factors that influence multimodal recognition performance by evaluating the multimodal recognition probabilities. We then develop two techniques, an estimate approach and a learning approach, which are designed to optimize accurate recognition during the multimodal integration process. We evaluate these methods using Quickset, a speech/gesture multimodal system, and report evaluation results based on an empirical corpus collected with Quickset. From an architectural perspective, the integration technique presented offers enhanced robustness. It also is premised on more realistic assumptions than previous multimodal systems using semantic fusion. From a methodological standpoint, the evaluation techniques that we describe provide a valuable tool for evaluating multimodal systems
Keywords :
gesture recognition; learning (artificial intelligence); probability; speech recognition; statistical analysis; Quickset; gesture recognition; learning; multimodal recognition systems; parallel input signals; probability; speech gesture multimodal system; speech recognition; statistical approach; Computer science; Decision making; Design optimization; Hidden Markov models; Humans; Probability; Robustness; Speech analysis; Speech recognition; Uncertainty;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/6046.807953
Filename :
807953
Link To Document :
بازگشت