DocumentCode :
304822
Title :
Talking about 3D scenes: integration of image and speech understanding in a hybrid distributed system
Author :
Socher, Gudrun ; Sagerer, Gerhard ; Kummert, Franz ; Fuhr, Thomas
Author_Institution :
Bielefeld Univ., Germany
Volume :
1
fYear :
1996
fDate :
16-19 Sep 1996
Firstpage :
809
Abstract :
We present a hybrid system that integrates speech and image understanding. Given spoken references, it is able to identify objects of a 3D scene perceived via a stereo camera. Central to our approach is the extraction of qualitative object features and spatial scene properties from acoustic and visual data. The interaction of the understanding processes is performed using a procedural semantic network that interfaces with signal recognition and reconstruction modules, thus integrating semantic, neural and Bayesian networks and Hidden Markov Models
Keywords :
Bayes methods; computer vision; feature extraction; hidden Markov models; neural nets; semantic networks; speech recognition; visual databases; 3D scenes; Bayesian networks; Hidden Markov Models; hybrid distributed system; image understanding; neural nets; procedural semantic network; qualitative object features extraction; semantic nets; spatial scene properties; speech understanding; spoken references; stereo camera; Cameras; Cognitive science; Data mining; Humans; Image reconstruction; Knowledge representation; Layout; Signal processing; Speech; Stereo vision;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 1996. Proceedings., International Conference on
Conference_Location :
Lausanne
Print_ISBN :
0-7803-3259-8
Type :
conf
DOI :
10.1109/ICIP.1996.561028
Filename :
561028
Link To Document :
بازگشت