DocumentCode
3358191
Title
Building an adaptive spoken language interface for perceptually grounded human-robot interaction
Author
Dominey, Peter Ford ; Boucher, Jean-David ; Inui, Toshio
Author_Institution
Inst. des Sci. Cognitives, CNRS, Bron, France
Volume
1
fYear
2004
fDate
10-12 Nov. 2004
Firstpage
168
Abstract
In previous research, we developed an integrated platform that combined visual scene interpretation with speech processing to provide input to a language-learning model. The system was demonstrated to learn a rich set of sentence-meaning mappings that could allow it to construct the appropriate meanings for new sentences in a generalization task. While this demonstrated potential promise, it fell short in several aspects of providing a useful human-robot interaction system. The current research addresses three of these shortcomings, demonstrating the natural extensibility of the platform architecture. First, the system must be able not only to understand what it hears, but also to describe what it sees and to interact with the human user. This is a natural extension of the knowledge of sentence-to-meaning mappings that is now applied in the inverse scene-to-sentence sense. Secondly, we extend the system´s ontology from physical events to include spatial relations. We show that spatial relations are naturally accommodated in the predicate argument representations for events. Finally, because the robot community is international the robot should be able to speak multiple languages, we thus demonstrate that the language model extends naturally to include both English and Japanese. Concrete results from a working interactive system are presented and future directions for adaptive human-robot interaction systems are outlined.
Keywords
generalisation (artificial intelligence); humanoid robots; intelligent robots; man-machine systems; natural language interfaces; ontologies (artificial intelligence); speech processing; adaptive spoken language interface; generalization task; inverse scene-to-sentence sense; language-learning model; perceptually grounded human-robot interaction; predicate argument representations; sentence-meaning mappings; spatial relations; speech processing; visual scene interpretation; Charge coupled devices; Charge-coupled image sensors; Humanoid robots; Humans; Image processing; Informatics; Layout; Natural languages; Robot sensing systems; Speech processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Humanoid Robots, 2004 4th IEEE/RAS International Conference on
Print_ISBN
0-7803-8863-1
Type
conf
DOI
10.1109/ICHR.2004.1442121
Filename
1442121
Link To Document