Building an adaptive spoken language interface for perceptually grounded human-robot interaction

Author

Dominey, Peter Ford ; Boucher, Jean-David ; Inui, Toshio

Author_Institution

Inst. des Sci. Cognitives, CNRS, Bron, France

Volume

1

fYear

2004

fDate

10-12 Nov. 2004

Firstpage

168

Abstract

In previous research, we developed an integrated platform that combined visual scene interpretation with speech processing to provide input to a language-learning model. The system was demonstrated to learn a rich set of sentence-meaning mappings that could allow it to construct the appropriate meanings for new sentences in a generalization task. While this demonstrated potential promise, it fell short in several aspects of providing a useful human-robot interaction system. The current research addresses three of these shortcomings, demonstrating the natural extensibility of the platform architecture. First, the system must be able not only to understand what it hears, but also to describe what it sees and to interact with the human user. This is a natural extension of the knowledge of sentence-to-meaning mappings that is now applied in the inverse scene-to-sentence sense. Secondly, we extend the system´s ontology from physical events to include spatial relations. We show that spatial relations are naturally accommodated in the predicate argument representations for events. Finally, because the robot community is international the robot should be able to speak multiple languages, we thus demonstrate that the language model extends naturally to include both English and Japanese. Concrete results from a working interactive system are presented and future directions for adaptive human-robot interaction systems are outlined.

Keywords

generalisation (artificial intelligence); humanoid robots; intelligent robots; man-machine systems; natural language interfaces; ontologies (artificial intelligence); speech processing; adaptive spoken language interface; generalization task; inverse scene-to-sentence sense; language-learning model; perceptually grounded human-robot interaction; predicate argument representations; sentence-meaning mappings; spatial relations; speech processing; visual scene interpretation; Charge coupled devices; Charge-coupled image sensors; Humanoid robots; Humans; Image processing; Informatics; Layout; Natural languages; Robot sensing systems; Speech processing;

fLanguage

English

Publisher

ieee

Conference_Titel

Humanoid Robots, 2004 4th IEEE/RAS International Conference on

Print_ISBN

0-7803-8863-1

Type

conf

DOI

10.1109/ICHR.2004.1442121

Filename

1442121