DocumentCode
663572
Title
Grounding spatial relations for human-robot interaction
Author
Guadarrama, Sergio ; Riano, Lorenzo ; Golland, Dave ; Gouhring, Daniel ; Yangqing Jia ; Klein, David ; Abbeel, Pieter ; Darrell, Trevor
Author_Institution
EECS, Univ. of California at Berkeley, Berkeley, CA, USA
fYear
2013
fDate
3-7 Nov. 2013
Firstpage
1640
Lastpage
1647
Abstract
We propose a system for human-robot interaction that learns both models for spatial prepositions and for object recognition. Our system grounds the meaning of an input sentence in terms of visual percepts coming from the robot´s sensors in order to send an appropriate command to the PR2 or respond to spatial queries. To perform this grounding, the system recognizes the objects in the scene, determines which spatial relations hold between those objects, and semantically parses the input sentence. The proposed system uses the visual and spatial information in conjunction with the semantic parse to interpret statements that refer to objects (nouns), their spatial relationships (prepositions), and to execute commands (actions). The semantic parse is inherently compositional, allowing the robot to understand complex commands that refer to multiple objects and relations such as: “Move the cup close to the robot to the area in front of the plate and behind the tea box”. Our system correctly parses 94% of the 210 online test sentences, correctly interprets 91% of the correctly parsed sentences, and correctly executes 89% of the correctly interpreted sentences.
Keywords
human-robot interaction; image sensors; intelligent robots; natural language interfaces; object recognition; robot vision; visual perception; PR2; complex commands; human-robot interaction; input sentence; object recognition; online test sentences; robot sensors; semantic parse; sentence interpretation; spatial information; spatial prepositions; spatial queries; spatial relations grounding; spatial relationships; visual information; visual percepts; Adaptation models; Grounding; Natural languages; Robot sensing systems; Semantics; Three-dimensional displays;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
Conference_Location
Tokyo
ISSN
2153-0858
Type
conf
DOI
10.1109/IROS.2013.6696569
Filename
6696569
Link To Document