• DocumentCode
    663572
  • Title

    Grounding spatial relations for human-robot interaction

  • Author

    Guadarrama, Sergio ; Riano, Lorenzo ; Golland, Dave ; Gouhring, Daniel ; Yangqing Jia ; Klein, David ; Abbeel, Pieter ; Darrell, Trevor

  • Author_Institution
    EECS, Univ. of California at Berkeley, Berkeley, CA, USA
  • fYear
    2013
  • fDate
    3-7 Nov. 2013
  • Firstpage
    1640
  • Lastpage
    1647
  • Abstract
    We propose a system for human-robot interaction that learns both models for spatial prepositions and for object recognition. Our system grounds the meaning of an input sentence in terms of visual percepts coming from the robot´s sensors in order to send an appropriate command to the PR2 or respond to spatial queries. To perform this grounding, the system recognizes the objects in the scene, determines which spatial relations hold between those objects, and semantically parses the input sentence. The proposed system uses the visual and spatial information in conjunction with the semantic parse to interpret statements that refer to objects (nouns), their spatial relationships (prepositions), and to execute commands (actions). The semantic parse is inherently compositional, allowing the robot to understand complex commands that refer to multiple objects and relations such as: “Move the cup close to the robot to the area in front of the plate and behind the tea box”. Our system correctly parses 94% of the 210 online test sentences, correctly interprets 91% of the correctly parsed sentences, and correctly executes 89% of the correctly interpreted sentences.
  • Keywords
    human-robot interaction; image sensors; intelligent robots; natural language interfaces; object recognition; robot vision; visual perception; PR2; complex commands; human-robot interaction; input sentence; object recognition; online test sentences; robot sensors; semantic parse; sentence interpretation; spatial information; spatial prepositions; spatial queries; spatial relations grounding; spatial relationships; visual information; visual percepts; Adaptation models; Grounding; Natural languages; Robot sensing systems; Semantics; Three-dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
  • Conference_Location
    Tokyo
  • ISSN
    2153-0858
  • Type

    conf

  • DOI
    10.1109/IROS.2013.6696569
  • Filename
    6696569