• DocumentCode
    663669
  • Title

    Multimodal blending for high-accuracy instance recognition

  • Author

    Ziang Xie ; Singh, Ashutosh ; Uang, Justin ; Narayan, Karthik S. ; Abbeel, Pieter

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Univ. of California, Berkeley, Berkeley, CA, USA
  • fYear
    2013
  • fDate
    3-7 Nov. 2013
  • Firstpage
    2214
  • Lastpage
    2221
  • Abstract
    Despite the rich information provided by sensors such as the Microsoft Kinect in the robotic perception setting, the problem of detecting object instances remains unsolved, even in the tabletop setting, where segmentation is greatly simplified. Existing object detection systems often focus on textured objects, for which local feature descriptors can be used to reliably obtain correspondences between different views of the same object. We examine the benefits of dense feature extraction and multimodal features for improving the accuracy and robustness of an instance recognition system. By combining multiple modalities and blending their scores through an ensemble-based method in order to generate our final object hypotheses, we obtain significant improvements over previously published results on two RGB-D datasets. On the Challenge dataset, our method results in only one missed detection (achieving 100% precision and 99.77% recall). On the Willow dataset, we also make significant gains on the prior state of the art (achieving 98.28% precision and 87.78% recall), resulting in an increase in F-score from 0.8092 to 0.9273.
  • Keywords
    feature extraction; image recognition; object detection; robot vision; visual perception; F-score; Microsoft Kinect; RGB-D datasets; Willow dataset; ensemble-based method; feature extraction; high-accuracy instance recognition; instance recognition system; missed detection; multimodal blending; multimodal features; object detection; object hypotheses; robotic perception; sensors; Feature extraction; Image color analysis; Pipelines; Reliability; Three-dimensional displays; Training; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
  • Conference_Location
    Tokyo
  • ISSN
    2153-0858
  • Type

    conf

  • DOI
    10.1109/IROS.2013.6696666
  • Filename
    6696666