• DocumentCode
    647419
  • Title

    MonoFusion: Real-time 3D reconstruction of small scenes with a single web camera

  • Author

    Pradeep, V. ; Rhemann, Christoph ; Izadi, Shahram ; Zach, Christopher ; Bleyer, Michael ; Bathiche, Steven

  • Author_Institution
    Microsoft Corp. & Microsoft Res., Cambridge, UK
  • fYear
    2013
  • fDate
    1-4 Oct. 2013
  • Firstpage
    83
  • Lastpage
    88
  • Abstract
    MonoFusion allows a user to build dense 3D reconstructions of their environment in real-time, utilizing only a single, off-the-shelf web camera as the input sensor. The camera could be one already available in a tablet, phone, or a standalone device. No additional input hardware is required. This removes the need for power intensive active sensors that do not work robustly in natural outdoor lighting. Using the input stream of the camera we first estimate the 6DoF camera pose using a sparse tracking method. These poses are then used for efficient dense stereo matching between the input frame and a key frame (extracted previously). The resulting dense depth maps are directly fused into a voxel-based implicit model (using a computationally inexpensive method) and surfaces are extracted per frame. The system is able to recover from tracking failures as well as filter out geometrically inconsistent noise from the 3D reconstruction. Our method is both simple to implement and efficient, making such systems even more accessible. This paper details the algorithmic components that make up our system and a GPU implementation of our approach. Qualitative results demonstrate high quality reconstructions even visually comparable to active depth sensor-based systems such as KinectFusion.
  • Keywords
    cameras; graphics processing units; image reconstruction; image sensors; pose estimation; real-time systems; target tracking; 6DoF camera pose; GPU implementation; MonoFusion; natural outdoor lighting; off-the-shelf Web camera; power intensive active sensors; real-time 3D reconstruction; single Web camera; small scenes; sparse tracking method; voxel-based implicit model; Cameras; Image reconstruction; Optimization; Real-time systems; Robustness; Streaming media; Three-dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mixed and Augmented Reality (ISMAR), 2013 IEEE International Symposium on
  • Conference_Location
    Adelaide, SA
  • Type

    conf

  • DOI
    10.1109/ISMAR.2013.6671767
  • Filename
    6671767