• DocumentCode
    250709
  • Title

    Multimodal learning for autonomous underwater vehicles from visual and bathymetric data

  • Author

    Rao, Dantam ; De Deuge, Mark ; Nourani-Vatani, Navid ; Douillard, Bertrand ; Williams, Stefan B. ; Pizarro, Oscar

  • Author_Institution
    Australian Centre for Field Robot., Univ. of Sydney, Sydney, NSW, Australia
  • fYear
    2014
  • fDate
    May 31 2014-June 7 2014
  • Firstpage
    3819
  • Lastpage
    3825
  • Abstract
    Autonomous Underwater Vehicles (AUVs) gather large volumes of visual imagery, which can help monitor marine ecosystems and plan future surveys. One key task in marine ecology is benthic habitat mapping, the classification of large regions of the ocean floor into broad habitat categories. Since visual data only covers a small fraction of the ocean floor, traditional habitat mapping is performed using shipborne acoustic multi-beam data, with visual data as ground truth. However, given the high resolution and rich textural cues in visual data, an ideal approach should explicitly utilise visual features in the classification process. To this end, we propose a multimodal model which utilises visual data and shipborne multi-beam bathymetry to perform both classification and sampling tasks. Our algorithm learns the relationship between both modalities, but is also effective when visual data is missing. Our results suggest that by performing multimodal learning, classification performance is improved in scenarios where visual data is unavailable, such as the habitat mapping scenario. We also demonstrate empirically that the model is able to perform generative tasks, producing plausible samples from the underlying data-generating distribution.
  • Keywords
    autonomous underwater vehicles; feature extraction; image classification; image sampling; image texture; learning (artificial intelligence); marine engineering; mobile robots; oceanographic techniques; robot vision; AUV; autonomous underwater vehicles; bathymetric data; benthic habitat mapping; marine ecology; marine ecosystem monitoring; multimodal learning; ocean floor region classification; sampling task; shipborne acoustic multibeam data; shipborne multibeam bathymetry utilization; textural cues; visual data utilization; visual feature utilization; visual imagery; Correlation; Encoding; Image reconstruction; Oceans; Training; Vectors; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation (ICRA), 2014 IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Type

    conf

  • DOI
    10.1109/ICRA.2014.6907413
  • Filename
    6907413