Audio-Visual Self-Supervised Terrain Type Discovery for Mobile Platforms

10/13/2020
by   Akiyoshi Kurobe, et al.
0

The ability to both recognize and discover terrain characteristics is an important function required for many autonomous ground robots such as social robots, assistive robots, autonomous vehicles, and ground exploration robots. Recognizing and discovering terrain characteristics is challenging because similar terrains may have very different appearances (e.g., carpet comes in many colors), while terrains with very similar appearance may have very different physical properties (e.g. mulch versus dirt). In order to address the inherent ambiguity in vision-based terrain recognition and discovery, we propose a multi-modal self-supervised learning technique that switches between audio features extracted from a mic attached to the underside of a mobile platform and image features extracted by a camera on the platform to cluster terrain types. The terrain cluster labels are then used to train an image-based convolutional neural network to predict changes in terrain types. Through experiments, we demonstrate that the proposed self-supervised terrain type discovery method achieves over 80 baselines and suggests strong potential for assistive applications.

READ FULL TEXT

page 1

page 4

page 8

research
04/13/2020

Self-supervised Feature Learning by Cross-modality and Cross-view Correspondences

The success of supervised learning requires large-scale ground truth lab...
research
05/24/2017

Self-supervised learning of visual features through embedding images into text topic spaces

End-to-end training from scratch of current deep architectures for new c...
research
03/02/2022

Audio Self-supervised Learning: A Survey

Inspired by the humans' cognitive ability to generalise knowledge and sk...
research
03/21/2022

Towards Self-Supervised Gaze Estimation

Recent joint embedding-based self-supervised methods have surpassed stan...
research
05/11/2019

Self-Supervised Visual Place Recognition Learning in Mobile Robots

Place recognition is a critical component in robot navigation that enabl...
research
05/26/2021

Self-supervised Monocular Multi-robot Relative Localization with Efficient Deep Neural Networks

Relative localization is an important ability for multiple robots to per...
research
10/09/2021

Visually Exploring Multi-Purpose Audio Data

We analyse multi-purpose audio using tools to visualise similarities wit...

Please sign up or login with your details

Forgot password? Click here to reset