Self-supervised learning through the eyes of a child

07/31/2020
by   A. Emin Orhan, et al.
0

Within months of birth, children have meaningful expectations about the world around them. How much of this early knowledge can be explained through generic learning mechanisms applied to sensory data, and how much of it requires more substantive innate inductive biases? Addressing this fundamental question in its full generality is currently infeasible, but we can hope to make real progress in more narrowly defined domains, such as the development of high-level visual categories, thanks to improvements in data collecting technology and recent progress in deep learning. In this paper, our goal is to achieve such progress by utilizing modern self-supervised deep learning methods and a recent longitudinal, egocentric video dataset recorded from the perspective of several young children (Sullivan et al., 2020). Our results demonstrate the emergence of powerful, high-level visual representations from developmentally realistic natural videos using generic self-supervised learning objectives.

READ FULL TEXT

page 3

page 5

page 6

page 8

page 9

page 14

page 15

page 16

research
02/16/2019

Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey

Large-scale labeled data are generally required to train deep neural net...
research
05/03/2019

Scaling and Benchmarking Self-Supervised Visual Representation Learning

Self-supervised learning aims to learn representations from the data its...
research
06/09/2021

Self-supervised Feature Enhancement: Applying Internal Pretext Task to Supervised Learning

Traditional self-supervised learning requires CNNs using external pretex...
research
05/24/2023

What can generic neural networks learn from a child's visual experience?

Young children develop sophisticated internal models of the world based ...
research
06/20/2022

Great Expectations: Unsupervised Inference of Suspense, Surprise and Salience in Storytelling

Stories interest us not because they are a sequence of mundane and predi...
research
06/08/2021

Interpretable agent communication from scratch (with a generic visual processor emerging on the side)

As deep networks begin to be deployed as autonomous agents, the issue of...
research
08/07/2023

Scaling may be all you need for achieving human-level object recognition capacity with human-like visual experience

This paper asks whether current self-supervised learning methods, if suf...

Please sign up or login with your details

Forgot password? Click here to reset