Towards Sustainable Self-supervised Learning

10/20/2022
by   Shanghua Gao, et al.
0

Although increasingly training-expensive, most self-supervised learning (SSL) models have repeatedly been trained from scratch but not fully utilized, since only a few SOTAs are employed for downstream tasks. In this work, we explore a sustainable SSL framework with two major challenges: i) learning a stronger new SSL model based on the existing pretrained SSL model, also called as "base" model, in a cost-friendly manner, ii) allowing the training of the new model to be compatible with various base models. We propose a Target-Enhanced Conditional (TEC) scheme which introduces two components to the existing mask-reconstruction based SSL. Firstly, we propose patch-relation enhanced targets which enhances the target given by base model and encourages the new model to learn semantic-relation knowledge from the base model by using incomplete inputs. This hardening and target-enhancing help the new model surpass the base model, since they enforce additional patch relation modeling to handle incomplete input. Secondly, we introduce a conditional adapter that adaptively adjusts new model prediction to align with the target of different base models. Extensive experimental results show that our TEC scheme can accelerate the learning speed, and also improve SOTA SSL base models, e.g., MAE and iBOT, taking an explorative step towards sustainable SSL.

READ FULL TEXT

page 2

page 3

research
10/17/2021

Self-Supervised Learning for Binary Networks by Joint Classifier Training

Despite the great success of self-supervised learning with large floatin...
research
10/13/2022

On Compressing Sequences for Self-Supervised Speech Models

Compressing self-supervised models has become increasingly necessary, as...
research
07/12/2022

Synergistic Self-supervised and Quantization Learning

With the success of self-supervised learning (SSL), it has become a main...
research
04/06/2022

Structure-aware Protein Self-supervised Learning

Protein representation learning methods have shown great potential to yi...
research
02/19/2023

Self-supervised Cloth Reconstruction via Action-conditioned Cloth Tracking

State estimation is one of the greatest challenges for cloth manipulatio...
research
06/01/2019

Patch Learning

There have been different strategies to improve the performance of a mac...
research
10/31/2022

Where to start? Analyzing the potential value of intermediate models

Previous studies observed that finetuned models may be better base model...

Please sign up or login with your details

Forgot password? Click here to reset