Evaluating The Robustness of Self-Supervised Representations to Background/Foreground Removal

06/02/2023
by   Xavier F. Cadet, et al.
0

Despite impressive empirical advances of SSL in solving various tasks, the problem of understanding and characterizing SSL representations learned from input data remains relatively under-explored. We provide a comparative analysis of how the representations produced by SSL models differ when masking parts of the input. Specifically, we considered state-of-the-art SSL pretrained models, such as DINOv2, MAE, and SwaV, and analyzed changes at the representation levels across 4 Image Classification datasets. First, we generate variations of the datasets by applying foreground and background segmentation. Then, we conduct statistical analysis using Canonical Correlation Analysis (CCA) and Centered Kernel Alignment (CKA) to evaluate the robustness of the representations learned in SSL models. Empirically, we show that not all models lead to representations that separate foreground, background, and complete images. Furthermore, we test different masking strategies by occluding the center regions of the images to address cases where foreground and background are difficult. For example, the DTD dataset that focuses on texture rather specific objects.

READ FULL TEXT

page 4

page 5

research
11/18/2022

Invariant Learning via Diffusion Dreamed Distribution Shifts

Though the background is an important signal for image classification, o...
research
06/02/2022

Optimizing Relevance Maps of Vision Transformers Improves Robustness

It has been observed that visual classification models often rely mostly...
research
11/21/2020

Contextual Interference Reduction by Selective Fine-Tuning of Neural Networks

Feature disentanglement of the foreground target objects and the backgro...
research
07/19/2011

Weakly Supervised Learning of Foreground-Background Segmentation using Masked RBMs

We propose an extension of the Restricted Boltzmann Machine (RBM) that a...
research
10/07/2021

Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction

Most existing human matting algorithms tried to separate pure human-only...
research
07/16/2021

Rectifying the Shortcut Learning of Background: Shared Object Concentration for Few-Shot Image Recognition

Few-Shot image classification aims to utilize pretrained knowledge learn...
research
07/04/2023

Mitigating Bias: Enhancing Image Classification by Improving Model Explanations

Deep learning models have demonstrated remarkable capabilities in learni...

Please sign up or login with your details

Forgot password? Click here to reset