Mark My Words: Dangers of Watermarked Images in ImageNet

03/09/2023
by Kirill Bykov, et al.

The utilization of pre-trained networks, especially those trained on ImageNet, has become a common practice in Computer Vision. However, prior research has indicated that a significant number of images in the ImageNet dataset contain watermarks, making pre-trained networks susceptible to learning artifacts such as watermark patterns within their latent spaces. In this paper, we aim to assess the extent to which popular pre-trained architectures display such behavior and to determine which classes are most affected. Additionally, we examine the impact of watermarks on the extracted features. Contrary to the popular belief that the Chinese logographic watermarks impact the "carton" class only, our analysis reveals that a variety of ImageNet classes, such as "monitor", "broom", "apron" and "safe" rely on spurious correlations. Finally, we propose a simple approach to mitigate this issue in fine-tuned networks by ignoring the encodings from the feature-extractor layer of ImageNet pre-trained networks that are most susceptible to watermark imprints.
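The mitigation described in the last sentence of the abstract can be illustrated with a minimal sketch. The abstract does not say how susceptibility to watermarks is measured, so the score used here (the mean absolute change of each feature dimension between an image and a watermarked copy of it) is an assumption, as are the function names, the toy feature vectors, and the choice to zero out the ignored dimensions. It is not the authors' implementation.

```python
from typing import List, Sequence


def watermark_sensitivity(clean: Sequence[Sequence[float]],
                          marked: Sequence[Sequence[float]]) -> List[float]:
    """Mean absolute change of each feature dimension when a watermark is added.

    Assumption: clean[i] and marked[i] are feature vectors of the same image
    without and with a watermark. This score is illustrative only.
    """
    n_dims = len(clean[0])
    scores = [0.0] * n_dims
    for c_row, m_row in zip(clean, marked):
        for j in range(n_dims):
            scores[j] += abs(m_row[j] - c_row[j])
    return [s / len(clean) for s in scores]


def most_sensitive_dims(scores: Sequence[float], k: int) -> List[int]:
    """Indices of the k dimensions with the highest sensitivity score."""
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])


def mask_features(features: Sequence[float], ignored: Sequence[int]) -> List[float]:
    """Zero out the ignored dimensions before the features reach the new head."""
    drop = set(ignored)
    return [0.0 if j in drop else v for j, v in enumerate(features)]


# Toy data (made up): 3 images, 4 feature dimensions.
# Dimension 2 reacts strongly to the watermark; the others barely move.
clean = [[0.1, 0.5, 0.0, 0.3], [0.2, 0.4, 0.1, 0.3], [0.0, 0.6, 0.0, 0.2]]
marked = [[0.1, 0.5, 0.9, 0.3], [0.2, 0.5, 1.0, 0.3], [0.0, 0.6, 0.8, 0.2]]

scores = watermark_sensitivity(clean, marked)
ignored = most_sensitive_dims(scores, k=1)
masked = mask_features(marked[0], ignored)
```

In a real fine-tuning setup the feature vectors would come from the penultimate layer of an ImageNet pre-trained network, and the number of ignored dimensions `k` would have to be chosen empirically.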


