Adversarial Attacks on Variational Autoencoders

06/12/2018
by   George Gondim-Ribeiro, et al.
2

Adversarial attacks are malicious inputs that derail machine-learning models. We propose a scheme to attack autoencoders, as well as a quantitative evaluation framework that correlates well with the qualitative assessment of the attacks. We assess --- with statistically validated experiments --- the resistance to attacks of three variational autoencoders (simple, convolutional, and DRAW) in three datasets (MNIST, SVHN, CelebA), showing that both DRAW's recurrence and attention mechanism lead to better resistance. As autoencoders are proposed for compressing data --- a scenario in which their safety is paramount --- we expect more attention will be given to adversarial attacks on them.

READ FULL TEXT

page 4

page 5

page 8

research
03/10/2021

Diagnosing Vulnerability of Variational Auto-Encoders to Adversarial Attacks

In this work, we explore adversarial attacks on the Variational Autoenco...
research
02/19/2020

Variational Encoder-based Reliable Classification

Machine learning models provide statistically impressive results which m...
research
08/29/2022

Towards Adversarial Purification using Denoising AutoEncoders

With the rapid advancement and increased use of deep learning models in ...
research
05/31/2018

Resisting Adversarial Attacks using Gaussian Mixture Variational Autoencoders

Susceptibility of deep neural networks to adversarial attacks poses a ma...
research
01/28/2023

Selecting Models based on the Risk of Damage Caused by Adversarial Attacks

Regulation, legal liabilities, and societal concerns challenge the adopt...
research
03/18/2022

Defending Variational Autoencoders from Adversarial Attacks with MCMC

Variational autoencoders (VAEs) are deep generative models used in vario...
research
12/01/2016

Adversarial Images for Variational Autoencoders

We investigate adversarial attacks for autoencoders. We propose a proced...

Please sign up or login with your details

Forgot password? Click here to reset