Assessing the Robustness of Visual Question Answering

11/30/2019
by Jia-Hong Huang, et al.

Deep neural networks play an essential role in Visual Question Answering (VQA). Until recently, research has focused mainly on their accuracy. There is now a trend toward assessing the robustness of these models against adversarial attacks by evaluating their accuracy under increasing levels of noise in the inputs. In VQA, an attack can target the image and/or the query question, dubbed the main question, yet this aspect of VQA lacks proper analysis. In this work, we propose a new method that uses semantically related questions, dubbed basic questions, as noise to evaluate the robustness of VQA models. We hypothesize that as the similarity of a basic question to the main question decreases, the level of noise increases. To generate a reasonable noise level for a given main question, we rank a pool of basic questions by their similarity to the main question, and we cast this ranking problem as a LASSO optimization problem. We also propose a novel robustness measure, Rscore, and two large-scale basic question datasets in order to standardize robustness analysis of VQA models. The experimental results demonstrate that the proposed evaluation method can effectively analyze the robustness of VQA models. To foster VQA research, we will publish our proposed datasets.
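The LASSO-based ranking described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how questions are encoded or how the optimization is solved, so everything here is an assumption made for illustration: each question is taken to be a fixed-length embedding vector, the columns of `A` hold the basic-question embeddings, `b` is the main-question embedding, and the problem min 0.5*||Ax - b||^2 + lam*||x||_1 is solved with plain ISTA (proximal gradient descent). The names `lasso_ista` and `rank_basic_questions` are hypothetical, and ranking by descending weight is one plausible reading of "similarity", not a confirmed detail of the paper.

```python
import numpy as np


def soft_threshold(v, t):
    """Proximal operator of the L1 norm (element-wise shrinkage)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def lasso_ista(A, b, lam, n_iter=500):
    """Approximately minimize 0.5*||A x - b||^2 + lam*||x||_1 with ISTA."""
    # Step size 1/L, where L is the Lipschitz constant of the gradient
    # of the smooth term (squared spectral norm of A).
    L = np.linalg.norm(A, 2) ** 2
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - b)
        x = soft_threshold(x - grad / L, lam / L)
    return x


def rank_basic_questions(A, b, lam=0.1):
    """Return basic-question indices sorted by descending LASSO weight.

    Assumption: a larger weight means the basic question is more similar
    to the main question, and therefore injects less noise.
    """
    x = lasso_ista(A, b, lam)
    order = np.argsort(-x, kind="stable")
    return order, x


if __name__ == "__main__":
    # Toy data: four hypothetical basic questions with orthonormal embeddings.
    A = np.eye(4)
    b = np.array([0.1, 0.9, 0.3, 0.0])  # hypothetical main-question embedding
    order, weights = rank_basic_questions(A, b, lam=0.1)
    print("ranking:", order)
    print("weights:", weights)
```

In this toy setup the sparsity penalty drives the weights of weakly related basic questions to zero, so only a few candidates receive a nonzero score. Under the abstract's hypothesis, questions further down the ranking would correspond to higher noise levels.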

research
11/16/2017

A Novel Framework for Robustness Analysis of Visual QA Models

Deep neural networks have been playing an essential role in many compute...
research
04/06/2023

Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions

Deep neural networks have been critical in the task of Visual Question A...
research
06/11/2020

Exploring Weaknesses of VQA Models through Attribution Driven Insights

Deep Neural Networks have been successfully used for the task of Visual ...
research
10/11/2021

Beyond Accuracy: A Consolidated Tool for Visual Question Answering Benchmarking

On the way towards general Visual Question Answering (VQA) systems that ...
research
12/30/2021

VisQA: Quantifying Information Visualisation Recallability via Question Answering

Despite its importance for assessing the effectiveness of communicating ...
research
12/01/2020

Open-Ended Multi-Modal Relational Reason for Video Question Answering

People with visual impairments urgently need helps, not only on the basi...
research
06/08/2021

Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions

Deep learning algorithms have shown promising results in visual question...
