VQA with no questions-answers training

11/20/2018
by   Ben Zion Vatashsky, et al.

Methods for teaching machines to answer visual questions have made significant progress in the last few years. Although they demonstrate impressive results on particular datasets, these methods lack some important human capabilities, including integrating new visual classes and concepts in a modular manner, providing explanations for the answer, and handling new domains without new examples. In this paper we present a system that achieves state-of-the-art results on the CLEVR dataset without any questions-answers training, utilizes real visual estimators, and explains the answer. The system includes a question representation stage followed by an answering procedure, which invokes an extendable set of visual estimators. It can explain its answers, including its failures, and provide alternatives to negative answers. The scheme builds upon a recently proposed framework, with extensions that allow the system to deal with novel domains without relying on training examples.
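
The two-stage design described in the abstract (a question representation, then an answering procedure that calls an extendable set of visual estimators and reports an explanation) can be illustrated with a toy sketch. This is not the authors' implementation: the names `QuestionNode`, `ESTIMATORS`, `register`, and `answer_exists` are hypothetical, the scene is a hand-written list of attribute dictionaries rather than the output of real visual estimators, and only a single "is there a ...?" question type is covered.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical stand-in for detected objects; a real system would obtain
# these attributes by running visual estimators on the image.
SceneObject = Dict[str, str]
Estimator = Callable[[SceneObject, str], bool]

# Extendable registry: supporting a new property only requires registering
# one more estimator, with no question-answer training involved.
ESTIMATORS: Dict[str, Estimator] = {}


def register(name: str):
    """Decorator that adds an estimator to the registry under `name`."""
    def wrap(fn: Estimator) -> Estimator:
        ESTIMATORS[name] = fn
        return fn
    return wrap


@register("color")
def color_estimator(obj: SceneObject, value: str) -> bool:
    return obj.get("color") == value


@register("shape")
def shape_estimator(obj: SceneObject, value: str) -> bool:
    return obj.get("shape") == value


@dataclass
class QuestionNode:
    """Assumed question representation: required property values of one object."""
    requirements: Dict[str, str]


def answer_exists(node: QuestionNode, scene: List[SceneObject]) -> Tuple[str, str]:
    """Answer 'Is there an object with these properties?' with an explanation."""
    closest, closest_score = None, -1
    for obj in scene:
        results = {}
        for prop, value in node.requirements.items():
            estimator = ESTIMATORS.get(prop)
            if estimator is None:
                # Explain the failure instead of guessing an answer.
                return "unknown", f"no estimator registered for '{prop}'"
            results[prop] = estimator(obj, value)
        if all(results.values()):
            return "yes", f"found an object matching {node.requirements}"
        score = sum(results.values())
        if score > closest_score:
            closest, closest_score = obj, score
    if closest is not None and closest_score > 0:
        # Negative answer plus the closest partial match as an alternative.
        return "no", f"no object matches {node.requirements}; closest alternative: {closest}"
    return "no", f"no object matches {node.requirements}"


scene = [
    {"color": "red", "shape": "sphere"},
    {"color": "blue", "shape": "cube"},
]
print(answer_exists(QuestionNode({"color": "red", "shape": "cube"}), scene))
print(answer_exists(QuestionNode({"material": "metal"}), scene))
```

In this sketch, asking about a red cube yields a negative answer together with the red sphere as an alternative, and asking about an unregistered property such as `material` yields an explained failure. Registering a `material` estimator would make that question answerable without touching the answering procedure.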


research
06/28/2020

Improving VQA and its Explanations by Comparing Competing Explanations

Most recent state-of-the-art Visual Question Answering (VQA) systems are...
research
10/25/2018

Understand, Compose and Respond - Answering Visual Questions by a Composition of Abstract Procedures

An image related question defines a specific visual task that is require...
research
11/30/2018

From Known to the Unknown: Transferring Knowledge to Answer Questions about Novel Visual and Semantic Concepts

Current Visual Question Answering (VQA) systems can answer intelligent q...
research
02/21/2021

Learning Compositional Representation for Few-shot Visual Question Answering

Current methods of Visual Question Answering perform well on the answers...
research
04/12/2020

Which visual questions are difficult to answer? Analysis with Entropy of Answer Distributions

We propose a novel approach to identify the difficulty of visual questio...
research
11/22/2017

Visual Question Answering as a Meta Learning Task

The predominant approach to Visual Question Answering (VQA) demands that...
research
03/26/2020

P ≈ NP, at least in Visual Question Answering

In recent years, progress in the Visual Question Answering (VQA) field h...
