Dynamic Memory Networks for Visual and Textual Question Answering

03/04/2016
by   Caiming Xiong, et al.
0

Neural network architectures with memory and attention mechanisms exhibit certain reasoning capabilities required for question answering. One such architecture, the dynamic memory network (DMN), obtained high accuracy on a variety of language tasks. However, it was not shown whether the architecture achieves strong results for question answering when supporting facts are not marked during training or whether it could be applied to other modalities such as images. Based on an analysis of the DMN, we propose several improvements to its memory and input modules. Together with these changes we introduce a novel input module for images in order to be able to answer visual questions. Our new DMN+ model improves the state of the art on both the Visual Question Answering dataset and the -10k text question-answering dataset without supporting fact supervision.

READ FULL TEXT
research
06/17/2016

FVQA: Fact-based Visual Question Answering

Visual Question Answering (VQA) has attracted a lot of attention in both...
research
01/07/2016

Learning to Compose Neural Networks for Question Answering

We describe a question answering model that applies to both images and s...
research
05/25/2018

Think Visually: Question Answering through Virtual Imagery

In this paper, we study the problem of geometric reasoning in the contex...
research
04/08/2017

An Empirical Evaluation of Visual Question Answering for Novel Objects

We study the problem of answering questions about images in the harder s...
research
06/10/2022

Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model

Current architectures for multi-modality tasks such as visual question a...
research
11/01/2020

CHIME: Cross-passage Hierarchical Memory Network for Generative Review Question Answering

We introduce CHIME, a cross-passage hierarchical memory network for ques...
research
05/14/2018

Did the Model Understand the Question?

We analyze state-of-the-art deep learning models for three tasks: questi...

Please sign up or login with your details

Forgot password? Click here to reset