A Cost-Sensitive Visual Question-Answer Framework for Mining a Deep And-OR Object Semantics from Web Images

08/13/2017
by   Quanshi Zhang, et al.
0

This paper presents a cost-sensitive Question-Answering (QA) framework for learning a nine-layer And-Or graph (AoG) from web images, which explicitly represents object categories, poses, parts, and detailed structures within the parts in a compositional hierarchy. The QA framework is designed to minimize an overall risk, which trades off the loss and query costs. The loss is defined for nodes in all layers of the AoG, including the generative loss (measuring the likelihood for the images) and the discriminative loss (measuring the fitness to human answers). The cost comprises both human labor of answering questions and the computational cost of model learning. The cost-sensitive QA framework iteratively selects different storylines of questions to update different nodes in the AoG. Experiments showed that our method required much less human supervision (e.g., labeling parts on 3--10 training objects for each category) and achieved better performance than baseline methods.

READ FULL TEXT

page 6

page 9

page 11

research
11/14/2019

FAQ-based Question Answering via Knowledge Anchors

Question answering (QA) aims to understand user questions and find appro...
research
05/27/2022

V-Doc : Visual questions answers with Documents

We propose V-Doc, a question-answering tool using document images and PD...
research
10/07/2020

Learning a Cost-Effective Annotation Policy for Question Answering

State-of-the-art question answering (QA) relies upon large amounts of tr...
research
12/18/2018

Mining Interpretable AOG Representations from Convolutional Networks via Active Question Answering

In this paper, we present a method to mine object-part patterns from con...
research
11/11/2015

Visual7W: Grounded Question Answering in Images

We have seen great progress in basic perceptual tasks such as object rec...
research
11/19/2020

Logically Consistent Loss for Visual Question Answering

Given an image, a back-ground knowledge, and a set of questions about an...
research
05/14/2018

A Cost-Effective Framework for Preference Elicitation and Aggregation

We propose a cost-effective framework for preference elicitation and agg...

Please sign up or login with your details

Forgot password? Click here to reset