Tagging like Humans: Diverse and Distinct Image Annotation

03/31/2018
by Baoyuan Wu, et al.

In this work, we propose a new automatic image annotation model, dubbed diverse and distinct image annotation (D2IA). The generative model D2IA is inspired by the ensemble of human annotations, which creates semantically relevant yet distinct and diverse tags. In D2IA, we generate a relevant and distinct tag subset, in which the tags are relevant to the image contents and semantically distinct from each other, using sequential sampling from a determinantal point process (DPP) model. Multiple such tag subsets, covering diverse semantic aspects or diverse semantic levels of the image contents, are generated by randomly perturbing the DPP sampling process. We leverage a generative adversarial network (GAN) model to train D2IA. Extensive experiments on two benchmark datasets, including quantitative and qualitative comparisons as well as human subject studies, demonstrate that the proposed model produces more diverse and distinct tags than state-of-the-art methods.
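
To make the idea of sampling a relevant and distinct tag subset concrete, here is a minimal sketch of a sequential DPP-style sampler. It is an illustration under stated assumptions, not the sampler used in D2IA: the function name `sample_distinct_tags`, the per-tag relevance scores, the tag embeddings, and the kernel L = B Bᵀ with rows b_i = relevance_i · embedding_i are all hypothetical choices made for this example. At each step a tag is drawn with probability proportional to det(L_{Y∪{i}}) / det(L_Y), which equals the squared norm of b_i after its projection onto the already chosen tags is removed.

```python
import random


def sample_distinct_tags(relevance, embeddings, k, seed=None):
    """Sequentially sample up to k tag indices (illustrative sketch only).

    relevance  -- hypothetical per-tag relevance scores for one image
    embeddings -- hypothetical per-tag semantic vectors (lists of floats)
    """
    rng = random.Random(seed)
    # Row i of B is the embedding scaled by relevance; the kernel is L = B B^T.
    residual = [[r * x for x in e] for r, e in zip(relevance, embeddings)]
    chosen = []
    for _ in range(k):
        # Conditional gain of tag i: squared norm of what is left of b_i
        # after projecting out the tags chosen so far.
        gains = [0.0 if i in chosen else sum(x * x for x in v)
                 for i, v in enumerate(residual)]
        total = sum(gains)
        if total <= 1e-12:
            break  # every remaining tag is redundant with the chosen ones
        pick = rng.choices(range(len(gains)), weights=gains)[0]
        chosen.append(pick)
        # Remove the direction of the picked tag from every residual vector.
        norm = gains[pick] ** 0.5
        u = [x / norm for x in residual[pick]]
        for v in residual:
            dot = sum(a * b for a, b in zip(v, u))
            for j in range(len(v)):
                v[j] -= dot * u[j]
    return chosen
```

Calling the function with different `seed` values yields different subsets for the same image, which loosely mirrors generating multiple tag subsets by perturbing the sampling process; the exact perturbation used in D2IA may differ. Tags with high relevance are favored, while a tag whose embedding duplicates an already chosen one has zero gain and is not selected.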

