Learning where to look: Semantic-Guided Multi-Attention Localization for Zero-Shot Learning

03/01/2019
by   Yizhe Zhu, et al.
6

Zero-shot learning extends the conventional object classification to the unseen class recognition by introducing semantic representations of classes. Existing approaches predominantly focus on learning the proper mapping function for visual-semantic embedding, while neglecting the effect of learning discriminative visual features. In this paper, we study the significance of the discriminative region localization. We propose a semantic-guided multi-attention localization model, which automatically discovers the most discriminative parts of objects for zero-shot learning without any human annotations. Our model jointly learns cooperative global and local features from the whole object as well as the detected parts to categorize objects based on semantic descriptions. Moreover, with the joint supervision of embedding softmax loss and class-center triplet loss, the model is encouraged to learn features with high inter-class dispersion and intra-class compactness. Through comprehensive experiments on three widely used zero-shot learning benchmarks, we show the efficacy of the multi-attention localization and our proposed approach improves the state-of-the-art results by a considerable margin.

READ FULL TEXT

page 1

page 3

page 8

page 9

research
08/19/2020

Attribute Prototype Network for Zero-Shot Learning

From the beginning of zero-shot learning research, visual attributes hav...
research
05/21/2018

Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning

Zero-Shot Learning (ZSL) is achieved via aligning the semantic relations...
research
05/22/2017

Semantic Softmax Loss for Zero-Shot Learning

A typical pipeline for Zero-Shot Learning (ZSL) is to integrate the visu...
research
02/05/2021

Zero-shot Learning with Deep Neural Networks for Object Recognition

Zero-shot learning deals with the ability to recognize objects without a...
research
10/12/2022

Semantic Cross Attention for Few-shot Learning

Few-shot learning (FSL) has attracted considerable attention recently. A...
research
03/18/2018

Discriminative Learning of Latent Features for Zero-Shot Recognition

Zero-shot learning (ZSL) aims to recognize unseen image categories by le...
research
07/30/2021

Multi-Head Self-Attention via Vision Transformer for Zero-Shot Learning

Zero-Shot Learning (ZSL) aims to recognise unseen object classes, which ...

Please sign up or login with your details

Forgot password? Click here to reset