Transductive Zero-Shot Learning with Adaptive Structural Embedding

by   Yunlong Yu, et al.

Zero-shot learning (ZSL) endows the computer vision system with the inferential capability to recognize instances of a new category that has never seen before. Two fundamental challenges in it are visual-semantic embedding and domain adaptation in cross-modality learning and unseen class prediction steps, respectively. To address both challenges, this paper presents two corresponding methods named Adaptive STructural Embedding (ASTE) and Self-PAsed Selective Strategy (SPASS), respectively. Specifically, ASTE formulates the visualsemantic interactions in a latent structural SVM framework to adaptively adjust the slack variables to embody the different reliableness among training instances. In this way, the reliable instances are imposed with small punishments, wheras the less reliable instances are imposed with more severe punishments. Thus, it ensures a more discriminative embedding. On the other hand, SPASS offers a framework to alleviate the domain shift problem in ZSL, which exploits the unseen data in an easy to hard fashion. Particularly, SPASS borrows the idea from selfpaced learning by iteratively selecting the unseen instances from reliable to less reliable to gradually adapt the knowledge from the seen domain to the unseen domain. Subsequently, by combining SPASS and ASTE, we present a self-paced Transductive ASTE (TASTE) method to progressively reinforce the classification capacity. Extensive experiments on three benchmark datasets (i.e., AwA, CUB, and aPY) demonstrate the superiorities of ASTE and TASTE. Furthermore, we also propose a fast training (FT) strategy to improve the efficiency of most of existing ZSL methods. The FT strategy is surprisingly simple and general enough, which can speed up the training time of most existing methods by 4 300 times while holding the previous performance.


page 1

page 3

page 6

page 9


Zero-Shot Learning via Category-Specific Visual-Semantic Mapping

Zero-Shot Learning (ZSL) aims to classify a test instance from an unseen...

Transductive Zero-Shot Learning with a Self-training dictionary approach

As an important and challenging problem in computer vision, zero-shot le...

Domain segmentation and adjustment for generalized zero-shot learning

In the generalized zero-shot learning, synthesizing unseen data with gen...

Transductive Zero-Shot Learning with Visual Structure Constraint

Zero-shot Learning (ZSL) aims to recognize objects of the unseen classes...

TGG: Transferable Graph Generation for Zero-shot and Few-shot Learning

Zero-shot and few-shot learning aim to improve generalization to unseen ...

Zero-Shot Learning via Joint Latent Similarity Embedding

Zero-shot recognition (ZSR) deals with the problem of predicting class l...

Hardness Sampling for Self-Training Based Transductive Zero-Shot Learning

Transductive zero-shot learning (T-ZSL) which could alleviate the domain...

Please sign up or login with your details

Forgot password? Click here to reset