Zero-Shot Robustification of Zero-Shot Models With Foundation Models

09/08/2023
by   Dyah Adila, et al.
0

Zero-shot inference is a powerful paradigm that enables the use of large pretrained models for downstream classification tasks without further training. However, these models are vulnerable to inherited biases that can impact their performance. The traditional solution is fine-tuning, but this undermines the key advantage of pretrained models, which is their ability to be used out-of-the-box. We propose RoboShot, a method that improves the robustness of pretrained model embeddings in a fully zero-shot fashion. First, we use zero-shot language models (LMs) to obtain useful insights from task descriptions. These insights are embedded and used to remove harmful and boost useful components in embeddings – without any supervision. Theoretically, we provide a simple and tractable model for biases in zero-shot embeddings and give a result characterizing under what conditions our approach can boost performance. Empirically, we evaluate RoboShot on nine image and NLP classification tasks and show an average improvement of 15.98 zero-shot baselines. Additionally, we demonstrate that RoboShot is compatible with a variety of pretrained and language models.

READ FULL TEXT

page 27

page 28

research
10/31/2022

Zero-Shot Text Classification with Self-Training

Recent advances in large pretrained language models have increased atten...
research
05/17/2023

Equivariant Few-Shot Learning from Pretrained Models

Efficient transfer learning algorithms are key to the success of foundat...
research
06/14/2023

Zero-Shot 3D Shape Sketch View Similarity and Retrieval

We conduct a detailed study of the ability of pretrained on pretext task...
research
01/21/2023

Transfer Knowledge from Natural Language to Electrocardiography: Can We Detect Cardiovascular Disease Through Language Models?

Recent advancements in Large Language Models (LLMs) have drawn increasin...
research
09/09/2021

Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning

Recent prompt-based approaches allow pretrained language models to achie...
research
09/10/2023

Mitigating Word Bias in Zero-shot Prompt-based Classifiers

Prompt-based classifiers are an attractive approach for zero-shot classi...
research
05/04/2023

LLM2Loss: Leveraging Language Models for Explainable Model Diagnostics

Trained on a vast amount of data, Large Language models (LLMs) have achi...

Please sign up or login with your details

Forgot password? Click here to reset