Decoding Layer Saliency in Language Transformers

08/09/2023
by   Elizabeth M. Hou, et al.
0

In this paper, we introduce a strategy for identifying textual saliency in large-scale language models applied to classification tasks. In visual networks where saliency is more well-studied, saliency is naturally localized through the convolutional layers of the network; however, the same is not true in modern transformer-stack networks used to process natural language. We adapt gradient-based saliency methods for these networks, propose a method for evaluating the degree of semantic coherence of each layer, and demonstrate consistent improvement over numerous other methods for textual saliency on multiple benchmark classification datasets. Our approach requires no additional training or access to labelled data, and is comparatively very computationally efficient.

READ FULL TEXT

page 16

page 17

page 18

research
12/21/2016

Top-down Visual Saliency Guided by Captions

Neural image/video captioning models can generate accurate descriptions,...
research
11/15/2022

Evaluating the Faithfulness of Saliency-based Explanations for Deep Learning Models for Temporal Colour Constancy

The opacity of deep learning models constrains their debugging and impro...
research
04/12/2021

Evaluating Saliency Methods for Neural Language Models

Saliency methods are widely used to interpret neural network predictions...
research
08/30/2020

A Compact Deep Architecture for Real-time Saliency Prediction

Saliency computation models aim to imitate the attention mechanism in th...
research
04/20/2019

Saliency-Guided Attention Network for Image-Sentence Matching

This paper studies the task of matching image and sentence, where learni...
research
04/12/2016

Recurrent Attentional Networks for Saliency Detection

Convolutional-deconvolution networks can be adopted to perform end-to-en...
research
07/25/2023

Analyzing Chain-of-Thought Prompting in Large Language Models via Gradient-based Feature Attributions

Chain-of-thought (CoT) prompting has been shown to empirically improve t...

Please sign up or login with your details

Forgot password? Click here to reset