The Quarks of Attention

02/15/2022
by   Pierre Baldi, et al.
11

Attention plays a fundamental role in both natural and artificial intelligence systems. In deep learning, attention-based neural architectures, such as transformer architectures, are widely used to tackle problems in natural language processing and beyond. Here we investigate the fundamental building blocks of attention and their computational properties. Within the standard model of deep learning, we classify all possible fundamental building blocks of attention in terms of their source, target, and computational mechanism. We identify and study three most important mechanisms: additive activation attention, multiplicative output attention (output gating), and multiplicative synaptic attention (synaptic gating). The gating mechanisms correspond to multiplicative extensions of the standard model and are used across all current attention-based deep learning architectures. We study their functional properties and estimate the capacity of several attentional building blocks in the case of linear and polynomial threshold gates. Surprisingly, additive activation attention plays a central role in the proofs of the lower bounds. Attention mechanisms reduce the depth of certain basic circuits and leverage the power of quadratic activations without incurring their full cost.

READ FULL TEXT

page 12

page 13

page 21

page 22

page 27

page 28

research
02/24/2022

Attention Enables Zero Approximation Error

Deep learning models have been widely applied in various aspects of dail...
research
08/19/2022

An Investigation into Neuromorphic ICs using Memristor-CMOS Hybrid Circuits

The memristance of a memristor depends on the amount of charge flowing t...
research
04/16/2022

Visual Attention Methods in Deep Learning: An In-Depth Survey

Inspired by the human cognitive system, attention is a mechanism that im...
research
07/04/2015

Describing Multimedia Content using Attention-based Encoder--Decoder Networks

Whereas deep neural networks were first mostly used for classification t...
research
12/30/2021

Attention mechanisms and deep learning for machine vision: A survey of the state of the art

With the advent of state of the art nature-inspired pure attention based...
research
09/05/2017

Deep learning: Technical introduction

This note presents in a technical though hopefully pedagogical way the t...
research
04/26/2022

A survey on attention mechanisms for medical applications: are we moving towards better algorithms?

The increasing popularity of attention mechanisms in deep learning algor...

Please sign up or login with your details

Forgot password? Click here to reset