Learning Operators with Coupled Attention

01/04/2022
by   Georgios Kissas, et al.

Supervised operator learning is an emerging machine learning paradigm with applications to modeling the evolution of spatio-temporal dynamical systems and approximating general black-box relationships between functional data. We propose a novel operator learning method, LOCA (Learning Operators with Coupled Attention), motivated by the recent success of the attention mechanism. In our architecture, the input functions are mapped to a finite set of features, which are then averaged with attention weights that depend on the output query locations. By coupling these attention weights together with an integral transform, LOCA is able to explicitly learn correlations in the target output functions, enabling us to approximate nonlinear operators even when the number of output function measurements in the training set is very small. Our formulation is accompanied by rigorous approximation-theoretic guarantees on the universal expressiveness of the proposed model. Empirically, we evaluate the performance of LOCA on several operator learning scenarios involving systems governed by ordinary and partial differential equations, as well as a black-box climate prediction problem. Through these scenarios we demonstrate state-of-the-art accuracy, robustness with respect to noisy input data, and a consistently small spread of errors over testing data sets, even for out-of-distribution prediction tasks.
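The forward pass described in the abstract (features of the input function, averaged with query-dependent attention weights that are coupled across query locations) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the linear feature map `W_feat`, the linear score map `W_score`, the Gaussian coupling kernel, and the `length_scale` parameter are all hypothetical choices made only for illustration, and the integral transform is approximated by a normalized sum over the query points.

```python
import numpy as np

def softmax(z, axis=-1):
    # Numerically stable softmax.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def coupled_attention_forward(u, y, W_feat, W_score, length_scale=0.5):
    """Illustrative forward pass (assumed structure, not the paper's code).

    u       : (m,)   input function sampled at m sensor points
    y       : (P, d) output query locations
    W_feat  : (m, n) hypothetical linear feature map for the input function
    W_score : (d, n) hypothetical linear score map for the query locations
    """
    # 1) Map the input function to a finite set of n features.
    v = np.tanh(u @ W_feat)                      # (n,)

    # 2) Raw attention scores, one row per query location.
    g = y @ W_score                              # (P, n)

    # 3) Couple the scores across query locations with a discretized
    #    integral transform (assumed Gaussian kernel, row-normalized).
    sq_dist = ((y[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)  # (P, P)
    K = np.exp(-sq_dist / (2.0 * length_scale ** 2))
    K = K / K.sum(axis=1, keepdims=True)
    g_coupled = K @ g                            # (P, n)

    # 4) Normalize into attention weights and average the features.
    phi = softmax(g_coupled, axis=-1)            # (P, n), rows sum to 1
    return phi @ v, phi                          # (P,), (P, n)

# Usage with random placeholder data (no trained weights).
rng = np.random.default_rng(0)
m, n, P, d = 16, 8, 5, 1
u = rng.normal(size=m)
y = np.linspace(0.0, 1.0, P).reshape(P, d)
W_feat = rng.normal(size=(m, n))
W_score = rng.normal(size=(d, n))
out, phi = coupled_attention_forward(u, y, W_feat, W_score)
```

Because each row of `phi` is a probability vector, every output value is a convex combination of the features; the kernel step is what makes the weights at nearby query locations depend on one another.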

research
02/12/2022

MIONet: Learning multiple-input operators via tensor product

As an emerging paradigm in scientific machine learning, neural operators...
research
05/26/2022

Transformer for Partial Differential Equations' Operator Learning

Data-driven learning of partial differential equations' solution operato...
research
10/08/2019

DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators

While it is widely known that neural networks are universal approximator...
research
07/08/2022

Physics-Informed Deep Neural Operator Networks

Standard neural networks can approximate general nonlinear operators, re...
research
06/07/2022

NOMAD: Nonlinear Manifold Decoders for Operator Learning

Supervised learning in function spaces is an emerging area of machine le...
research
07/26/2023

Learning to simulate partially known spatio-temporal dynamics with trainable difference operators

Recently, using neural networks to simulate spatio-temporal dynamics has...
research
02/01/2023

Learning Functional Transduction

Research in Machine Learning has polarized into two general regression a...
