Phase Transitions for the Information Bottleneck in Representation Learning

01/07/2020
by   Tailin Wu, et al.
0

In the Information Bottleneck (IB), when tuning the relative strength between compression and prediction terms, how do the two terms behave, and what's their relationship with the dataset and the learned representation? In this paper, we set out to answer these questions by studying multiple phase transitions in the IB objective: IB_β[p(z|x)] = I(X; Z) - β I(Y; Z) defined on the encoding distribution p(z|x) for input X, target Y and representation Z, where sudden jumps of dI(Y; Z)/d β and prediction accuracy are observed with increasing β. We introduce a definition for IB phase transitions as a qualitative change of the IB loss landscape, and show that the transitions correspond to the onset of learning new classes. Using second-order calculus of variations, we derive a formula that provides a practical condition for IB phase transitions, and draw its connection with the Fisher information matrix for parameterized models. We provide two perspectives to understand the formula, revealing that each IB phase transition is finding a component of maximum (nonlinear) correlation between X and Y orthogonal to the learned representation, in close analogy with canonical-correlation analysis (CCA) in linear settings. Based on the theory, we present an algorithm for discovering phase transition points. Finally, we verify that our theory and algorithm accurately predict phase transitions in categorical datasets, predict the onset of learning new classes and class difficulty in MNIST, and predict prominent phase transitions in CIFAR10.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2022

Exact Phase Transitions in Deep Learning

This work reports deep-learning-unique first-order and second-order phas...
research
01/11/2020

Intelligence, physics and information – the tradeoff between accuracy and simplicity in machine learning

How can we enable machines to make sense of the world, and become better...
research
07/17/2019

Learnability for the Information Bottleneck

The Information Bottleneck (IB) method (tishby2000information) provides ...
research
12/01/2020

Interpretable Phase Detection and Classification with Persistent Homology

We apply persistent homology to the task of discovering and characterizi...
research
03/31/2023

Generalized Information Bottleneck for Gaussian Variables

The information bottleneck (IB) method offers an attractive framework fo...
research
09/23/2015

Detecting phase transitions in collective behavior using manifold's curvature

If a given behavior of a multi-agent system restricts the phase variable...
research
07/03/2019

Understanding Phase Transitions via Mutual Information and MMSE

The ability to understand and solve high-dimensional inference problems ...

Please sign up or login with your details

Forgot password? Click here to reset