Variational Inference with Tail-adaptive f-Divergence

10/29/2018
by   Dilin Wang, et al.
0

Variational inference with α-divergences has been widely used in modern probabilistic machine learning. Compared to Kullback-Leibler (KL) divergence, a major advantage of using α-divergences (with positive α values) is their mass-covering property. However, estimating and optimizing α-divergences require to use importance sampling, which could have extremely large or infinite variances due to heavy tails of importance weights. In this paper, we propose a new class of tail-adaptive f-divergences that adaptively change the convex function f with the tail of the importance weights, in a way that theoretically guarantees finite moments, while simultaneously achieving mass-covering properties. We test our methods on Bayesian neural networks, as well as deep reinforcement learning in which our method is applied to improve a recent soft actor-critic (SAC) algorithm. Our results show that our approach yields significant advantages compared with existing methods based on classical KL and α-divergences.

READ FULL TEXT
research
09/21/2017

Perturbative Black Box Variational Inference

Black box variational inference (BBVI) with reparameterization gradients...
research
03/01/2021

Challenges and Opportunities in High-dimensional Variational Inference

We explore the limitations of and best practices for using black-box var...
research
11/03/2018

VIREL: A Variational Inference Framework for Reinforcement Learning

Applying probabilistic models to reinforcement learning (RL) has become ...
research
05/02/2018

Alpha-Beta Divergence For Variational Inference

This paper introduces a variational approximation framework using direct...
research
11/06/2018

Deep Probabilistic Ensembles: Approximate Variational Inference through KL Regularization

In this paper, we introduce Deep Probabilistic Ensembles (DPEs), a scala...
research
04/10/2023

Deep Reinforcement Learning with Importance Weighted A3C for QoE enhancement in Video Delivery Services

Adaptive bitrate (ABR) algorithms are used to adapt the video bitrate ba...
research
02/04/2021

Variational Inference for Deblending Crowded Starfields

In the image data collected by astronomical surveys, stars and galaxies ...

Please sign up or login with your details

Forgot password? Click here to reset