Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees

07/16/2018
by Guiliang Liu, et al.

Deep Reinforcement Learning (DRL) has achieved impressive success in many applications. A key component of many DRL models is a neural network representing a Q function, which estimates the expected cumulative reward following a state-action pair. The Q function neural network contains substantial implicit knowledge about the RL problem, but it often remains unexamined and uninterpreted. To our knowledge, this work develops the first mimic learning framework for Q functions in DRL. We introduce Linear Model U-trees (LMUTs) to approximate neural network predictions. An LMUT is learned using a novel on-line algorithm that is well suited to an active play setting, where the mimic learner observes an ongoing interaction between the neural net and the environment. Empirical evaluation shows that an LMUT mimics a Q function substantially better than five baseline methods. The transparent tree structure of an LMUT facilitates understanding the network's learned knowledge by analyzing feature influence, extracting rules, and highlighting the super-pixels in image inputs.
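
To make the mimic learning idea concrete, here is a minimal sketch of fitting a small tree with a linear model in each leaf to the outputs of a Q function. It is not the paper's algorithm: it fits in batch with ordinary least squares rather than on-line, uses a one-dimensional state and a single action, and picks splits by squared error. The function `teacher_q` is a hypothetical stand-in for a trained Q network, and all names and parameters (`fit_line`, `build_tree`, `predict`, `max_depth`, `min_leaf`) are assumptions made for illustration.

```python
def teacher_q(state):
    """Hypothetical stand-in for a trained Q network (1-D state, one action)."""
    return 2.0 * state + 1.0 if state < 0.5 else -3.0 * state + 4.0


def fit_line(points):
    """Least-squares fit of y = w * x + b; returns (w, b, sum of squared errors)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in points)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in points)
    w = cov / var_x if var_x > 0 else 0.0
    b = mean_y - w * mean_x
    sse = sum((y - (w * x + b)) ** 2 for x, y in points)
    return w, b, sse


def build_tree(points, depth=0, max_depth=2, min_leaf=5):
    """Recursively split on the state; every node stores a linear model."""
    w, b, sse = fit_line(points)
    node = {"w": w, "b": b}
    if depth >= max_depth or len(points) < 2 * min_leaf or sse < 1e-9:
        return node
    points = sorted(points)
    best_sse, best_i = sse, None
    # Try each split position and keep the one with the lowest total error.
    for i in range(min_leaf, len(points) - min_leaf + 1):
        split_sse = fit_line(points[:i])[2] + fit_line(points[i:])[2]
        if split_sse < best_sse:
            best_sse, best_i = split_sse, i
    if best_i is None:
        return node
    node["threshold"] = (points[best_i - 1][0] + points[best_i][0]) / 2
    node["left"] = build_tree(points[:best_i], depth + 1, max_depth, min_leaf)
    node["right"] = build_tree(points[best_i:], depth + 1, max_depth, min_leaf)
    return node


def predict(node, state):
    """Walk down to a leaf, then apply that leaf's linear model."""
    while "threshold" in node:
        node = node["left"] if state < node["threshold"] else node["right"]
    return node["w"] * state + node["b"]


# Query the teacher on a grid of states, then fit the mimic tree to its outputs.
states = [i / 200 for i in range(200)]
samples = [(s, teacher_q(s)) for s in states]
tree = build_tree(samples)
print("split threshold:", tree.get("threshold"))
print("mimic vs teacher at 0.2:", predict(tree, 0.2), teacher_q(0.2))
```

Because each leaf holds explicit weights, the fitted tree can be read directly: the threshold says where the teacher's behavior changes, and the leaf slopes indicate how strongly the state feature influences the predicted Q value in each region.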

research
03/15/2021

Learning Symbolic Rules for Interpretable Deep Reinforcement Learning

Recent progress in deep reinforcement learning (DRL) can be largely attr...
research
05/26/2018

Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation

A variety of machine learning models have been proposed to assess the pe...
research
01/18/2021

Stable deep reinforcement learning method by predicting uncertainty in rewards as a subtask

In recent years, a variety of tasks have been accomplished by deep reinf...
research
06/04/2020

Cracking the Black Box: Distilling Deep Sports Analytics

This paper addresses the trade-off between Accuracy and Transparency for...
research
08/13/2019

Is Deep Reinforcement Learning Really Superhuman on Atari?

Consistent and reproducible evaluation of Deep Reinforcement Learning (D...
research
08/18/2011

Feature Reinforcement Learning In Practice

Following a recent surge in using history-based methods for resolving pe...
research
08/31/2021

Learning to Synthesize Programs as Interpretable and Generalizable Policies

Recently, deep reinforcement learning (DRL) methods have achieved impres...
