A long-standing goal of reinforcement learning is that algorithms can le...
Model-based reinforcement learning usually suffers from a high sample
co...
Importance sampling (IS) is a popular technique in off-policy evaluation...
Though deep reinforcement learning (DRL) has obtained substantial succes...
In high-stake scenarios like medical treatment and auto-piloting, it's r...
We present Tianshou, a highly modularized python library for deep
reinfo...
Reward shaping is one of the most effective methods to tackle the crucia...
In this paper, we focus on solving two-player zero-sum extensive games w...
We study a nonparametric Bayesian approach to linear inverse problems un...
We obtain rates of contraction of posterior distributions in inverse pro...