We propose a novel offline reinforcement learning (RL) algorithm, namely...
Sample-efficient offline reinforcement learning (RL) with linear functio...
We consider the problem of personalised news recommendation where each u...
This thesis rigorously studies fundamental reinforcement learning (RL)
m...
Offline policy learning (OPL) leverages existing data collected a priori...
We address policy learning with logged data in contextual bandits. Curre...
This paper studies the statistical theory of offline reinforcement learn...