Cameras and image-editing software often process images in the wide-gamu...
We study the problem of policy optimization (PO) with linear temporal lo...
Off-policy policy evaluation (OPE) is the problem of estimating the onli...
We present Imitation-Projected Policy Gradient (IPPG), an algorithmic
fr...
We study the problem of programmatic reinforcement learning, in which
po...
When learning policies for real-world domains, two important questions a...
The goal of this paper is to understand the impact of learning on contro...
Many modern nonlinear control methods aim to endow systems with guarante...
We study the problem of learning policies over long time horizons. We pr...