Linear temporal logic (LTL) offers a simplified way of specifying tasks ...
We study the problem of policy optimization (PO) with linear temporal lo...
We present a novel off-policy loss function for learning a transition mo...
Off-policy policy evaluation (OPE) is the problem of estimating the onli...
When learning policies for real-world domains, two important questions a...