An increasingly important building block of large scale machine learning...
This paper introduces a new principled approach for offline policy
optim...
Both in academic and industry-based research, online evaluation methods ...
We introduce Probabilistic Rank and Reward model (PRR), a scalable
proba...
Personalised interactive systems such as recommender systems require
sel...
This paper extends the Distributionally Robust Optimization (DRO) approa...
A common task for recommender systems is to build a pro le of the intere...
The combination of the re-parameterization trick with the use of variati...