An increasingly important building block of large scale machine learning...
Off-policy learning (OPL) aims at finding improved policies from logged
...
Aggregating a dataset, then injecting some noise, is a simple and common...
Both in academic and industry-based research, online evaluation methods ...
We introduce Probabilistic Rank and Reward model (PRR), a scalable
proba...
Personalised interactive systems such as recommender systems require
sel...
We present a recommender system based on the Random Utility Model. Onlin...
We consider the problem of slate recommendation, where the recommender s...
Recommender systems are often optimised for short-term reward: a
recomme...
A common task for recommender systems is to build a pro le of the intere...
The concept of causality has a controversial history. The question of wh...
The combination of the re-parameterization trick with the use of variati...
In machine learning we often try to optimise a decision rule that would ...
In this paper, the method UCB-RS, which resorts to recommendation system...
In academic literature, recommender systems are often evaluated on the t...
It is common practice in using regression type models for inferring caus...
The concept of causality has a controversial history. The question of wh...
There are three quite distinct ways to train a machine learning model on...
Session based recommendation provides an attractive alternative to the
t...
Recommender Systems are becoming ubiquitous in many settings and take ma...