Ranking interfaces are everywhere in online platforms. There is thus an ...
We study off-policy evaluation (OPE) of contextual bandit policies for l...
Off-policy evaluation (OPE) aims to accurately evaluate the performance ...
Rankings have become the primary interface in two-sided online markets. ...
In display ad auctions of Real-Time Bid-ding (RTB), a typical Demand-Sid...
Off-policy evaluation (OPE) in contextual bandits has seen rapid adoptio...
In real-world recommender systems and search engines, optimizing ranking...
Off-policy evaluation (OPE) is the method that attempts to estimate the
...
In recommender systems (RecSys) and real-time bidding (RTB) for online
a...
Off-policy Evaluation (OPE), or offline evaluation in general, evaluates...
We study off-policy evaluation (OPE) from multiple logging policies, eac...
We build and publicize the Open Bandit Dataset and Pipeline to facilitat...
Conventional bidding strategies for online display ad auction heavily re...
How can we conduct efficient hyperparameter optimization for a completel...
To construct a well-performing recommender offline, eliminating selectio...
In display advertising, predicting the conversion rate, that is, the
pro...
What is the most effective way to select the best causal model among
pot...
Implicit feedback plays a critical role to construct recommender systems...