Distribution-free Contextual Dynamic Pricing

by   Yiyun Luo, et al.

Contextual dynamic pricing aims to set personalized prices based on sequential interactions with customers. At each time period, a customer who is interested in purchasing a product comes to the platform. The customer's valuation for the product is a linear function of contexts, including product and customer features, plus some random market noise. The seller does not observe the customer's true valuation, but instead needs to learn the valuation by leveraging contextual information and historical binary purchase feedbacks. Existing models typically assume full or partial knowledge of the random noise distribution. In this paper, we consider contextual dynamic pricing with unknown random noise in the valuation model. Our distribution-free pricing policy learns both the contextual function and the market noise simultaneously. A key ingredient of our method is a novel perturbed linear bandit framework, where a modified linear upper confidence bound algorithm is proposed to balance the exploration of market noise and the exploitation of the current knowledge for better pricing. We establish the regret upper bound and a matching lower bound of our policy in the perturbed linear bandit framework and prove a sub-linear regret bound in the considered pricing problem. Finally, we demonstrate the superior performance of our policy on simulations and a real-life auto-loan dataset.


page 1

page 2

page 3

page 4


Semi-parametric dynamic contextual pricing

We consider a canonical revenue maximization problem where customers arr...

Policy Optimization Using Semiparametric Models for Dynamic Pricing

In this paper, we study the contextual dynamic pricing problem where the...

Dynamic Assortment Optimization with Changing Contextual Information

In this paper, we study the dynamic assortment optimization problem unde...

Evaluating Deep Vs. Wide Deep Learners As Contextual Bandits For Personalized Email Promo Recommendations

Personalization enables businesses to learn customer preferences from pa...

Towards Agnostic Feature-based Dynamic Pricing: Linear Policies vs Linear Valuation with Unknown Noise

In feature-based dynamic pricing, a seller sets appropriate prices for a...

Loss Functions for Discrete Contextual Pricing with Observational Data

We study a pricing setting where each customer is offered a contextualiz...

Adaptive Model Selection Framework: An Application to Airline Pricing

Multiple machine learning and prediction models are often used for the s...

Please sign up or login with your details

Forgot password? Click here to reset