Tanguy Urvoy

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Olivier Pietquin
71 publications
Emilie Kaufmann
42 publications
Stefan Riezler
41 publications
Romain Laroche
29 publications
Artem Sokolov
19 publications
Lina M. Rojas Barahona
16 publications
Pratik Gajane
14 publications
Lina Rojas-Barahona
11 publications
Edouard Leurent
10 publications
Fabrice Lefèvre
5 publications
Johannes Heinecke
5 publications

research

∙ 02/22/2023

Few-Shot Structured Policy Learning for Multi-Domain and Multi-Task Dialogues

Reinforcement learning has been widely adopted to model dialogue manager...

0 Thibault Cordier, et al. ∙

research

∙ 10/11/2022

Graph Neural Network Policies and Imitation Learning for Multi-Domain Task-Oriented Dialogues

Task-oriented dialogue systems are designed to achieve specific goals wh...

0 Thibault Cordier, et al. ∙

research

∙ 12/01/2020

Denoising Pre-Training and Data Augmentation Strategies for Enhanced RDF Verbalization with Transformers

The task of verbalization of RDF triples has known a growth in popularit...

0 Sebastien Montella, et al. ∙

research

∙ 11/25/2020

Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation

A learning dialogue agent can infer its behaviour from interactions with...

0 Thibault Cordier, et al. ∙

research

∙ 03/03/2019

Scaling up budgeted reinforcement learning

Can we learn a control policy able to adapt its behaviour in real time s...

0 Nicolas Carrara, et al. ∙

research

∙ 08/16/2017

Corrupt Bandits for Preserving Local Privacy

We study a variant of the stochastic multi-armed bandit (MAB) problem in...

0 Pratik Gajane, et al. ∙

research

∙ 01/18/2016

Bandit Structured Prediction for Learning from Partial Feedback in Statistical Machine Translation

We present an approach to structured prediction from bandit feedback, ca...

0 Artem Sokolov, et al. ∙

Success!

An error occurred

Tanguy Urvoy

Featured Co-authors

Few-Shot Structured Policy Learning for Multi-Domain and Multi-Task Dialogues

Graph Neural Network Policies and Imitation Learning for Multi-Domain Task-Oriented Dialogues

Denoising Pre-Training and Data Augmentation Strategies for Enhanced RDF Verbalization with Transformers

Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation

Scaling up budgeted reinforcement learning

Corrupt Bandits for Preserving Local Privacy

Bandit Structured Prediction for Learning from Partial Feedback in Statistical Machine Translation

Sign in with Google

Consider DeepAI Pro