Decentralized exchanges (DEXs) are a cornerstone of decentralized financ...
Recently, DARPA launched the ShELL program, which aims to explore how
ex...
While numerous works have focused on devising efficient algorithms for
r...
Large-scale language models have shown the ability to adapt to a new tas...
We initiate the mathematical study of replicability as an algorithmic
pr...
An appropriate reward function is of paramount importance in specifying ...
Recently, the study of linear misspecified bandits has generated intrigu...
This work considers the sample complexity of obtaining an
ε-optimal poli...
In this paper, we address the stochastic contextual linear bandit proble...
In contrast to the advances in characterizing the sample complexity for
...
Contextual linear bandits is a rich and theoretically important model th...
We study lifelong reinforcement learning (RL) in a regret minimization
s...
We study distributed contextual linear bandits with stochastic contexts,...
The multi-armed bandit (MAB) problem is an active learning framework tha...
Recently there is a surge of interest in understanding the horizon-depen...
Despite a large amount of effort in dealing with heavy-tailed error in
m...
Multi-agent reinforcement learning (MARL) algorithms often suffer from a...
Recently, deep reinforcement learning (RL) has achieved remarkable empir...
Although model-based reinforcement learning (RL) approaches are consider...
For the problem of task-agnostic reinforcement learning (RL), an agent f...
We propose a model-free reinforcement learning algorithm inspired by the...
Designing provably efficient algorithms with general function approximat...
Safety in reinforcement learning has become increasingly important in re...
Graph neural networks (GNNs) are powerful tools for learning from graph ...
Policy optimization methods remain a powerful workhorse in empirical
Rei...
We study the statistical limits of Imitation Learning (IL) in episodic M...
Many real-world applications, such as those in medical domains,
recommen...
The empirical success of Multi-agent reinforcement learning is encouragi...
In this paper we consider multi-objective reinforcement learning where t...
Linear Quadratic Regulators (LQR) achieve enormous successful real-world...
Bandit learning problems find important applications ranging from medica...
It is believed that a model-based approach for reinforcement learning (R...
Imitation learning (IL) aims to mimic the behavior of an expert policy i...
Regularization for optimization is a crucial technique to avoid overfitt...
Model-based reinforcement learning (RL), which finds an optimal policy u...
Reward-free reinforcement learning (RL) is a framework which is suitable...
This paper presents the first non-asymptotic result showing that a model...
Preference-based Reinforcement Learning (PbRL) replaces reward values in...
This paper studies model-based reinforcement learning (RL) for regret
mi...
Value function approximation has demonstrated phenomenal empirical succe...
Learning to plan for long horizons is a central challenge in episodic
re...
We study how to use unsupervised learning for efficient exploration in
r...
Suppose we are given a large matrix A=(a_i,j) that cannot be stored in
m...
One of the key approaches to save samples when learning a policy for a
r...
A fundamental challenge in artificial intelligence is to build an agent ...
Modern deep learning methods provide an effective means to learn good
re...
We provide efficient algorithms for overconstrained linear regression
pr...
In this paper, we settle the sampling complexity of solving discounted
t...
This work considers the sample complexity of obtaining an ϵ-optimal
poli...
Consider a two-player zero-sum stochastic game where the transition func...