Miao Lu | DeepAI

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Bin Li
210 publications
Jian Yang
171 publications
Houqiang Li
144 publications
Zhaoran Wang
121 publications
Jie Wang
115 publications
Zhuoran Yang
110 publications
Si Liu
70 publications
Zhihua Zhang
53 publications
Chen Gao
51 publications
Wei Xiong
27 publications
Qi Zhou
20 publications

research

∙ 05/29/2023

One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration

In online reinforcement learning (online RL), balancing exploration and ...

0 Zhihan Liu, et al. ∙

research

∙ 12/27/2022

Robust Consensus Clustering and its Applications for Advertising Forecasting

Consensus clustering aggregates partitions in order to find a better fit...

0 Deguang Kong, et al. ∙

research

∙ 11/21/2022

Video Background Music Generation: Dataset, Method and Evaluation

Music is essential when editing videos, but selecting music manually is ...

0 Le Zhuo, et al. ∙

research

∙ 09/12/2022

Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach

In an Markov decision process (MDP), unobservable confounders may exist ...

0 Miao Lu, et al. ∙

research

∙ 05/26/2022

Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes

We study offline reinforcement learning (RL) in partially observable Mar...

6 Miao Lu, et al. ∙

research

∙ 03/26/2022

GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection

The task of Human-Object Interaction (HOI) detection could be divided in...

0 Yue Liao, et al. ∙

research

∙ 12/20/2021

Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization

Deep reinforcement learning algorithms can perform poorly in real-world ...

4 Yufei Kuang, et al. ∙

research

∙ 08/11/2021

Mining the Benefits of Two-stage and One-stage HOI Detection

Two-stage methods have dominated Human-Object Interaction (HOI) detectio...

0 Aixi Zhang, et al. ∙