Reinforcement Learning for Generative AI: A Survey

by Yuanjiang Cao, et al.

Deep generative AI has long been an essential topic in the machine learning community, with impact on application areas such as text generation and computer vision. The dominant paradigm for training a generative model is maximum likelihood estimation, which pushes the learner to capture and approximate the target data distribution by decreasing the divergence between the model distribution and the target distribution. This formulation successfully establishes the objective of generative tasks, but it cannot satisfy all the requirements that a user might expect from a generative model. Reinforcement learning, a competitive option for injecting new training signals by creating new objectives, has demonstrated its power and flexibility in incorporating human inductive bias from multiple angles, such as adversarial learning, hand-designed rules, and learned reward models, to build a performant model. As a result, reinforcement learning has become a trending research field and has stretched the limits of generative AI in both model design and application. It is therefore reasonable to summarize the advances of recent years in a comprehensive review. Although surveys of individual application areas have appeared recently, this survey aims to provide a high-level review that spans a range of application areas. We provide a rigorous taxonomy of this area and cover a broad range of models and applications. Notably, we also survey the fast-developing area of large language models. We conclude by outlining potential directions that might address the limitations of current models and expand the frontiers of generative AI.



