Green Simulation Assisted Policy Gradient to Accelerate Stochastic Process Control

10/17/2021
by   Hua Zheng, et al.
0

This study is motivated by the critical challenges in the biopharmaceutical manufacturing, including high complexity, high uncertainty, and very limited process data. Each experiment run is often very expensive. To support the optimal and robust process control, we propose a general green simulation assisted policy gradient (GS-PG) framework for both online and offline learning settings. Basically, to address the key limitations of state-of-art reinforcement learning (RL), such as sample inefficiency and low reliability, we create a mixture likelihood ratio based policy gradient estimation that can leverage on the information from historical experiments conducted under different inputs, including process model coefficients and decision policy parameters. Then, to accelerate the learning of optimal and robust policy, we further propose a variance reduction based sample selection method that allows GS-PG to intelligently select and reuse most relevant historical trajectories. The selection rule automatically updates the samples to be reused during the learning of process mechanisms and the search for optimal policy. Our theoretical and empirical studies demonstrate that the proposed framework can perform better than the state-of-art policy gradient approach and accelerate the optimal robust process control for complex stochastic systems under high uncertainty.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2022

Variance Reduction based Partial Trajectory Reuse to Accelerate Policy Gradient Optimization

We extend the idea underlying the success of green simulation assisted p...
research
06/17/2020

Green Simulation Assisted Reinforcement Learning with Model Risk for Biomanufacturing Learning and Control

Biopharmaceutical manufacturing faces critical challenges, including com...
research
08/25/2022

Variance Reduction based Experience Replay for Policy Optimization

For reinforcement learning on complex stochastic systems where many fact...
research
03/16/2022

Stochastic Simulation Uncertainty Analysis to Accelerate Flexible Biomanufacturing Process Development

Motivated by critical challenges and needs from biopharmaceuticals manuf...
research
01/10/2022

Opportunities of Hybrid Model-based Reinforcement Learning for Cell Therapy Manufacturing Process Development and Control

Driven by the key challenges of cell therapy manufacturing, including hi...
research
01/11/2021

Reinforcement Learning under Model Risk for Biomanufacturing Fermentation Control

In the biopharmaceutical manufacturing, fermentation process plays a cri...
research
11/10/2017

Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection

Speech recognition systems have achieved high recognition performance fo...

Please sign up or login with your details

Forgot password? Click here to reset