Policy Optimization in Bayesian Network Hybrid Models of Biomanufacturing Processes

05/13/2021
by   Hua Zheng, et al.
0

Biopharmaceutical manufacturing is a rapidly growing industry with impact in virtually all branches of medicine. Biomanufacturing processes require close monitoring and control, in the presence of complex bioprocess dynamics with many interdependent factors, as well as extremely limited data due to the high cost and long duration of experiments. We develop a novel model-based reinforcement learning framework that can achieve human-level control in low-data environments. The model uses a probabilistic knowledge graph to capture causal interdependencies between factors in the underlying stochastic decision process, leveraging information from existing kinetic models from different unit operations while incorporating real-world experimental data. We then present a computationally efficient, provably convergent stochastic gradient method for policy optimization. Validation is conducted on a realistic application with a multi-dimensional, continuous state variable.

READ FULL TEXT
research
01/10/2022

Opportunities of Hybrid Model-based Reinforcement Learning for Cell Therapy Manufacturing Process Development and Control

Driven by the key challenges of cell therapy manufacturing, including hi...
research
01/11/2021

Reinforcement Learning under Model Risk for Biomanufacturing Fermentation Control

In the biopharmaceutical manufacturing, fermentation process plays a cri...
research
12/29/2020

A Deep Reinforcement Learning Based Multi-Criteria Decision Support System for Textile Manufacturing Process Optimization

Textile manufacturing is a typical traditional industry involving high c...
research
06/17/2020

Green Simulation Assisted Reinforcement Learning with Model Risk for Biomanufacturing Learning and Control

Biopharmaceutical manufacturing faces critical challenges, including com...
research
02/16/2021

Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models

Reinforcement learning is a promising paradigm for solving sequential de...
research
12/19/2020

Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach

In order for reinforcement learning techniques to be useful in real-worl...
research
05/24/2019

RL4health: Crowdsourcing Reinforcement Learning for Knee Replacement Pathway Optimization

Joint replacement is the most common inpatient surgical treatment in the...

Please sign up or login with your details

Forgot password? Click here to reset