Reinforcement Learning-based Wavefront Sensorless Adaptive Optics Approaches for Satellite-to-Ground Laser Communication

03/13/2023
by   Payam Parvizi, et al.
0

Optical satellite-to-ground communication (OSGC) has the potential to improve access to fast and affordable Internet in remote regions. Atmospheric turbulence, however, distorts the optical beam, eroding the data rate potential when coupling into single-mode fibers. Traditional adaptive optics (AO) systems use a wavefront sensor to improve fiber coupling. This leads to higher system size, cost and complexity, consumes a fraction of the incident beam and introduces latency, making OSGC for internet service impractical. We propose the use of reinforcement learning (RL) to reduce the latency, size and cost of the system by up to 30-40% by learning a control policy through interactions with a low-cost quadrant photodiode rather than a wavefront phase profiling camera. We develop and share an AO RL environment that provides a standardized platform to develop and evaluate RL based on the Strehl ratio, which is correlated to fiber-coupling performance. Our empirical analysis finds that Proximal Policy Optimization (PPO) outperforms Soft-Actor-Critic and Deep Deterministic Policy Gradient. PPO converges to within 86% of the maximum reward obtained by an idealized Shack-Hartmann sensor after training of 250 episodes, indicating the potential of RL to enable efficient wavefront sensorless OSGC.

READ FULL TEXT

page 4

page 6

page 7

research
03/27/2019

Generalized Off-Policy Actor-Critic

We propose a new objective, the counterfactual objective, unifying exist...
research
06/16/2020

Parameter-based Value Functions

Learning value functions off-policy is at the core of modern Reinforceme...
research
11/03/2021

A Self-adaptive LSAC-PID Approach based on Lyapunov Reward Shaping for Mobile Robots

To solve the coupling problem of control loops and the adaptive paramete...
research
09/26/2022

DEFT: Diverse Ensembles for Fast Transfer in Reinforcement Learning

Deep ensembles have been shown to extend the positive effect seen in typ...
research
04/19/2023

Adaptive beamforming for optical wireless communication via fiber modal control

High-speed optical wireless communication can address the exponential gr...
research
06/24/2019

Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals

This paper studies reinforcement learning (RL) under malicious falsifica...
research
12/02/2020

Pareto Deterministic Policy Gradients and Its Application in 5G Massive MIMO Networks

In this paper, we consider jointly optimizing cell load balance and netw...

Please sign up or login with your details

Forgot password? Click here to reset