Efficient Online Learning for Cognitive Radar-Cellular Coexistence via Contextual Thompson Sampling

08/24/2020
by   Charles E. Thornton, et al.
0

This paper describes a sequential, or online, learning scheme for adaptive radar transmissions that facilitate spectrum sharing with a non-cooperative cellular network. First, the interference channel between the radar and a spatially distant cellular network is modeled. Then, a linear Contextual Bandit (CB) learning framework is applied to drive the radar's behavior. The fundamental trade-off between exploration and exploitation is balanced by a proposed Thompson Sampling (TS) algorithm, a pseudo-Bayesian approach which selects waveform parameters based on the posterior probability that a specific waveform is optimal, given discounted channel information as context. It is shown that the contextual TS approach converges more rapidly to behavior that minimizes mutual interference and maximizes spectrum utilization than comparable contextual bandit algorithms. Additionally, we show that the TS learning scheme results in a favorable SINR distribution compared to other online learning algorithms. Finally, the proposed TS algorithm is compared to a deep reinforcement learning model. We show that the TS algorithm maintains competitive performance with a more complex Deep Q-Network (DQN).

READ FULL TEXT
research
10/29/2020

Constrained Online Learning to Mitigate Distortion Effects in Pulse-Agile Cognitive Radar

Pulse-agile radar systems have demonstrated favorable performance in dyn...
research
01/30/2021

Multi-player Bandits for Distributed Cognitive Radar

With new applications for radar networks such as automotive control or i...
research
03/09/2021

Constrained Contextual Bandit Learning for Adaptive Radar Waveform Selection

A sequential decision process in which an adaptive radar system repeated...
research
12/01/2022

Online Learning-based Waveform Selection for Improved Vehicle Recognition in Automotive Radar

This paper describes important considerations and challenges associated ...
research
06/23/2020

Deep Reinforcement Learning Control for Radar Detection and Tracking in Congested Spectral Environments

In this paper, dynamic non-cooperative coexistence between a cognitive p...
research
12/01/2022

When is Cognitive Radar Beneficial?

When should an online reinforcement learning-based frequency agile cogni...
research
10/21/2021

Online Meta-Learning for Scene-Diverse Waveform-Agile Radar Target Tracking

A fundamental problem for waveform-agile radar systems is that the true ...

Please sign up or login with your details

Forgot password? Click here to reset