Multi-Agent Reinforcement Learning based Joint Cooperative Spectrum Sensing and Channel Access for Cognitive UAV Networks

03/15/2021
by Weiheng Jiang, et al.

Designing clustered unmanned aerial vehicle (UAV) communication networks based on cognitive radio (CR) and reinforcement learning can significantly improve both the intelligence of such networks and the robustness of the system in a time-varying environment. Within CR, designing smarter systems for spectrum sensing and access is a key research issue. We therefore focus on dynamic cooperative spectrum sensing and channel access in clustered cognitive UAV (CUAV) communication networks. Because prior statistical information on the primary user (PU) channel occupancy state is unavailable, we propose to use multi-agent reinforcement learning (MARL) to model the CUAV spectrum competition and cooperative decision-making problem in this dynamic scenario, and we introduce a return function based on a weighted combination of sensing-transmission cost and utility to characterize the real-time rewards of the multi-agent game. On this basis, we propose three algorithms: a time slot multi-round revisit exhaustive search algorithm based on a virtual controller (VC-EXH), a Q-learning algorithm based on independent learners (IL-Q), and a deep Q-learning algorithm based on independent learners (IL-DQN). We further briefly analyze the information exchange overhead, execution complexity, and convergence of the three algorithms. Numerical simulations show that all three algorithms converge quickly, significantly improve system performance, and increase the utilization of idle spectrum resources.
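To make the independent-learner idea concrete, the following is a minimal sketch of tabular Q-learning in which each CUAV learns on its own. It is not the paper's IL-Q algorithm: the abstract does not specify the state, action, or reward definitions, so everything here is an assumption for illustration. In particular, the `reward` form and its weight `w_cost`, the choice of "previous channel" as the state, the collision rule, and the i.i.d. PU occupancy model (`idle_prob`) are all hypothetical.

```python
import random


def reward(idle, collided, w_cost=0.3, utility=1.0, cost=1.0):
    """Hypothetical weighted combination of utility and sensing-transmission cost."""
    # Utility is earned only if the chosen channel is idle and no other agent picked it.
    gain = utility if (idle and not collided) else 0.0
    return (1.0 - w_cost) * gain - w_cost * cost


class IndependentQLearner:
    """One agent with its own Q-table; other agents are treated as part of the environment."""

    def __init__(self, n_channels, alpha=0.1, gamma=0.9, epsilon=0.1, rng=None):
        self.n = n_channels
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.rng = rng or random.Random()
        # Assumed state: the channel this agent used in the previous slot.
        self.q = [[0.0] * n_channels for _ in range(n_channels)]

    def act(self, state):
        # Epsilon-greedy channel selection.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n)
        row = self.q[state]
        return max(range(self.n), key=row.__getitem__)

    def update(self, s, a, r, s_next):
        # Standard one-step Q-learning update.
        target = r + self.gamma * max(self.q[s_next])
        self.q[s][a] += self.alpha * (target - self.q[s][a])


def simulate(n_agents=2, idle_prob=(0.9, 0.5, 0.1), slots=5000, seed=0):
    """Run the agents against channels whose PU occupancy is i.i.d. per slot (assumed)."""
    rng = random.Random(seed)
    agents = [IndependentQLearner(len(idle_prob), rng=rng) for _ in range(n_agents)]
    states = [0] * n_agents
    for _ in range(slots):
        idle = [rng.random() < p for p in idle_prob]
        actions = [ag.act(s) for ag, s in zip(agents, states)]
        for i, ag in enumerate(agents):
            a = actions[i]
            collided = actions.count(a) > 1
            ag.update(states[i], a, reward(idle[a], collided), a)
        states = actions
    return agents
```

Calling `simulate()` returns the trained agents, whose `q` tables can be inspected directly. Because each learner updates only from its own reward, no Q-values are exchanged between agents in this sketch; any cooperative sensing or information exchange described in the paper would have to be layered on top.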


research
06/17/2021

Cooperative Multi-Agent Reinforcement Learning Based Distributed Dynamic Spectrum Access in Cognitive Radio Networks

With the development of the 5G and Internet of Things, amounts of wirele...
research
11/13/2018

Distributed Cooperative Spectrum Sharing in UAV Networks Using Multi-Agent Reinforcement Learning

In this paper, we develop a distributed mechanism for spectrum sharing a...
research
11/30/2020

Low-Bandwidth Communication Emerges Naturally in Multi-Agent Learning Systems

In this work, we study emergent communication through the lens of cooper...
research
07/14/2021

Learning-based Spectrum Sensing and Access in Cognitive Radios via Approximate POMDPs

A novel LEarning-based Spectrum Sensing and Access (LESSA) framework is ...
research
10/03/2022

Cooperative Multi-Agent Deep Reinforcement Learning for Reliable and Energy-Efficient Mobile Access via Multi-UAV Control

This paper addresses a novel multi-agent deep reinforcement learning (MA...
research
12/26/2017

Who is Smarter? Intelligence Measure of Learning-based Cognitive Radios

Cognitive radio (CR) is considered as a key enabling technology for dyna...
research
11/17/2019

Subcarrier Assignment Schemes Based on Q-Learning in Wideband Cognitive Radio Networks

Subcarrier assignment is of crucial importance in wideband cognitive rad...
