An intelligent decision-making method for anti-jamming communication based on deep reinforcement learning

Bailin Song; Hua Xu; Lei Jiang; Ning Rao

doi:10.1051/jnwpu/20213930641

All issues

Volume 39 / No 3 (June 2021)

JNWPU, 39 3 (2021) 641-649

Abstract

Open Access

Issue		JNWPU Volume 39, Number 3, June 2021


Page(s)		641 - 649
DOI		https://doi.org/10.1051/jnwpu/20213930641
Published online		09 August 2021

JNWPU 2021, 39(3): 641–649

An intelligent decision-making method for anti-jamming communication based on deep reinforcement learning

一种基于深度强化学习的通信抗干扰智能决策方法

Bailin Song, Hua Xu, Lei Jiang and Ning Rao

Received: 27 August 2020

Abstract

In order to solve the problem of intelligent anti-jamming decision-making in battlefield communication, this paper designs an intelligent decision-making method for communication anti-jamming based on deep reinforcement learning. Introducing experience replay and dynamic epsilon mechanism based on PHC under the framework of DQN algorithm, a dynamic epsilon-DQN intelligent decision-making method is proposed. The algorithm can better select the value of epsilon according to the state of the decision network and improve the convergence speed and decision success rate. During the decision-making process, the jamming signals of all communication frequencies are detected, and the results are input into the decision-making algorithm as jamming discriminant information, so that we can effectively avoid being jammed under the condition of no prior jamming information. The experimental results show that the proposed method adapts to various communication models, has a fast decision-making speed, and the average success rate of the convergent algorithm can reach more than 95%, which has a great advantage over the existing decision-making methods.

摘要

为解决战场通信智能抗干扰决策问题，设计了一种基于深度强化学习的通信抗干扰决策方法。该方法在DQN算法架构下引入经验回放和基于爬山策略（PHC）的动态ε机制，提出动态ε-DQN智能决策算法，该算法能够根据决策网络状态更优地选择ε值，提高收敛速度和决策成功率。在决策过程中，对所有通信频率是否存在干扰信号进行检测，将结果作为干扰判别信息输入决策算法，使算法可在无先验干扰信息条件下智能决策通信频率，在尽量保证通信不中断的前提下，有效躲避干扰。实验结果表明，所提方法适应多种通信模型，决策速度较快，算法收敛后的平均成功率可达95%以上，较已有方法具有较大优势。

Key words: anti-jamming communication / intelligent decision-making / deep reinforcement learning

关键字 : 通信抗干扰 / 智能决策 / 深度强化学习

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.