Netinfo Security ›› 2026, Vol. 26 ›› Issue (1): 91-101.doi: 10.3969/j.issn.1671-1122.2026.01.008

Previous Articles     Next Articles

A Study on Autonomous Decision-Making for Network Defense Based on Hierarchical Reinforcement Learning

WANG Huanzhen1, XU Hongping2, LI Kuangdai1, LIU Yang1, YAO Linyuan1()   

  1. 1. Beijing Institute of Astronautical System Engineering, Beijing 100076, China
    2. China Academy of Launch Vehicle Technology, Beijing 100076, China
  • Received:2025-03-17 Online:2026-01-10 Published:2026-02-13

Abstract:

To address the issue that traditional network defense decision-making methods are unable to effectively cope with complex dynamic network environments and diverse network attacks, this paper proposed a network defense autonomous decision-making method based on hierarchical reinforcement learning, combined with a high-fidelity network attack and defense simulation environment. A Markov network attack and defense game model based on incomplete information was constructed to analyze the dynamic interaction process of the attacker and defender and to formally represent the optimal defense strategy. The complex defense decision-making task caused by the unknown type of attacker was decomposed through the collaborative work of the top-level control agent and the bottom-level execution agent. Simulation experiment results under different attack and defense scenarios show that this method can make flexible and efficient decision responses to two types of penetration attack patterns, maintain resilient defense, and generate interpretable action distributions. Comparative analysis with existing related work further confirms the superiority of the proposed method in defense effectiveness.

Key words: network defense decision, Markov game, hierarchical reinforcement learning, autonomous cyber operation

CLC Number: