Netinfo Security ›› 2021, Vol. 21 ›› Issue (6): 26-35.doi: 10.3969/j.issn.1671-1122.2021.06.004

Previous Articles     Next Articles

Automatic Intrusion Response Decision-making Method Based on Q-Learning

LIU Jing*(), ZHANG Yuchen, ZHANG Hongqi   

  1. Department of Cryptogram Engineering, Information Engineering University of PLA, Zhengzhou 450001, China
  • Received:2021-01-21 Online:2021-06-10 Published:2021-07-01
  • Contact: LIU Jing* E-mail:cybersecuritys@163.com

Abstract:

Aiming at the problem of poor adaptability of existing automatic intrusion response decision-making, this paper proposes an automatic intrusion response decision-making method based on Q-Learning (Q-AIRD). Q-AIRD formalizes the states and actions of network attack and defense based on the attack graph, and introduces the attack mode layer to identify attackers with different abilities, so as to make more targeted response actions. According to the characteristics of intrusion response, the Softmax algorithm is adopted and the security threshold θ, stable reward factor μ and penalty factor ν are introduced to select the response strategy. Based on the voting mechanism, the multi-response purpose evaluation of the strategy is realized to meet the needs of the multi-response purpose. On this basis, an automatic intrusion response decision algorithm based on Q-Learning is designed. The simulation results show that Q-AIRD has good adaptability and can realize timely and effective intrusion response decision-making.

Key words: reinforcement learning, automatic intrusion response, Softmax algorithm, multi-objective decision-making

CLC Number: