信息网络安全 ›› 2020, Vol. 20 ›› Issue (5): 72-82.doi: 10.3969/j.issn.1671-1122.2020.05.009

• 技术研究 • 上一篇    下一篇

复杂网络中基于QRD的主动防御决策方法研究

金辉1,2,*(), 张红旗1,2, 张传富1,2, 胡浩1,2   

  1. 1. 中国人民解放军战略支援部队信息工程大学,郑州 450004
    2. 河南省信息安全重点实验室,郑州 450004
  • 收稿日期:2020-01-15 出版日期:2020-05-10 发布日期:2020-06-05
  • 通讯作者: 金辉 E-mail:695969075@qq.com
  • 作者简介:金辉(1988—),男,宁夏,硕士研究生,主要研究方向为网络安全建模与仿真;|张红旗(1962—),男,河北,教授,博士,主要研究方向为网络安全、等级保护和信息安全管理等|张传富(1973—),男,山东,副教授,博士,主要研究方向为计算机建模与仿真技术等|胡浩(1989—),男,安徽,讲师,博士,主要研究方向为网络态势感知、网络行为分析和图像秘密共享等
  • 基金资助:
    国家自然科学基金(61902427)

Research on Active Defense Decision-making Method Based on QRD in Complex Network

JIN Hui1,2,*(), ZHANG Hongqi1,2, ZHANG Chuanfu1,2, HU Hao1,2   

  1. 1. PLA SSF Information Engineering University, Zhengzhou 450004, China
    2. Henan Province Key Laboratory of Information Security, Zhengzhou 450004, China
  • Received:2020-01-15 Online:2020-05-10 Published:2020-06-05
  • Contact: Hui JIN E-mail:695969075@qq.com

摘要:

针对未知网络攻防场景下,信息不公开导致最优防御策略难以准确选取的问题。通过对不完全信息下的网络攻防博弈进行分析,文章首先构建具有探索机制的攻防演化博弈模型;然后基于Boltzmann探索的Q-learning复制动态方程构建攻防决策动态演化方程;最后通过求解演化稳定均衡给出最优防御策略选取方法,并刻画攻防策略的演化轨迹。仿真实验结果表明,对于小规模局域网,在探索程度参数取10附近时,生成的最优防御策略具有较好的可解释性和稳定性,能够使得防御主体获取最大防御收益。

关键词: 网络攻防, 不完全信息, 演化博弈, Q-learning复制动态方程, 最优防御策略

Abstract:

Aiming at the problem that unknown information makes the optimal defense strategy difficult to select accurately in an unknown network attack and defense scenario. By analyzing the network attack and defense game with incomplete information, firstly,theattack and defense evolutionary game model with an exploration mechanism is constructed. Then, based on Q-learning replication dynamic equationswithexploration of Boltzmann,the dynamic evolution equations ofattack and defense decision are constructed. Finally, the optimal defense strategy selection method is given by solving the evolutionary stable equilibrium, and the evolutionary trajectory of attack and defense strategies are described.The simulation experiment results show that the generated optimal defense strategy has better interpretability and stabilityfor small-scale local area networks, when the exploration degree parameter is around 10, which can enable the defense subject to obtain the maximum defense benefit.

Key words: network attack and defense, incomplete information, evolutionary game, Q-learning replication dynamic equation, optimal defense strategy

中图分类号: