• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Zhang, Huan (Zhang, Huan.) | Zheng, Kangfeng (Zheng, Kangfeng.) | Wang, Xiujuan (Wang, Xiujuan.) | Luo, Shoushan (Luo, Shoushan.) | Wu, Bin (Wu, Bin.)

收录:

EI Scopus SCIE

摘要:

As a core component of the network, web applications have become one of the preferred targets for attackers because the static configuration of web applications simplifies the exploitation of vulnerabilities by attackers. Although the moving target defense (MTD) has been proposed to increase the attack difficulty for the attackers, there is no solo approach can cope with different attacks; in addition, it is impossible to implement all these approaches simultaneously due to the resource limitation. Thus, the selection of an optimal defense strategy based on MTD has become the focus of research. In general, the confrontation of two players in the security domain is viewed as a stochastic game, and the reward matrices are known to both players. However, in a real security confrontation, this scenario represents an incomplete information game. Each player can only observe the actions performed by the opponent, and the observed actions are not completely accurate. To accurately describe the attacker's reward function to reach the Nash equilibrium, this work simulated and updated the strategy selection distribution of the attacker by observing and investigating the strategy selection history of the attacker. Next, the possible rewards of the attacker in each confrontation via the observation matrix were corrected. On this basis, the Nash-Q learning algorithm with reward quantification was proposed to select the optimal strategy. Moreover, the performances of the Minimax-Q learning algorithm and Naive-Q learning algorithm were compared and analyzed in the MTD environment. Finally, the experimental results showed that the strategy selection algorithm can enable defenders to select a more reasonable defensive strategy and achieve the maximum possible reward.

关键词:

incomplete information game Moving target defense Nash-Q learning algorithm optimal strategy selection web service

作者机构:

  • [ 1 ] [Zhang, Huan]Beijing Univ Posts & Telecommun, Sch CyberSpace Secur, Beijing 100088, Peoples R China
  • [ 2 ] [Zheng, Kangfeng]Beijing Univ Posts & Telecommun, Sch CyberSpace Secur, Beijing 100088, Peoples R China
  • [ 3 ] [Luo, Shoushan]Beijing Univ Posts & Telecommun, Sch CyberSpace Secur, Beijing 100088, Peoples R China
  • [ 4 ] [Wu, Bin]Beijing Univ Posts & Telecommun, Sch CyberSpace Secur, Beijing 100088, Peoples R China
  • [ 5 ] [Wang, Xiujuan]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China

通讯作者信息:

  • [Zheng, Kangfeng]Beijing Univ Posts & Telecommun, Sch CyberSpace Secur, Beijing 100088, Peoples R China

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

CMC-COMPUTERS MATERIALS & CONTINUA

ISSN: 1546-2218

年份: 2020

期: 2

卷: 62

页码: 763-786

3 . 1 0 0

JCR@2022

ESI学科: COMPUTER SCIENCE;

ESI高被引阀值:132

被引次数:

WoS核心集被引频次: 12

SCOPUS被引频次: 13

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 2

在线人数/总访问数:291/4429594
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司