Autonomous security analysis and penetration testing model based on attack graph and deep Q-learning network

With the continuous development and widespread application of network technology, network security issues have become increasingly prominent.Penetration testing has emerged as an important method for assessing and enhancing network security.However, traditional manual penetration testing methods suf...

Full description

Saved in:
Bibliographic Details
Main Authors: Cheng FAN, Guoqing HU, Taojie DING, Zhanhua ZHANG
Format: Article
Language:English
Published: POSTS&TELECOM PRESS Co., LTD 2023-12-01
Series:网络与信息安全学报
Subjects:
Online Access:http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2023091
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841529616531980288
author Cheng FAN
Guoqing HU
Taojie DING
Zhanhua ZHANG
author_facet Cheng FAN
Guoqing HU
Taojie DING
Zhanhua ZHANG
author_sort Cheng FAN
collection DOAJ
description With the continuous development and widespread application of network technology, network security issues have become increasingly prominent.Penetration testing has emerged as an important method for assessing and enhancing network security.However, traditional manual penetration testing methods suffer from inefficiency,human error, and tester skills, leading to high uncertainty and poor evaluation results.To address these challenges, an autonomous security analysis and penetration testing framework called ASAPT was proposed, based on attack graphs and deep Q-learning networks (DQN).The ASAPT framework was consisted of two main components:training data construction and model training.In the training data construction phase, attack graphs were utilized to model the threats in the target network by representing vulnerabilities and possible attacker attack paths as nodes and edges.By integrating the common vulnerability scoring system (CVSS) vulnerability database, a “state-action”transition matrix was constructed, which depicted the attacker’s behavior and transition probabilities in different states.This matrix comprehensively captured the attacker’s capabilities and network security status.To reduce computational complexity, a depth-first search (DFS) algorithm was innovatively applied to simplify the transition matrix, identifying and preserving all attack paths that lead to the final goal for subsequent model training.In the model training phase, a deep reinforcement learning algorithm based on DQN was employed to determine the optimal attack path during penetration testing.The algorithm interacted continuously with the environment, updating the Q-value function to progressively optimize the selection of attack paths.Simulation results demonstrate that ASAPT achieves an accuracy of 84% in identifying the optimal path and exhibits fast convergence speed.Compared to traditional Q-learning, ASAPT demonstrates superior adaptability in dealing with large-scale network environments, which could provide guidance for practical penetration testing.
format Article
id doaj-art-7557cc044753465cb3a370f9d38c8527
institution Kabale University
issn 2096-109X
language English
publishDate 2023-12-01
publisher POSTS&TELECOM PRESS Co., LTD
record_format Article
series 网络与信息安全学报
spelling doaj-art-7557cc044753465cb3a370f9d38c85272025-01-15T03:16:56ZengPOSTS&TELECOM PRESS Co., LTD网络与信息安全学报2096-109X2023-12-01916617559580903Autonomous security analysis and penetration testing model based on attack graph and deep Q-learning networkCheng FANGuoqing HUTaojie DINGZhanhua ZHANGWith the continuous development and widespread application of network technology, network security issues have become increasingly prominent.Penetration testing has emerged as an important method for assessing and enhancing network security.However, traditional manual penetration testing methods suffer from inefficiency,human error, and tester skills, leading to high uncertainty and poor evaluation results.To address these challenges, an autonomous security analysis and penetration testing framework called ASAPT was proposed, based on attack graphs and deep Q-learning networks (DQN).The ASAPT framework was consisted of two main components:training data construction and model training.In the training data construction phase, attack graphs were utilized to model the threats in the target network by representing vulnerabilities and possible attacker attack paths as nodes and edges.By integrating the common vulnerability scoring system (CVSS) vulnerability database, a “state-action”transition matrix was constructed, which depicted the attacker’s behavior and transition probabilities in different states.This matrix comprehensively captured the attacker’s capabilities and network security status.To reduce computational complexity, a depth-first search (DFS) algorithm was innovatively applied to simplify the transition matrix, identifying and preserving all attack paths that lead to the final goal for subsequent model training.In the model training phase, a deep reinforcement learning algorithm based on DQN was employed to determine the optimal attack path during penetration testing.The algorithm interacted continuously with the environment, updating the Q-value function to progressively optimize the selection of attack paths.Simulation results demonstrate that ASAPT achieves an accuracy of 84% in identifying the optimal path and exhibits fast convergence speed.Compared to traditional Q-learning, ASAPT demonstrates superior adaptability in dealing with large-scale network environments, which could provide guidance for practical penetration testing.http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2023091autonomous penetration testingreinforcement learningattack graphdeep Q-learning network
spellingShingle Cheng FAN
Guoqing HU
Taojie DING
Zhanhua ZHANG
Autonomous security analysis and penetration testing model based on attack graph and deep Q-learning network
网络与信息安全学报
autonomous penetration testing
reinforcement learning
attack graph
deep Q-learning network
title Autonomous security analysis and penetration testing model based on attack graph and deep Q-learning network
title_full Autonomous security analysis and penetration testing model based on attack graph and deep Q-learning network
title_fullStr Autonomous security analysis and penetration testing model based on attack graph and deep Q-learning network
title_full_unstemmed Autonomous security analysis and penetration testing model based on attack graph and deep Q-learning network
title_short Autonomous security analysis and penetration testing model based on attack graph and deep Q-learning network
title_sort autonomous security analysis and penetration testing model based on attack graph and deep q learning network
topic autonomous penetration testing
reinforcement learning
attack graph
deep Q-learning network
url http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2023091
work_keys_str_mv AT chengfan autonomoussecurityanalysisandpenetrationtestingmodelbasedonattackgraphanddeepqlearningnetwork
AT guoqinghu autonomoussecurityanalysisandpenetrationtestingmodelbasedonattackgraphanddeepqlearningnetwork
AT taojieding autonomoussecurityanalysisandpenetrationtestingmodelbasedonattackgraphanddeepqlearningnetwork
AT zhanhuazhang autonomoussecurityanalysisandpenetrationtestingmodelbasedonattackgraphanddeepqlearningnetwork