Moving target defense of routing randomization with deep reinforcement learning against eavesdropping attack
作者机构:Academy of People’s Armed PoliceBeijing100012China PLA SSF Information Engineering UniversityZhengzhou450001China Institute of Information EngineeringChinese Academy of SciencesBeijing100093China School of Cyber SecurityUniversity of Chinese Academy of SciencesBeijing101408China
出 版 物:《Digital Communications and Networks》 (数字通信与网络(英文版))
年 卷 期:2022年第8卷第3期
页 面:373-387页
核心收录:
学科分类:0810[工学-信息与通信工程] 08[工学] 081001[工学-通信与信息系统]
基 金:supported by the National Key Research and Development Program of China 国家自然科学基金
主 题:Routing randomization Moving target defense Deep reinforcement learning Deep deterministic policy gradient
摘 要:Eavesdropping attacks have become one of the most common attacks on networks because of their easy implementation. Eavesdropping attacks not only lead to transmission data leakage but also develop into other more harmful attacks. Routing randomization is a relevant research direction for moving target defense, which has been proven to be an effective method to resist eavesdropping attacks. To counter eavesdropping attacks, in this study, we analyzed the existing routing randomization methods and found that their security and usability need to be further improved. According to the characteristics of eavesdropping attacks, which are “latent and transferable, a routing randomization defense method based on deep reinforcement learning is proposed. The proposed method realizes routing randomization on packet-level granularity using programmable switches. To improve the security and quality of service of legitimate services in networks, we use the deep deterministic policy gradient to generate random routing schemes with support from powerful network state awareness. In-band network telemetry provides real-time, accurate, and comprehensive network state awareness for the proposed method. Various experiments show that compared with other typical routing randomization defense methods, the proposed method has obvious advantages in security and usability against eavesdropping attacks.