A reinforcement learning-based method for generating adversarial examples against PE malware

Computer Engineering & Science ›› 2026, Vol. 48 ›› Issue (4): 617-627.

• Computer Network and Information Security •

ZHANG Chaoran, MA Yuqi, ZHANG Sanfeng, YANG Wang

(1. School of Cyber Science and Engineering, Southeast University, Nanjing 211189;
2. Key Laboratory of Computer Network and Information Integration (Southeast University),
Ministry of Education, Nanjing 211189, China)

Received: 2024-02-27 Revised: 2024-09-24 Online: 2026-04-25 Published: 2026-04-29

Abstract

Abstract: This paper proposes a reinforcement learning-based method for generating adversarial examples against PE malware. First, it treats the generation of PE malware adversarial examples as a sequence-to-sequence generation task: sequences are modeled on an offline reinforcement learning dataset, and the sequence generation capability of the Transformer is used to build each sequence incrementally by predicting the action at every step. Second, an information transmission mechanism is introduced to transfer information across episodes during reinforcement learning, which improves data efficiency. Experimental results show that the PE malware adversarial examples generated by this method achieve a higher evasion rate than those of the comparison methods and exhibit transferability.

Key words: reinforcement learning, adversarial example, PE malware, malware detection
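
The step-by-step generation described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the action names, `toy_policy` (a stand-in for a trained Transformer), and `toy_detector` (a stand-in for a malware detector) are all hypothetical, and the sample is represented only by the list of actions applied to it. The sketch shows only the control flow of predicting one action per step until the detector is evaded, plus how an evasion rate could be computed.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical functionality-preserving modifications; the source does not
# list its action set, so these names are assumptions for illustration.
ACTIONS = ("append_overlay", "add_section", "rename_section", "pad_header")

Policy = Callable[[Sequence[str]], str]     # action history -> next action
Detector = Callable[[Sequence[str]], bool]  # True means "flagged as malware"


def toy_policy(history: Sequence[str]) -> str:
    """Stand-in for a trained sequence model: cycles through ACTIONS."""
    return ACTIONS[len(history) % len(ACTIONS)]


def toy_detector(applied: Sequence[str]) -> bool:
    """Stand-in detector: flags the sample until 3 distinct actions are applied."""
    return len(set(applied)) < 3


def generate_episode(
    policy: Policy, detector: Detector, max_steps: int = 10
) -> Tuple[List[str], bool]:
    """Build an action sequence one step at a time; stop once the detector is evaded."""
    history: List[str] = []
    for _ in range(max_steps):
        history.append(policy(history))  # predict and apply the next action
        if not detector(history):        # query the detector on the modified sample
            return history, True
    return history, False


def evasion_rate(outcomes: Sequence[bool]) -> float:
    """Fraction of samples whose adversarial version evaded detection."""
    return sum(outcomes) / len(outcomes) if outcomes else 0.0


if __name__ == "__main__":
    actions, evaded = generate_episode(toy_policy, toy_detector)
    print(actions, evaded)
    print(evasion_rate([True, False, True, True]))
```

In a real setting the policy would condition on the full trajectory (states, actions, rewards) rather than on the action history alone, and the detector would score an actual modified PE file.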

ZHANG Chaoran, MA Yuqi, ZHANG Sanfeng, YANG Wang. A reinforcement learning-based method for generating adversarial examples against PE malware[J]. Computer Engineering & Science, 2026, 48(4): 617-627.
