Netinfo Security ›› 2025, Vol. 25 ›› Issue (1): 159-172.doi: 10.3969/j.issn.1671-1122.2025.01.014

Previous Articles     Next Articles

A Dynamic Malware Detection Method Based on Ensemble Learning

LIU Qiang1,2, WANG Jian1, WANG Yanan1(), WANG Shan3   

  1. 1. School of Air Defense and Antimissile, Air Force Engineering University, Xi’an 710051, China
    2. Graduate School of Air Force Engineering University, Xi’an 710051, China
    3. 94789 Troop of PLA, Nanjing 210018, China
  • Received:2024-09-25 Online:2025-01-10 Published:2025-02-14
  • Contact: WANG Yanan E-mail:wyn1988814@163.com

Abstract:

In the current network environment, constantly upgrading variants of malicious code pose significant challenges to network security. Although existing artificial intelligence models have shown significant effectiveness in detecting malicious code, there are still two undeniable shortcomings. Firstly, their generalization ability is poor. Although they perform well on training data, their performance is not ideal in actual testing due to the phenomenon of concept drift. Secondly, their robustness is poor and they are susceptible to attacks from adversarial samples. To solve the above problems, this paper proposed a dynamic detection method for malicious code based on ensemble learning. According to the different features of API sequences, statistical feature analysis module, semantic feature analysis module, and structural feature analysis module were respectively constructed. Each module performed targeted malicious code detection, and finally integrated the analysis results of each module to obtain the final detection conclusion. The experimental results on the Speakeasy dataset show that compared with existing research methods, this method has significant advantages in various performance indicators and good robustness, which can effectively resist two adversarial attack methods against API sequences.

Key words: malware detection, n-gram algorithm, Transformer encoder, graph neural network, adversarial attack

CLC Number: