Netinfo Security ›› 2025, Vol. 25 ›› Issue (10): 1579-1588.doi: 10.3969/j.issn.1671-1122.2025.10.009

Previous Articles     Next Articles

Multi-Feature Fusion for Malicious PDF Document Detection Based on CNN-BiLSTM-CBAM

WANG Youhe, SUN Yi()   

  1. School of Cryptography Engineering, Information Engineering University, Zhengzhou 450001, China
  • Received:2025-05-25 Online:2025-10-10 Published:2025-11-07
  • Contact: SUN Yi E-mail:11112072@bjtu.edu.cn

Abstract:

In order to solve the problems that the existing detection methods of malicious PDF documents ignore the semantic relationship between features and are often limited to a single type of feature analysis, this paper proposed a detection scheme, which applied the CNN-BiLSTM-CBAM model and multi-feature fusion to the detection of malicious PDF documents. This method not only integrated the conventional and structural information extracted from static analysis, but also combined the API sequence information captured by dynamic analysis to build a comprehensive multi-dimensional feature set. First, the model used convolutional neural network to extract local features of feature set. Secondly, BiLSTM was used to capture the dependency and context-semantic relationship between features, and convolution block attention module (CBAM) was used to assign different weights to different features to screen out the most distinguishable key features. Finally, Softmax classifier was used to calculate the detection results. The experimental results show that compared with the existing methods, the proposed model shows significant advantages in key performance indicators such as accuracy, recall and F1 score, and effectively improves the detection performance of malicious PDF documents.

Key words: malicious PDF document detection, multi-feature fusion, convolutional block attention module, BiLSTM

CLC Number: