Netinfo Security ›› 2024, Vol. 24 ›› Issue (10): 1506-1514.doi: 10.3969/j.issn.1671-1122.2024.10.004

Previous Articles     Next Articles

Mining Traffic Detection Method Based on Global Feature Learning

WEI Jinxia1,2, HUANG Xizhang1,2, FU Yuhao1, LI Jing1, LONG Chun1,2()   

  1. 1. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083, China
    2. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2024-06-22 Online:2024-10-10 Published:2024-09-27

Abstract:

Mining traffic detection is a variable-length data classification task. Existing detection schemes, such as keyword matching and N-gram feature signatures, which are based on local feature classification methods, fail to fully utilize the global features of traffic. By employing deep learning models to model mining traffic, global features within the mining traffic are extracted to enhance the accuracy of mining traffic detection. The traffic classification model proposed in the article first employed a Transformer encoder to extract global features of the traffic, followed by a sequence summarizer to process the encoded results, obtaining a fixed-length representation for classification. Due to the mining samples accounting for less than 3% in the dataset, using accuracy to measure the classification effect of the model leads to significant bias. Therefore, the article comprehensively considered the precision and recall of the model, and employed the F1 score to evaluate the classification performance. Utilizing sinusoidal positional encoding in the model’s encoder enables the model to achieve an F1 score of 99.84% on the test set, with a precision rate of 100%.

Key words: mining malware, traffic classification, deep learning, sequence processing

CLC Number: