Netinfo Security ›› 2023, Vol. 23 ›› Issue (11): 1-8.doi: 10.3969/j.issn.1671-1122.2023.11.001

Previous Articles     Next Articles

Malicious Domain Detection Method Based on Multivariate Time-Series Features

YAO Yuan1,2,3, FAN Zhaoshan1,2, WANG Qing1,2, TAO Yuan4()   

  1. 1. Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100085, China
    2. School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China
    3. Hubei Branch of The National Internet Emergency Center of China, Wuhan 430072, China
    4. The Third Research Institute of the Ministry of Public Security, Shanghai 200031, China
  • Received:2023-05-06 Online:2023-11-10 Published:2023-11-10

Abstract:

At present, malicious domains as the main attack vector are widely abused in a variety of network attack activities. To address the problems of complex design of detection features in malicious domain detection, the need for empirical knowledge assistance and the ease of targeted bypassing by attackers, the paper proposed a malicious domain detection method based on multivariate temporal features. The method uses a deep learning model based on fused long and short-term memory networks and full convolutional neural networks to automatically extract multivariate temporal embedding features from client requests and domain resolution traffic, respectively, and learn low-dimensional temporal representations of malicious domain behaviors. Compared with traditional time-statistical feature schemes or time-series local pattern discrimination schemes, this method can establish long-term domain activity patterns and distinguish the behavior sequences of malicious domains from normal domains, which has more powerful malicious domain detection capability. Meanwhile, the method supports the fusion of multivariate time-series embedding features and generic malicious domain detection features to characterize malicious behavior information in multiple dimensions, improving detection performance as well as model robustness and scalability.

Key words: malicious domain, long short-term memory, fully convolutional network, multivariate time-series feature, feature fusion

CLC Number: