Netinfo Security ›› 2026, Vol. 26 ›› Issue (3): 462-470.doi: 10.3969/j.issn.1671-1122.2026.03.012

Previous Articles     Next Articles

Hidden Link Headline Detection Method Based on Multi-Modal Features

YIN Jie1, LIU Jiayin1, HUANG Xiaoyu1, LAN Haoliang1(), XIE Wenwei2   

  1. 1. Department of Computer Information and Cyber Security, Jiangsu Police Institute, Nanjing 210031, China
    2. Trend Micro Incorporated, Nanjing 210012, China
  • Received:2025-08-08 Online:2026-03-10 Published:2026-03-30

Abstract:

As the growing phenomenon of web page tampering with implanted hidden links, and the popularity of automatic detection methods, hidden link headline implantation has become one of the important factors endangering network security. Currently, the detection rate of unimodal, natural language processing-based detection techniques gradually decreases as hidden link attackers adopt disguises such as morphological close characters, interference symbols, and emoticons. To address this problem, this paper proposed a multimodal detection method based on image features and text features. The proposed method first extracted the semantic features and image features of the headline text with BERT and ResNet respectively, and then based on the gate function and multi-headed attention methods, the features were deeply fused to achieve the classification of hidden link headlines. Experimental results on the evaluation dataset show that the recognition accuracy of the proposed method can reach 0.966, which is about 1 percentage points higher than that of the benchmark method. This indicates that the image features can effectively overcome the shortage that text features cannot cope with the problem of headline disguise.

Key words: hidden link headline detection, BERT, ResNet, multi-modal feature fusion

CLC Number: