Netinfo Security ›› 2026, Vol. 26 ›› Issue (4): 605-614.doi: 10.3969/j.issn.1671-1122.2026.04.008

Previous Articles     Next Articles

Research on Graph Neural Network Text Matching Model for Derivative Classification

YU Miao1,2, GUO Songhui1(), SONG Shuaichao1, YANG Yeming1   

  1. 1 School of Cryptography Engineering, Cyberspace Force Information Engineering University, Zhengzhou 450001, China
    2 95861 PLA Unit, Jiuquan 735018, China
  • Received:2025-05-12 Online:2026-04-10 Published:2026-04-29

Abstract:

Derivative classification is a method that judge the degree of secrets according to the similarity of text semantics. It is generally abstracted as a text matching task. Due to the fact that texts to be classified have the characteristics of long length, sparse secret key-point features and complex semantics structure, the traditional text matching method is difficult to accurately model and capture the features of secret key-point that contains the semantics of confidential matters in the text. Therefore, a targeted graph neural network text matching model for derivative classification was proposed, which transformed text matching into a graph matching problem. Firstly, a secret key-point feature extractor was designed to model the text as a matching graph representing the features of secret key-point, so as to solve the problem of weak representation of secret key-point features of the text to be classified. Secondly, a hierarchized graph neutral network was designed to perform multiple rounds of updating and aggregation operations on the encoded matching graph, so as to enhance the extraction of similarity features between the texts to be classified. Finally, the classification result was predicted according to the edges of the matching graph. Experimental results indicate that the performance of the model in this paper is significantly improved on the dataset that simulating derivative classification. The accuracy of the classification is increased by more than 4.77% and the F1 value is increased by more than 3.83%.

Key words: derivative classification, graph neural network, secret key-point feature extractor, long text matching, matching graph

CLC Number: