信息网络安全 ›› 2026, Vol. 26 ›› Issue (4): 605-614.doi: 10.3969/j.issn.1671-1122.2026.04.008

• 学术研究 • 上一篇    下一篇

面向派生定密的图神经网络文本匹配模型研究

于淼1,2, 郭松辉1(), 宋帅超1, 杨烨铭1   

  1. 1 网络空间部队信息工程大学密码工程学院郑州 450001
    2 95861 部队酒泉 735018
  • 收稿日期:2025-05-12 出版日期:2026-04-10 发布日期:2026-04-29
  • 通讯作者: 郭松辉 E-mail:songhui.guo@outlook.com
  • 作者简介:于淼(1987—),男,黑龙江,工程师,硕士,CCF会员,主要研究方向为自然语言处理|郭松辉(1979—),男,四川,研究员,博士,CCF会员,主要研究方向为人工智能安全和云计算安全|宋帅超(2000—),男,河南,博士研究生,CCF会员,主要研究方向为人工智能安全和生物特征安全|杨烨铭(1999—),男,河南,博士研究生,CCF会员,主要研究方向为人工智能安全和后量子安全
  • 基金资助:
    国家自然科学基金(62176265)

Research on Graph Neural Network Text Matching Model for Derivative Classification

YU Miao1,2, GUO Songhui1(), SONG Shuaichao1, YANG Yeming1   

  1. 1 School of Cryptography Engineering, Cyberspace Force Information Engineering University, Zhengzhou 450001, China
    2 95861 PLA Unit, Jiuquan 735018, China
  • Received:2025-05-12 Online:2026-04-10 Published:2026-04-29

摘要:

派生定密是根据文本语义相似程度判断密级的定密方式,一般被抽象为文本匹配任务。由于待定密文本普遍具有篇幅较长、密点特征稀疏、语义结构复杂等特点,传统文本匹配方法难以准确建模和捕获文本中包含涉密事项语义的密点特征,因此,文章提出一种面向派生定密的图神经网络文本匹配模型,将文本匹配转化为图匹配问题。首先,设计密点特征提取器,将文本建模为表示密点特征的匹配图,以解决待定密文本密点特征表示能力弱的问题。然后,设计分层化图神经网络,对编码后的匹配图进行多轮更新和聚合操作,以增强对待定密文本之间相似性特征的提取。最后,根据匹配图的边预测文本定密结果。实验结果表明,在模拟派生定密数据集上,该模型性能提升明显,准确率提升4.77%以上,F1值提升3.83%以上。

关键词: 派生定密, 图神经网络, 密点特征提取器, 长文本匹配, 匹配图

Abstract:

Derivative classification is a method that judge the degree of secrets according to the similarity of text semantics. It is generally abstracted as a text matching task. Due to the fact that texts to be classified have the characteristics of long length, sparse secret key-point features and complex semantics structure, the traditional text matching method is difficult to accurately model and capture the features of secret key-point that contains the semantics of confidential matters in the text. Therefore, a targeted graph neural network text matching model for derivative classification was proposed, which transformed text matching into a graph matching problem. Firstly, a secret key-point feature extractor was designed to model the text as a matching graph representing the features of secret key-point, so as to solve the problem of weak representation of secret key-point features of the text to be classified. Secondly, a hierarchized graph neutral network was designed to perform multiple rounds of updating and aggregation operations on the encoded matching graph, so as to enhance the extraction of similarity features between the texts to be classified. Finally, the classification result was predicted according to the edges of the matching graph. Experimental results indicate that the performance of the model in this paper is significantly improved on the dataset that simulating derivative classification. The accuracy of the classification is increased by more than 4.77% and the F1 value is increased by more than 3.83%.

Key words: derivative classification, graph neural network, secret key-point feature extractor, long text matching, matching graph

中图分类号: