信息网络安全 ›› 2026, Vol. 26 ›› Issue (5): 699-712.doi: 10.3969/j.issn.1671-1122.2026.05.003

• 学术研究 • 上一篇    下一篇

基于质量相异度的异常数据检测方法

张宏涛1,2, 张霄1(), 郭毅1,3, 张连成1,3, 李旭青1   

  1. 1 郑州大学网络空间安全学院, 郑州 450002
    2 郑州大学网络管理中心, 郑州 450001
    3 网络空间部队信息工程大学网络空间安全学院, 郑州 450001
  • 收稿日期:2025-12-02 出版日期:2026-05-10 发布日期:2026-06-03
  • 通讯作者: 张霄 xzhang@gs.zzu.edu.cn
  • 作者简介:张宏涛(1977—),男,陕西,高级实验师,博士,主要研究方向为网络与系统安全、数据安全和下一代互联网安全|张霄(2000—),男,河南,硕士研究生,主要研究方向为数据安全、异常数据检测|郭毅(1984—),男,福建,副教授,博士,主要研究方向为下一代互联网安全、域间路由安全技术和数据安全|张连成(1982—),男,河南,副教授,博士,主要研究方向为下一代互联网安全、IPv6网络安全和SND网络安全|李旭青(2000—),男,河南,硕士研究生,主要研究方向为数据安全、区块链
  • 基金资助:
    河南省自然科学基金(242300421415);国家电网有限公司科技项目(5700-202318300A-1-1-ZN)

Anomaly Data Detection Method Based on Quality Dissimilarity

ZHANG Hongtao1,2, ZHANG Xiao1(), GUO Yi1,3, ZHANG Liancheng1,3, LI Xuqing1   

  1. 1 School of Cyber Science and Engineering, Zhengzhou University, Zhengzhou 450002, China
    2 Network Management Center, Zhengzhou University, Zhengzhou 450001, China
    3 School of Cyberspace Security, Cyberspace Force Information Engineering University, Zhengzhou 450001, China
  • Received:2025-12-02 Online:2026-05-10 Published:2026-06-03

摘要:

各类数据管理系统规模和复杂性日益提升,其产生的数据呈现高维、非线性等特征,导致数据冗余、异常污染和质量下降等问题,进而威胁系统安全。传统异常数据检测方法存在精度低、适应性差等局限,难以有效应对此场景。文章提出一种基于质量相异度的异常数据检测方法。该方法利用粒子群算法来识别关键特征,并通过数据质量相异度评估数据间差异来建立检测模型,从而有效区分正常与异常数据模式。实验结果证明,该方法优于传统的统计和机器学习方法,尤其是在处理高维的非线性数据集方面。该方法从数据质量差异的视角刻画数据相异度,显著提升了复杂环境下异常数据识别的准确性与可靠性。

关键词: 异常数据检测, 数据安全, 特征选择, 粒子群优化, 质量相异度

Abstract:

The scale and complexity of various data management systems continue to escalate, with the data they generate exhibiting characteristics such as high dimensionality and non-linearity. This often leads to issues such as data redundancy, contamination by anomalies, and diminished quality, thereby posing a threat to system security. Traditional anomaly data detection methods struggle to address these challenges effectively, exhibiting limitations such as low accuracy and poor adaptability. This paper proposed an anomaly data detection approach based on quality dissimilarity. The method employed a particle swarm optimisation algorithm to identify key features and constructed a detection model by evaluating sample differences through data quality dissimilarity, thereby effectively distinguishing normal from anomalous data patterns. Experiments demonstrate that this method outperforms traditional statistical and machine learning approaches, particularly when handling high-dimensional, non-linear datasets. By characterising sample dissimilarity through data quality divergence, the method significantly enhances the accuracy and reliability of anomaly detection in complex environments.

Key words: anomaly data detection, data security, feature selection, particle swarm optimisation, quality dissimilarity

中图分类号: