信息网络安全 ›› 2017, Vol. 17 ›› Issue (8): 76-82.doi: 10.3969/j.issn.1671-1122.2017.08.011

• • 上一篇    下一篇

一种基于时间序列分解的数据窃密事件检测方法研究

安冉1, 朱小波2, 严寒冰2()   

  1. 1. 北京航空航天大学计算机学院,北京 100191
    2. 国家计算机网络应急技术处理协调中心,北京 100029
  • 收稿日期:2017-06-20 出版日期:2017-08-20 发布日期:2020-05-12
  • 作者简介:

    作者简介: 安冉(1993—),男,内蒙古,硕士研究生,主要研究方向为网络安全、机器学习;朱小波(1987—),男,江西,工程师,硕士,主要研究方向为网络安全;严寒冰(1975—),男,江西, 教授级高级工程师,博士,主要研究方向为网络安全监测、应急响应处理、图形图像分析等。

  • 基金资助:
    国家科技支撑计划[2015BAK21B01]

Research on a Method of Data Theft Detection Based on Time Series Decomposition

Ran AN1, Xiaobo ZHU2, Hanbing YAN2()   

  1. 1. School of Computer Science, Beihang University, Beijing 100191, China
    2. National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, China
  • Received:2017-06-20 Online:2017-08-20 Published:2020-05-12

摘要:

在网络安全领域中,数据窃密检测是重要的研究内容。文章提出一种应用在网络流量场景下的时间序列分解算法,将时间序列分解为季节性数据、趋势数据、残差数据3部分,采用滑动窗口内的中位数来更好地拟合趋势数据,并且针对离散单点进行了过滤。同时,将异常点所在时间范围作为算法的最后输出形式。文章提出利用信息熵工具有助于发现隐蔽性较高的数据窃密行为。文中将本文算法和Piecewise Median算法、STL算法进行对比,并在经信息熵处理后的时间序列上应用本文算法进行检测。实验表明,本文算法相对于Piecewise Median算法、STL算法有较大幅度的性能提升,数据窃密检测效果良好。

关键词: 大型服务器, 数据窃密, 时间序列分解, 滑动窗口

Abstract:

In the field of network security, data theft detection is an important part of research contents. This paper proposes a time series decomposition algorithm in network traffic scenarios which decomposes data into three parts of seasonal data, trend data and residual data. The algorithm uses median in sliding window to fit better with the trend data, filters discrete single points, and takes the time interval containing continuous outliers as the final output form of the algorithm. The paper proposes that the information entropy of payload length is helpful detecting the hidden data theft behaviors. In the part of experiment, the algorithm is compared with STL and Piecewise Median algorithm. The algorithm is used to detect the time series that are processed with information entropy tool. Experiments show that, compared with STL and Piecewise Median algorithm, this algorithm improves the performances greatly, data theft detection effect is well.

Key words: large-scale server, data theft, time series decomposition, sliding window

中图分类号: