Netinfo Security ›› 2022, Vol. 22 ›› Issue (10): 8-14. doi: 10.3969/j.issn.1671-1122.2022.10.002

• Selected Papers •

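Tor Website Fingerprinting Identification Model Based on MHA and SDAE

JIANG Shouzhi, CAO Jinxuan, YIN Haozhan, LU Tianliang

  1. School of Information Network Security, People's Public Security University of China, Beijing 100038, China
  • Received: 2022-07-20 Online: 2022-10-10 Published: 2022-11-15
  • Corresponding author: CAO Jinxuan, E-mail: caojinxuan@163.com
  • About the authors: JIANG Shouzhi (born 1999), male, Jiangsu, master's student; research interests: network information security and anonymous networks | CAO Jinxuan (born 1966), female, Beijing, professor, master's degree; research interests: data mining and data analysis | YIN Haozhan (born 1998), male, Shandong, master's student; research interests: network information security and anonymous networks | LU Tianliang (born 1985), male, Hebei, associate professor, Ph.D.; research interests: network information security, malicious code analysis and detection
  • Funding:
    National Natural Science Foundation of China (61602489); Fundamental Research Funds of People's Public Security University of China (2020JKF101)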

Identifying Tor Website Fingerprinting Model Based on MHA and SDAE

JIANG Shouzhi, CAO Jinxuan, YIN Haozhan, LU Tianliang

  1. School of Information Network Security, People’s Public Security University of China, Beijing 100038, China
  • Received: 2022-07-20 Online: 2022-10-10 Published: 2022-11-15
  • Contact: CAO Jinxuan, E-mail: caojinxuan@163.com

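Abstract (translated from Chinese):

To address the low open-world accuracy of Tor website fingerprinting and the problem of concept drift, this paper proposes MHA-SDAE-GRU, a website fingerprinting model based on MHA and SDAE. Website traffic is first processed into sequence format. A multi-head self-attention mechanism then captures the key information in the input, a stacked denoising autoencoder learns deep features of the traffic to make the model more robust, and a GRU learns the relationships between earlier and later elements of the sequence. Finally, a Softmax function outputs the result. Experimental results show that the MHA-SDAE-GRU model achieves higher closed-world accuracy than algorithms such as CUMUL, better open-world accuracy and robustness than algorithms such as CNN, and better adaptability to new data in the concept drift experiments than algorithms such as CNN. The MHA-SDAE-GRU model performs very well on website fingerprinting.

Keywords: website fingerprinting, multi-head attention, stacked denoising autoencoder, recurrent neural network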

Abstract:

This paper addresses the low open-world accuracy of Tor website fingerprinting and the problem of concept drift by proposing MHA-SDAE-GRU, a website fingerprinting model based on multi-head attention (MHA) and a stacked denoising autoencoder (SDAE). Website traces are first processed into sequence form. Multi-head self-attention then extracts the essential information from the input, the SDAE learns deep features of the traces to improve robustness, and a GRU learns the relationships within the sequence before a Softmax function outputs the result. Experimental results show that the MHA-SDAE-GRU model achieves higher closed-world accuracy than algorithms such as CUMUL, better open-world accuracy and robustness than algorithms such as CNN, and better adaptability to new data in the concept drift experiments than algorithms such as CNN. The MHA-SDAE-GRU model is effective at Tor website fingerprinting.

Key words: website fingerprinting, MHA, SDAE, recurrent neural network
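The pipeline described in the abstract (sequence input, multi-head self-attention, stacked denoising autoencoder, GRU, Softmax) can be illustrated with a minimal forward-pass sketch in plain Python. This is not the authors' implementation. The direction-based sequence encoding, the reshaping into time steps, all layer sizes, the random untrained weights, the omission of bias terms, and every function name below are assumptions chosen only to make the data flow concrete. Training, including the noise-based training of the autoencoder, is omitted.

```python
import math
import random

rng = random.Random(0)  # fixed seed so the sketch is reproducible

def rand_matrix(rows, cols, scale=0.5):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

def matmul(a, b):
    # (n x k) times (k x m) gives (n x m)
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def softmax(v):
    m = max(v)
    e = [math.exp(x - m) for x in v]
    s = sum(e)
    return [x / s for x in e]

def to_sequence(trace, length):
    # Assumed encoding: +1 for outgoing packets, -1 for incoming, zero-padded.
    seq = [1.0 if size > 0 else -1.0 for size in trace][:length]
    return seq + [0.0] * (length - len(seq))

def multi_head_self_attention(x, heads, wq, wk, wv, wo):
    # x is T x d; each head attends over its own slice of the projected features.
    dh = len(x[0]) // heads
    q, k, v = matmul(x, wq), matmul(x, wk), matmul(x, wv)
    out = [[] for _ in x]
    for h in range(heads):
        sl = slice(h * dh, (h + 1) * dh)
        for t, qt in enumerate(q):
            scores = [sum(a * b for a, b in zip(qt[sl], kt[sl])) / math.sqrt(dh) for kt in k]
            w = softmax(scores)
            out[t] += [sum(wi * vt[sl][j] for wi, vt in zip(w, v)) for j in range(dh)]
    return matmul(out, wo)

def corrupt(x, p):
    # Masking noise: zero each input value with probability p (training only).
    return [[0.0 if rng.random() < p else val for val in row] for row in x]

def sdae_encode(x, weights, p=0.0):
    # Encoder half of a stacked denoising autoencoder; decoders are omitted.
    h = corrupt(x, p)
    for w in weights:
        h = [[sigmoid(val) for val in row] for row in matmul(h, w)]
    return h

def gru_last_state(x, wz, uz, wr, ur, wh, uh):
    h = [0.0] * len(uz)
    for xt in x:
        z = [sigmoid(a + b) for a, b in zip(matmul([xt], wz)[0], matmul([h], uz)[0])]
        r = [sigmoid(a + b) for a, b in zip(matmul([xt], wr)[0], matmul([h], ur)[0])]
        rh = [ri * hi for ri, hi in zip(r, h)]
        cand = [math.tanh(a + b) for a, b in zip(matmul([xt], wh)[0], matmul([rh], uh)[0])]
        h = [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]
    return h

def init_params(d=4, hidden=5, n_classes=3):
    return {
        "mha": [rand_matrix(d, d) for _ in range(4)],      # Wq, Wk, Wv, Wo
        "sdae": [rand_matrix(d, 6), rand_matrix(6, 3)],    # two stacked encoder layers
        "gru": [rand_matrix(3, hidden) if i % 2 == 0 else rand_matrix(hidden, hidden)
                for i in range(6)],                        # Wz, Uz, Wr, Ur, Wh, Uh
        "out": rand_matrix(hidden, n_classes),
    }

def classify(trace, params, seq_len=16, steps=4, heads=2):
    seq = to_sequence(trace, seq_len)
    d = seq_len // steps
    x = [seq[i * d:(i + 1) * d] for i in range(steps)]     # reshape into T x d
    attended = multi_head_self_attention(x, heads, *params["mha"])
    features = sdae_encode(attended, params["sdae"])
    state = gru_last_state(features, *params["gru"])
    return softmax(matmul([state], params["out"])[0])

# Hypothetical trace of signed packet sizes (positive means outgoing).
probs = classify([512, -1500, -1500, 512, -800], init_params())
print(probs)
```

With untrained weights the output is only a valid probability vector over the assumed three classes; in a real system each entry would correspond to a monitored website, plus an "unmonitored" class in the open-world setting.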

CLC Number: