信息网络安全 ›› 2026, Vol. 26 ›› Issue (5): 747-757.doi: 10.3969/j.issn.1671-1122.2026.05.007

• 学术研究 • 上一篇    下一篇

一种基于多特征融合的加密流量分类方法

苏兆品1,2,3, 方宏程1, 张国富1,3,4(), 王垚飞1,2,3   

  1. 1 合肥工业大学计算机与信息学院, 合肥 230601
    2 智能互联系统安徽省实验室, 合肥 230009
    3 工业安全与应急技术安徽省重点实验室, 合肥 230601
    4 音视频智能防识联合实验室, 合肥 230009
  • 收稿日期:2025-11-25 出版日期:2026-05-10 发布日期:2026-06-03
  • 通讯作者: 张国富 zgf@hfut.edu.cn
  • 作者简介:苏兆品(1983—),女,山东,副教授,博士,主要研究方向为复杂智能系统、多媒体安全|方宏程(2001—),男,安徽,硕士研究生,主要研究方向为加密流量分类|张国富(1979—),男,安徽,教授,博士,主要研究方向为语音安全|王垚飞(1996—),男,河北,副教授,博士,主要研究方向为多媒体内容安全
  • 基金资助:
    国家自然科学基金(62302146)

A Multi-Feature Fusion Based Encrypted Traffic Classification Method

SU Zhaopin1,2,3, FANG Hongcheng1, ZHANG Guofu1,3,4(), WANG Yaofei1,2,3   

  1. 1 School of Computer and Information Technology, Hefei University of Technology, Hefei 230601, China
    2 Intelligent Interconnected System Anhui Laboratory, Hefei 230009, China
    3 Anhui Province Key Laboratory of Industry Safety and Emergency Technology, Hefei 230601, China
    4 Joint Laboratory of Intelligent Prevention and Recognition of Audio and Video, Hefei 230009, China
  • Received:2025-11-25 Online:2026-05-10 Published:2026-06-03

摘要:

随着加密通信的广泛应用,流量分类面临新的挑战。传统方法在处理加密流量时效果不佳,现有方法要么依赖手工特征提取,要么无法充分捕捉数据包间的交互模式。为此,文章提出一种基于多特征融合的加密流量分类(MFF-ETC)方法。在预处理阶段,该方法将数据包的有效载荷生成的包级图像组合成会话图像,有效减轻信息混淆并保留数据包间的交互模式。在分类阶段,会话图像分别经过包视觉转换器(PVT)、时间流量卷积网络(T-TCN)和流量门控瓶颈卷积(T-GBConv)模块处理,提取全局特征、全尺度时间特征和空间特征。随后,通过动态加权机制融合3类特征,根据流量类型动态调整权重,实现更准确的分类。实验结果表明,MFF-ETC在ISCX-VPN-Service、ISCX-VPN-App和CSTNET-TLS1.3上的F1分数分别达到98.81%、98.93%和98.05%,验证了该方法的高分类精度与泛化能力。

关键词: 加密流量分类, 全局特征, 时间特征, 空间特征, 动态加权机制融合

Abstract:

With the widespread adoption of encrypted communications, traffic classification faces new challenges. Traditional methods perform poorly when dealing with encrypted traffic, as existing approaches either rely on manual feature extraction or fail to fully capture the interaction patterns between packets. To address this issue, this paper proposed a Multi-feature Fusion based Encrypted Traffic classification method (MFF-ETC). In the preprocessing stage, the method combined packet-level images generated from packet payloads into session images, effectively mitigating information confusion while preserving the interaction patterns among packets. In the classification stage, the session images were processed by three modules: the Packet Vision Transformer (PVT), the Temporal Traffic Convolutional Network (T-TCN), and the Traffic Gated Bottleneck Convolution (T-GBConv) module, which extracted global features, full-scale temporal features, and spatial features, respectively. Subsequently, a dynamic weighting mechanism fused these three types of features, adjusting their weights adaptively according to the traffic type to achieve more accurate classification. Experimental results demonstrate that MFF-ETC achieves F1-score of 98.81%, 98.93%, and 98.05% on the ISCX-VPN-Service, ISCX-VPN-App, and CSTNET-TLS1.3 datasets, respectively, validating the method’s high classification accuracy and generalization capability.

Key words: encrypted traffic classification, global features, temporal features, spatial features, dynamic weighted feature fusion

中图分类号: