信息网络安全 ›› 2026, Vol. 26 ›› Issue (4): 642-653.doi: 10.3969/j.issn.1671-1122.2026.04.011

• 技术研究 • 上一篇    下一篇

基于多特征感知和注意力机制的深度伪造图像检测研究

袁小刚(), 裴桓, 安德智, 万建鑫   

  1. 甘肃政法大学网络空间安全学院兰州 730070
  • 收稿日期:2026-01-07 出版日期:2026-04-10 发布日期:2026-04-29
  • 通讯作者: 袁小刚 E-mail:yxg7349@gsupl.edu.cn
  • 作者简介:袁小刚(1980—),男,江苏,教授,博士,CCF会员,主要研究方向为网络安全、信息内容安全和密码技术|裴桓(2001—),女,甘肃,硕士研究生,主要研究方向为计算机视觉|安德智(1973—),男,浙江,教授,本科,CCF会员,主要研究方向为网络安全和信息内容安全|万建鑫(2001—),男,河南,硕士研究生,主要研究方向为网络安全
  • 基金资助:
    国家自然科学基金(62562005);甘肃省高校教师创新基金(2025A-124);兰州市科技计划(2023-1-53)

Research on Deepfake Image Detection Based on Multi-Feature Perception and Attention Mechanism

YUAN Xiaogang(), PEI Huan, AN Dezhi, WAN Jianxin   

  1. School of Cyberspace Security, Gansu University of Political Science and Law, Lanzhou 730070, China
  • Received:2026-01-07 Online:2026-04-10 Published:2026-04-29

摘要:

随着生成对抗网络(GAN)和扩散技术的不断进步,生成的图像在视觉质量上已经达到一个较高水平,与真实图像几乎难以分辨,这对个人隐私和社会安全均构成潜在威胁。为应对这一挑战,文章提出一种多特征融合的深度伪造图像检测模型,该模型结合全局、局部和颜色特征,以全面捕捉生成图像中的伪造痕迹,进而准确识别图像真伪。全局分支聚焦提取整个图像的全局空间信息,局部分支通过细粒度选择模块关注关键区域的局部特征,而颜色分支则增强了对不同颜色空间中伪造特征的适应性。将这些特征通过注意力机制进行融合,全面提升对深度伪造图像伪造痕迹的捕捉能力。通过在14个GAN和5个扩散模型数据集上的实验,验证了该方法对不同生成模型均具有较高的检测准确性和泛化能力,为深度伪造图像的检测提供了一种高效且可靠的解决方案。

关键词: 深度伪造图像检测, 生成对抗网络, 扩散模型, 颜色差异, 注意力机制

Abstract:

With the continuous advancement of GAN and diffusion technologies, the visual quality of generated images had reached an exceptionally high level, making them nearly indistinguishable from real images. This posed potential threats to personal privacy and social security. To address this challenge, a multi-feature fusion model for deepfake image detection was proposed, integrating global, local, and color features to comprehensively capture forgery traces in generated images and accurately identify their authenticity. The global branch focused on extracting the overall spatial information of the image, the local branch employed a fine-grained selection module to capture local features in key regions, and the color branch enhanced adaptability to forgery features across different color spaces. These features were fused through an attention mechanism, which significantly improved the capability of capturing forgery traces in deepfake images. Extensive experiments conducted on 14 GAN datasets and 5 diffusion model datasets demonstrate that the proposed method achieves high detection accuracy and strong generalization ability across different generative models, providing an efficient and reliable solution for deepfake image detection.

Key words: deepfake image detection, generative adversarial network, diffusion model, color disparities, attention mechanism

中图分类号: