信息网络安全 ›› 2026, Vol. 26 ›› Issue (4): 566-578.doi: 10.3969/j.issn.1671-1122.2026.04.005

• 学术研究 • 上一篇    下一篇

基于知识迁移和冻结的模型反演防御方法

易文哲1,2, 徐枭洋1,2, 石磊3, 庄泳1,2, 王鹃1,2()   

  1. 1 武汉大学国家网络安全学院武汉 430072
    2 空天信息安全与可信计算教育部重点实验室武汉 430072
    3 北京禹宏信安科技有限公司北京 100101
  • 收稿日期:2025-06-16 出版日期:2026-04-10 发布日期:2026-04-29
  • 通讯作者: 王鹃 E-mail:jwang@whu.edu.cn
  • 作者简介:易文哲(2001—),男,湖北,博士研究生,主要研究方向为人工智能隐私安全、可信人工智能|徐枭洋(1999—),男,河北,博士研究生,主要研究方向为人工智能安全、分布式学习安全|石磊(1980—),男,山东,高级工程师,硕士,CCF会员,主要研究方向为网络安全、数据安全|庄泳(1999—),女,湖北,博士研究生,主要研究方向为生成式人工智能安全、可信人工智能|王鹃(1976—),女,湖北,教授,博士,CCF会员,主要研究方向为网络安全、可信计算、系统安全、人工智能安全
  • 基金资助:
    智能电网国家科技重大专项(2030);智能电网国家科技重大专项(2024ZD0803000);湖北省重点研发计划(2023BAB165)

Model Inversion Defense Method Based on Knowledge Transfer and Freezing

YI Wenzhe1,2, XU Xiaoyang1,2, SHI Lei3, ZHUANG Yong1,2, WANG Juan1,2()   

  1. 1 School of Cyber Science and Engineering, Wuhan University, Wuhan 430072, China
    2 Key Laboratory of Aerospace Information Security and Trusted Computing of Ministry of Education, Wuhan 430072, China
    3 Beijing Yuhong Xin’an Technology Co., Ltd., Beijing 100101, China
  • Received:2025-06-16 Online:2026-04-10 Published:2026-04-29

摘要:

随着深度学习技术的快速发展和广泛应用,其所引发的隐私安全问题也日益受到关注。其中,模型反演攻击能够仅凭模型参数还原用户的人脸图像,对用户隐私构成严重威胁。尽管现有研究已提出多种防御方案,但仍存在模型性能与防御效果难以权衡、对新型攻击防御能力不足等问题。针对上述问题,文章提出一种基于知识迁移和冻结的模型反演防御方法。该方法通过冻结与分类相关的全连接层,有效防止隐私信息被提取,同时迁移紧邻全连接层的参数,以进一步增强防御能力。实验结果表明,与现有防御方法相比,该方法在多个模型和数据集上均展现出更优的防御性能与稳定性。

关键词: 模型反演攻击, 迁移学习, 隐私保护

Abstract:

With the rapid development and widespread application of deep learning technology, concerns about privacy and security issues have been growing. Model inversion attacks can reconstruct users facial images solely based on model parameters, posing a serious threat to user privacy. Although existing research has proposed various defense strategies, there are still challenges in balancing model performance and defense effectiveness, as well as in defending against emerging attacks. To address these issues, this paper proposed a model inversion defense method based on knowledge transfer and freezing. By freezing the fully connected layers most relevant to classification, the method effectively prevented the extraction of private information. Meanwhile, it transferred the parameters adjacent to the fully connected layers to further enhance defense performance. Experimental results demonstrate that, compared to existing defense methods, the proposed method achieves superior defense effectiveness and stability across multiple models and datasets.

Key words: model inversion attacks, transfer learning, privacy protection

中图分类号: