Netinfo Security ›› 2024, Vol. 24 ›› Issue (8): 1173-1183.doi: 10.3969/j.issn.1671-1122.2024.08.004

Previous Articles     Next Articles

A Multi-Scale Feature Fusion Deepfake Detection Algorithm Based on Reconstruction Learning

XU Kaiwen1, ZHOU Yichao1, GU Wenquan2, CHEN Chen3, HU Xiyuan1()   

  1. 1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
    2. Luyi County Public Security Bureau Video Investigation Brigade, Zhoukou 477299, China
    3. Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
  • Received:2024-05-11 Online:2024-08-10 Published:2024-08-22

Abstract:

With the rapid development of deepfake technology, the detection of deepfake faces has become a research hotspot in the field of computer vision. Although existing detection methods based on noise, local texture, or frequency features can exhibit good detection performance to a certain extent or in specific scenarios, these methods lack in-depth exploration of fine-grained facial representation features, limiting their generalization ability. To address the above issues, this paper proposed a novel classification network model based on multi-scale feature fusion reconstruction MSFFR. This network explored fine-grained facial content and gradient representation features from the perspective of reconstruction learning and achieved deepfake face detection through multi-scale feature fusion. The model included three innovative modules, a dual-branch feature extraction module designed to reveal distribution differences between real and fake faces; a fine-grained content and gradient feature fusion module to explore the correlation between fine-grained content features and gradient features of faces; a bidirectional attention module based on reconstruction disparity, effectively guiding the model to classify the fused features. Extensive experiments conducted on large-scale benchmark datasets demonstrate that, compared with existing state-of-the-art techniques, the proposed method significantly improves detection performance, especially in terms of generalization ability.

Key words: deepfake detection, multi-scale feature fusion, reconstruction learning, deep generative model

CLC Number: