Netinfo Security ›› 2025, Vol. 25 ›› Issue (1): 63-77.doi: 10.3969/j.issn.1671-1122.2025.01.006

Previous Articles     Next Articles

Research on Federated Learning Adaptive Differential Privacy Method Based on Heterogeneous Data

XU Ruzhi, TONG Yumeng(), DAI Lipeng   

  1. School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China
  • Received:2024-09-28 Online:2025-01-10 Published:2025-02-14
  • Contact: TONG Yumeng E-mail:tongym02@163.com

Abstract:

In federated learning, the need for a large amount of parameter exchange may lead to security threats from untrusted participating devices. In order to protect training data and model parameters, effective privacy protection measures must be taken. Given the imbalanced nature of heterogeneous data, this paper proposed an adaptive differential privacy method to protect the security of federated learning based on heterogeneous data. Firstly, different initial privacy budgets were set for different clients, and Gaussian noise was added to the gradient parameters of the local model; Secondly, during the training process, the privacy budget of each client was dynamically adjusted based on the loss function value of each iteration to accelerate convergence speed; Then, set a trusted central node to randomly exchange the parameters of each layer of local models from different clients, and then uploaded the confused local model parameters to the central server for aggregation; Finally, the central server aggregated the obfuscation parameters uploaded by trusted central nodes, added appropriate noise to the global model based on a pre-set global privacy budget threshold, and performed privacy correction to achieve server level privacy protection. The experimental results show that under the same heterogeneous data conditions, compared to ordinary differential privacy methods, the adaptive differential privacy method proposed in this paper has faster convergence speed and better model performance.

Key words: federated learning, heterogeneous data, differential privacy, Gaussian noise

CLC Number: