Netinfo Security ›› 2024, Vol. 24 ›› Issue (8): 1196-1209.doi: 10.3969/j.issn.1671-1122.2024.08.006

Previous Articles     Next Articles

Hierarchical Clustering Federated Learning Framework for Personalized Privacy-Preserving

GUO Qian1, ZHAO Jin2, GUO Yi1()   

  1. 1. Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
    2. School of Computer Science, Fudan University, Shanghai 200433, China
  • Received:2024-01-28 Online:2024-08-10 Published:2024-08-22

Abstract:

Federated learning (FL) is an emerging framework of privacy-preserving distributed machine learning that effectively deals with the privacy leakage problem by utilizing cryptographic primitives. However, how to prevent poisoning attacks in distributed situations has recently become a research hotspot FL concern. Currently, most existing works rely on an independently identical distribution situation and identify malicious gradients using plaintext, which cannot handle the data heterogeneity scenario challenges and imposes significant privacy leakage risks due to releasing unencrypted gradients. To address these challenges, this paper proposed a hierarchical clustering federated learning framework for personalized privacy-preserving. The framework exploited homomorphic encryption by employing the median coordinate as the benchmark. Subsequently, it employed a secure cosine similarity scheme to identify poisonous gradients, and it innovatively utilized clustering as part of the defense mechanism and developed a hierarchical aggregation that enhances the proposed mode’s robustness in IID and non-IID scenarios. Experimental results on the MNIST, CIFAR-10 and Fashion-MNIST datasets indicates that it has powerful privacy-preserving capabilities, and compared to existing defense strategies of FedAVG, PPeFL Media, Trimmed Mean and Clustering, the proposed method achieves an average improvement of 14.90%, 9.59%, 29.50%, 26.57% and 23.19% on accuracy, respectively.

Key words: federated learning, hierarchical aggregation, homomorphic encryption, privacy-preserving

CLC Number: