A robust and personalized privacy-preserving approach for adaptive clustered federated distillation

Abstract Federated learning (FL) is a promising approach that addresses privacy, and scalability concerns in contrast to traditional centralized methods. Challenges such as personalization and data heterogeneity issues remain critical. Clustered federated learning (CFL) has been proposed as a promis...

Full description

Saved in:
Bibliographic Details
Main Authors: Mai Shawkat, Zainab H. Ali, Mofreh Salem, Ali El-desoky
Format: Article
Language:English
Published: Nature Portfolio 2025-04-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-96468-8
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Federated learning (FL) is a promising approach that addresses privacy, and scalability concerns in contrast to traditional centralized methods. Challenges such as personalization and data heterogeneity issues remain critical. Clustered federated learning (CFL) has been proposed as a promising approach to alleviate these issues by establishing specialized global models for sets of similar users. Although CFL enhances adaptability to highly statistically heterogeneous environments, it may suffer from real-time distribution changes due to limitations in fixed cluster configurations. This study presents the robust model of personalized federated distillation (RMPFD), a personalized and privacy-enhanced framework. The RMPFD framework employs an adaptive hierarchical clustering strategy to generate semi-global models by grouping clients with similar data distributions, allowing them to train independently. Meta-learning is used in each cluster to enhance the personalization of the local models and the classification accuracy of the non-independent and Identically distributed (non-IID) data distributions. Experimental evaluations conducted on CIFAR- 10, CIFAR- 100, Fashion-MNIST and Enron email datasets reveal that RMPFD reduces communication overhead by approximately 15% and 20%, compared to Federated Averaging (FedAvg) and other baseline techniques. Moreover, the RMPFD framework improves the convergence rates and classification accuracy, leading to an improvement of over 12% in performance compared to traditional FL methods.
ISSN:2045-2322