FedDSL: A Novel Client Selection Method to Handle Statistical Heterogeneity in Cross-Silo Federated Learning Using Flower Framework

Federated learning provides a mechanism for different silos to collaborate, and each silo gets aid without compromising privacy. This simulation study is based on healthcare datasets, so the silos are hospitals or healthcare organizations. The selection of hospitals for federated learning increases...

Full description

Saved in:
Bibliographic Details
Main Authors: Vineetha Pais, Santhosha Rao, Balachandra Muniyal
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10720779/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850199858483298304
author Vineetha Pais
Santhosha Rao
Balachandra Muniyal
author_facet Vineetha Pais
Santhosha Rao
Balachandra Muniyal
author_sort Vineetha Pais
collection DOAJ
description Federated learning provides a mechanism for different silos to collaborate, and each silo gets aid without compromising privacy. This simulation study is based on healthcare datasets, so the silos are hospitals or healthcare organizations. The selection of hospitals for federated learning increases the overall performance of the model. Cross-silo comes with many challenges, even though the number of participating clients is limited compared to cross-device federated learning. This study specifically addresses two of those aspects, heterogeneity of data and local performance. An approach called FedDSL based on ‘Datasize’, ‘Skewness’, and ‘Local Performance’ is introduced in this paper. Initially, synthetic data are generated considering the size of the data and skewness, which creates statistical heterogeneity in the cross-silo environment. Once this environment is created, a client selection strategy is applied that uses a weighted approach to select clients. A statistical analysis checks the data distributed among hospitals using skewness and normality tests. Experiments are conducted using the Flower Framework, and FedDSL is compared with random client selection. The model is applied with various aggregation algorithms, including FedAvg, FedProx, and FedAdam. The results show an increased model performance with the FedDSL approach compared to random client selection.
format Article
id doaj-art-e8af2c97ab6c405f926e5194da32dfb2
institution OA Journals
issn 2169-3536
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-e8af2c97ab6c405f926e5194da32dfb22025-08-20T02:12:30ZengIEEEIEEE Access2169-35362024-01-011215964815965910.1109/ACCESS.2024.348238810720779FedDSL: A Novel Client Selection Method to Handle Statistical Heterogeneity in Cross-Silo Federated Learning Using Flower FrameworkVineetha Pais0https://orcid.org/0000-0003-3525-1523Santhosha Rao1https://orcid.org/0000-0001-9511-3048Balachandra Muniyal2https://orcid.org/0000-0002-4839-0082Department of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka, IndiaDepartment of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka, IndiaDepartment of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka, IndiaFederated learning provides a mechanism for different silos to collaborate, and each silo gets aid without compromising privacy. This simulation study is based on healthcare datasets, so the silos are hospitals or healthcare organizations. The selection of hospitals for federated learning increases the overall performance of the model. Cross-silo comes with many challenges, even though the number of participating clients is limited compared to cross-device federated learning. This study specifically addresses two of those aspects, heterogeneity of data and local performance. An approach called FedDSL based on ‘Datasize’, ‘Skewness’, and ‘Local Performance’ is introduced in this paper. Initially, synthetic data are generated considering the size of the data and skewness, which creates statistical heterogeneity in the cross-silo environment. Once this environment is created, a client selection strategy is applied that uses a weighted approach to select clients. A statistical analysis checks the data distributed among hospitals using skewness and normality tests. Experiments are conducted using the Flower Framework, and FedDSL is compared with random client selection. The model is applied with various aggregation algorithms, including FedAvg, FedProx, and FedAdam. The results show an increased model performance with the FedDSL approach compared to random client selection.https://ieeexplore.ieee.org/document/10720779/Federated learningflower frameworkhospitalsskewnesscross-silo
spellingShingle Vineetha Pais
Santhosha Rao
Balachandra Muniyal
FedDSL: A Novel Client Selection Method to Handle Statistical Heterogeneity in Cross-Silo Federated Learning Using Flower Framework
IEEE Access
Federated learning
flower framework
hospitals
skewness
cross-silo
title FedDSL: A Novel Client Selection Method to Handle Statistical Heterogeneity in Cross-Silo Federated Learning Using Flower Framework
title_full FedDSL: A Novel Client Selection Method to Handle Statistical Heterogeneity in Cross-Silo Federated Learning Using Flower Framework
title_fullStr FedDSL: A Novel Client Selection Method to Handle Statistical Heterogeneity in Cross-Silo Federated Learning Using Flower Framework
title_full_unstemmed FedDSL: A Novel Client Selection Method to Handle Statistical Heterogeneity in Cross-Silo Federated Learning Using Flower Framework
title_short FedDSL: A Novel Client Selection Method to Handle Statistical Heterogeneity in Cross-Silo Federated Learning Using Flower Framework
title_sort feddsl a novel client selection method to handle statistical heterogeneity in cross silo federated learning using flower framework
topic Federated learning
flower framework
hospitals
skewness
cross-silo
url https://ieeexplore.ieee.org/document/10720779/
work_keys_str_mv AT vineethapais feddslanovelclientselectionmethodtohandlestatisticalheterogeneityincrosssilofederatedlearningusingflowerframework
AT santhosharao feddslanovelclientselectionmethodtohandlestatisticalheterogeneityincrosssilofederatedlearningusingflowerframework
AT balachandramuniyal feddslanovelclientselectionmethodtohandlestatisticalheterogeneityincrosssilofederatedlearningusingflowerframework