Improving Federated Learning Through Low-Entropy Client Sampling Based on Learned High-Level Features

Waqwoya Abebe, Pablo Munoz, Ali Jannesari

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Data heterogeneity impacts the performance of Federated Learning (FL) by introducing training noise. Although representative client sampling can help mitigate the issue, it remains challenging to implement without compromising data privacy. This work introduces a new method to address the problem by proposing an affordable blind (privacy preserving) clustering mechanism for conducting stratified client sampling. Inspired by the 'dialect quiz', we propose a 'response test' to cluster clients whose models have learned similar high-level features. This approach facilitates representative client sampling without the need for direct access to client data. We demonstrate empirically that our method yields client samples with low relative entropy with respect to the global data distribution, indicating increased representativeness. Convergence experiments reveal that applying our method significantly improves the convergence and accuracy of the global model compared to strong baselines like SCAFFOLD and FL-CIR. Additionally, the reduced number of training rounds required to achieve target accuracy leads to decreased communication overhead and computational expense, making our approach promising for practical FL implementations.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 17th International Conference on Cloud Computing, CLOUD 2024
EditorsRong N. Chang, Carl K. Chang, Jingwei Yang, Nimanthi Atukorala, Zhi Jin, Michael Sheng, Jing Fan, Kenneth Fletcher, Qiang He, Tevfik Kosar, Santonu Sarkar, Sreekrishnan Venkateswaran, Shangguang Wang, Xuanzhe Liu, Seetharami Seelam, Chandra Narayanaswami, Ziliang Zong
PublisherIEEE Computer Society
Pages20-29
Number of pages10
ISBN (Electronic)9798350368536
DOIs
StatePublished - 2024
Event17th IEEE International Conference on Cloud Computing, CLOUD 2024 - Shenzhen, China
Duration: Jul 7 2024Jul 13 2024

Publication series

NameIEEE International Conference on Cloud Computing, CLOUD
ISSN (Print)2159-6182
ISSN (Electronic)2159-6190

Conference

Conference17th IEEE International Conference on Cloud Computing, CLOUD 2024
Country/TerritoryChina
CityShenzhen
Period07/7/2407/13/24

Keywords

  • Client Sampling
  • Commu-nication Efficiency
  • Federated Learning
  • Performance Gain

Fingerprint

Dive into the research topics of 'Improving Federated Learning Through Low-Entropy Client Sampling Based on Learned High-Level Features'. Together they form a unique fingerprint.

Cite this