A Study of Enhancing Federated Learning on Non-IID Data with Server Learning

IEEE Trans. Artif. Intell., Early Access, Jul. 2024. doi: 10.1109/TAI.2024.3430250.

Abstract

Federated Learning (FL) has emerged as a means of distributed learning in which clients train on their locally stored data under the coordination of a server. Recent studies have shown that FL can suffer from poor accuracy and slow convergence when the training data at the clients are not independent and identically distributed (IID). Here, we consider auxiliary server learning as a complementary approach to improving the performance of FL on non-IID data. Our analysis and experiments show that this approach can yield significant improvements in both model accuracy and convergence time, even when the dataset available to the server is small and its distribution differs from that of the clients' aggregate data. Moreover, experimental results suggest that auxiliary server learning remains beneficial when combined with other techniques proposed to mitigate the performance degradation of FL on non-IID data.

Keywords: Distributed Machine Learning; Federated Learning; Non-IID Data.
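
The abstract does not spell out the training procedure, but one common way to realize auxiliary server learning is to interleave a server-side update on a small server-held dataset with standard FedAvg aggregation. The sketch below illustrates that idea under those assumptions only; the model, data loaders, learning rates, and helper names (local_update, fedavg, server_learning_round) are hypothetical placeholders, not the authors' implementation.

```python
# Minimal sketch of FedAvg with an auxiliary server-learning step.
# Assumptions: a PyTorch classification model, per-client DataLoaders, and a
# small server-side DataLoader whose distribution may differ from the clients'.
import copy
import torch


def local_update(global_model, loader, epochs=1, lr=0.01):
    """Train a copy of the global model on one client's (possibly non-IID) data."""
    model = copy.deepcopy(global_model)
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = torch.nn.CrossEntropyLoss()
    for _ in range(epochs):
        for x, y in loader:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()
    return model.state_dict(), len(loader.dataset)


def fedavg(states_and_sizes):
    """Weighted average of client parameters, weighted by local dataset size."""
    total = sum(n for _, n in states_and_sizes)
    avg = copy.deepcopy(states_and_sizes[0][0])
    for key in avg:
        avg[key] = sum(state[key] * (n / total) for state, n in states_and_sizes)
    return avg


def server_learning_round(global_model, client_loaders, server_loader, lr=0.01):
    """One communication round: client updates, aggregation, then server learning."""
    results = [local_update(global_model, dl) for dl in client_loaders]
    global_model.load_state_dict(fedavg(results))

    # Auxiliary server learning: refine the aggregated model on the small
    # server-side dataset before broadcasting it in the next round.
    opt = torch.optim.SGD(global_model.parameters(), lr=lr)
    loss_fn = torch.nn.CrossEntropyLoss()
    for x, y in server_loader:
        opt.zero_grad()
        loss_fn(global_model(x), y).backward()
        opt.step()
    return global_model
```

In this sketch the server step is just a few SGD passes on the server's own data after each aggregation; the paper's analysis concerns when such a step helps despite the server dataset being small and distributed differently from the clients' data.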