Network Gradient Descent Algorithm for Decentralized Federated Learning

Shuyuan Wu, Guanghua School of Management, Peking University, Beijing, China

Danyang Huang (corresponding author), Center for Applied Statistics, Renmin University of China, Beijing, China; School of Statistics, Renmin University of China, Beijing, China

Hansheng Wang, Guanghua School of Management, Peking University, Beijing, China

https://orcid.org/0000-0003-2386-0209

Abstract

We study a fully decentralized federated learning algorithm, which is a novel gradient descent algorithm executed on a communication-based network. For convenience, we refer to it as a network gradient descent (NGD) method. In the NGD method, only statistics (e.g., parameter estimates) need to be communicated, which minimizes the risk of privacy leakage. Meanwhile, different clients communicate with each other directly according to a carefully designed network structure, without a central master. This greatly enhances the reliability of the entire algorithm. These properties motivate us to study the NGD method carefully, both theoretically and numerically. Theoretically, we start with a classical linear regression model. We find that both the learning rate and the network structure play significant roles in determining the NGD estimator’s statistical efficiency. The resulting NGD estimator can be as statistically efficient as the global estimator if the learning rate is sufficiently small and the network structure is weakly balanced, even if the data are distributed heterogeneously. These findings are then extended to general models and loss functions. Extensive numerical studies are presented to corroborate our theoretical findings. Classical deep learning models are also presented for illustration purposes.
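The idea described in the abstract can be illustrated with a minimal sketch for linear regression. The abstract does not state the update rule, so the sketch below assumes one plausible form: each client first averages the current estimates of its network neighbors and then takes one gradient step on its own local least-squares loss. The circular network, the function names (`circular_weights`, `ngd_linear_regression`), the learning rate, and the simulated data are all illustrative assumptions and are not taken from the paper or its Code.zip.

```python
import numpy as np


def circular_weights(num_clients, degree=2):
    """Hypothetical network: client m averages clients m+1, ..., m+degree (mod M).

    Every row and every column sums to 1, so the network is balanced.
    """
    W = np.zeros((num_clients, num_clients))
    for m in range(num_clients):
        for d in range(1, degree + 1):
            W[m, (m + d) % num_clients] = 1.0 / degree
    return W


def ngd_linear_regression(X_parts, y_parts, W, learning_rate=0.01, num_iters=3000):
    """Assumed NGD-style iteration for least squares (a sketch, not the authors' code).

    Only parameter estimates are exchanged between clients; raw data stay local.
    Returns an array of shape (num_clients, p) with one estimate per client.
    """
    num_clients = len(X_parts)
    p = X_parts[0].shape[1]
    # Local sufficient statistics, computed once on each client.
    Sxx = np.stack([X.T @ X / len(y) for X, y in zip(X_parts, y_parts)])
    Sxy = np.stack([X.T @ y / len(y) for X, y in zip(X_parts, y_parts)])
    theta = np.zeros((num_clients, p))
    for _ in range(num_iters):
        theta_avg = W @ theta  # step 1: average the neighbors' estimates
        grad = np.einsum("mij,mj->mi", Sxx, theta_avg) - Sxy  # local gradients
        theta = theta_avg - learning_rate * grad  # step 2: local gradient step
    return theta


# Simulated data: same regression model on every client, but the covariate
# distribution shifts across clients (a simple form of heterogeneity).
rng = np.random.default_rng(0)
M, n, p = 10, 100, 3
theta_true = np.array([1.0, -2.0, 0.5])
shifts = np.linspace(-1.0, 1.0, M)
X_parts = [rng.normal(loc=s, scale=1.0, size=(n, p)) for s in shifts]
y_parts = [X @ theta_true + rng.normal(size=n) for X in X_parts]

W = circular_weights(M)
theta_ngd = ngd_linear_regression(X_parts, y_parts, W)

# Global estimator: ordinary least squares on the pooled data, for comparison.
X_all, y_all = np.vstack(X_parts), np.concatenate(y_parts)
theta_global = np.linalg.lstsq(X_all, y_all, rcond=None)[0]

max_gap = np.abs(theta_ngd - theta_global).max()
print("largest gap between any client's estimate and the global OLS:", max_gap)
```

In this sketch no central server appears: each row of `W` only tells a client whose estimates to read. Rerunning with a larger `learning_rate` or an unbalanced `W` is a simple way to explore how, under these assumptions, the two factors highlighted in the abstract affect the gap to the global estimator.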

Supplementary Materials

Supplementary_Material.pdf: This document provides extensions of the proposed method, the proofs of the theoretical results in the main text, and some additional simulation results. Appendix A provides technical lemmas that are useful for proving the results in the main text. Appendix B contains the detailed proofs of the main theorems and corollaries developed in the main text. Appendix C reports some extensions and discussions of the proposed method.

Code.zip: This file contains the Python code for the proposed method. Please see the “README.md” in the archive for instructions on using the code.

Additional information

Funding

Danyang Huang’s research is partially supported by the National Natural Science Foundation of China (No. 12071477, 71873137); the fund for building world-class universities (disciplines) of Renmin University of China; and the Public Computing Cloud, Renmin University of China. Hansheng Wang’s research is partially supported by the National Natural Science Foundation of China (No. 11831008) and by the Open Research Fund of Key Laboratory of Advanced Theory and Application in Statistics and Data Science (KLATASDS-MOE-ECNU-KLATASDS2101).
