-
Notifications
You must be signed in to change notification settings - Fork 30
Description
Following @rabbanitw's comment, extending our federated averaging methods to support FedProx would greatly improve Disco's robustness to client heterogeneity (data and system).
In short, FedProx adds a regularizing term (called "proximal term") to the local objective function.
From the paper:
The proximal term is beneficial in two aspects:
(1) It addresses the issue of statistical heterogeneity by restricting the local updates to be closer to the initial (global) model without any need to manually set the number of local epochs.
(2) It allows for safely incorporating variable amounts of local work resulting from systems heterogeneity
The algorithm is summarized here.

Disco currently doesn't select a subset of users and relies either on all users or on the first ones that reply with local updates.