Agnostic federated learning
Mehryar Mohri1,2, Gary Sivek1, Ananda Theertha Suresh1
1Google Research, 2Courant Institute
June 11, 2019
1/8
Agnostic federated learning Mehryar Mohri 1 , 2 , Gary Sivek 1 , - - PowerPoint PPT Presentation
Agnostic federated learning Mehryar Mohri 1 , 2 , Gary Sivek 1 , Ananda Theertha Suresh 1 1 Google Research, 2 Courant Institute June 11, 2019 1/8 Federated learning scenario [McMahan et al., 17] centralized model client A client
1Google Research, 2Courant Institute
1/8
centralized model client A client Z client B ◮ Data from large number of clients (phones, sensors) ◮ Data remains distributed over clients ◮ Centralized model trained based on data
2/8
◮ Merge samples from all clients and minimize loss ◮ Domains: clusters of clients ◮ Clients belong to p domains: D1, D2, . . . , Dp
◮ ˆ
◮ ˆ
p
i=1 mi
◮ Minimize loss over uniform distribution
h∈H Lˆ U(h)
3/8
Source: Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis non erat sem Proprietary + Confidential
Inference distribution is not same as training distribution – E.g., training only when the phone is connected to wifj and is being charged
Training Inference
P 4
Source: Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis non erat sem Proprietary + Confidential
Inference distribution is not same as training distribution – E.g., training only when the phone is connected to wifj and is being charged
Training Inference
P 4
4/8
D2 D1 Dp Dλ
◮ Learn model that performs well over any mixture of domains ◮ Dλ = p k=1 λk · ˆ
◮ λ is unknown and belongs to Λ ⊆ ∆p ◮ Minimize the agnostic loss
h∈H max λ∈Λ LDλ(h) ◮ Fairness implications
5/8
◮ Rm(G, λ) : weighted Rademacher complexity ◮ s(λ m) : skewness parameter 1 + χ2(λ, m) ◮ Regularization based on generalization bound
6/8
T
t=1 wt and λA = 1 T
t=1 λt
◮ 1/
◮ Extensions to stochastic mirror descent ◮ Experimental validation of the above results
7/8
8/8