Online Sinkhorn: Optimal Transport distances from sample streams
Arthur Mensch Joint work with Gabriel Peyr´ e
´ Ecole Normale Sup´ erieure D´ epartement de Math´ ematiques et Applications Paris, France
CIRM, 3/12/2020
Online Sinkhorn: Optimal Transport distances from sample streams - - PowerPoint PPT Presentation
Online Sinkhorn: Optimal Transport distances from sample streams Arthur Mensch Joint work with Gabriel Peyr e Ecole Normale Sup erieure D epartement de Math ematiques et Applications Paris, France CIRM, 3/12/2020 Optimal
CIRM, 3/12/2020
1 / 29
1 / 29
1 / 29
Réseau de neurone Échantillon
Figure: StyleGAN2
2 / 29
Réseau de neurone Échantillon
Figure: StyleGAN2
2 / 29
Réseau de neurone Échantillon
Figure: StyleGAN2
2 / 29
1
2
3 / 29
n
n
4 / 29
n
n
4 / 29
n
n
4 / 29
P ∈△n×m P 1=a,P ⊤1=b
Information Processing Systems. 2013. 5 / 29
P ∈△n×m P 1=a,P ⊤1=b
C(α, β) =
P ∈△n×m P 1=a,P ⊤1=b
Information Processing Systems. 2013. 5 / 29
6 / 29
π∈U(α,β)π, C + KL(π|α ⊗ β)
6 / 29
π∈U(α,β)π, C + KL(π|α ⊗ β)
i,j Pi,jδxi,yj
6 / 29
π∈U(α,β)π, C + KL(π|α ⊗ β)
f,g∈C(X)α, f+β, g−α⊗β, exp(f ⊕g−C)+1
Proceedings of Symposia in Pure Mathematics. Vol. 18.1. 1970, pp. 241–250.
3Richard Sinkhorn. “A relationship between arbitrary positive matrices and doubly stochastic matrices”. In: The
Annals of Mathematical Statistics 35 (1964), pp. 876–879. 7 / 29
π∈U(α,β)π, C + KL(π|α ⊗ β)
f,g∈C(X)α, f+β, g−α⊗β, exp(f ⊕g−C)+1
Proceedings of Symposia in Pure Mathematics. Vol. 18.1. 1970, pp. 241–250.
3Richard Sinkhorn. “A relationship between arbitrary positive matrices and doubly stochastic matrices”. In: The
Annals of Mathematical Statistics 35 (1964), pp. 876–879. 7 / 29
π∈U(α,β)π, C + KL(π|α ⊗ β)
f,g∈C(X)α, f+β, g−α⊗β, exp(f ⊕g−C)+1
Proceedings of Symposia in Pure Mathematics. Vol. 18.1. 1970, pp. 241–250.
3Richard Sinkhorn. “A relationship between arbitrary positive matrices and doubly stochastic matrices”. In: The
Annals of Mathematical Statistics 35 (1964), pp. 876–879. 7 / 29
π∈U(α,β)π, C + KL(π|α ⊗ β)
f,g∈C(X)α, f+β, g−α⊗β, exp(f ⊕g−C)+1
Proceedings of Symposia in Pure Mathematics. Vol. 18.1. 1970, pp. 241–250.
3Richard Sinkhorn. “A relationship between arbitrary positive matrices and doubly stochastic matrices”. In: The
Annals of Mathematical Statistics 35 (1964), pp. 876–879. 7 / 29
n
n
8 / 29
n
n
8 / 29
Sampling ˆ
Cost C = (C(xi, yj))i,j −
Sinkhorn W(ˆ
b
b
9 / 29
Sampling ˆ
Cost C = (C(xi, yj))i,j −
Sinkhorn W(ˆ
b
b
9 / 29
Sampling ˆ
Cost C = (C(xi, yj))i,j −
Sinkhorn W(ˆ
nt+1
nt+1
Repeated sampling (ˆ
Cost + transform (ft, gt)t
9 / 29
Sampling ˆ
Cost C = (C(xi, yj))i,j −
Sinkhorn W(ˆ
nt+1
nt+1
Repeated sampling (ˆ
Cost + transform (ft, gt)t
9 / 29
f,g∈C(X)α, f+β, g−α⊗β, exp(f ⊕g−C)+1
e, and F. Bach. “Stochastic Optimization for Large-scale Optimal Transport”. In: Advances in Neural Information Processing Systens. 2016, pp. 3432–3440. 10 / 29
f,g∈C(X)α, f+β, g−α⊗β, exp(f ⊕g−C)+1
e, and F. Bach. “Stochastic Optimization for Large-scale Optimal Transport”. In: Advances in Neural Information Processing Systens. 2016, pp. 3432–3440. 10 / 29
f,g∈C(X)α, f+β, g−α⊗β, exp(f ⊕g−C)+1
t
t
e, and F. Bach. “Stochastic Optimization for Large-scale Optimal Transport”. In: Advances in Neural Information Processing Systens. 2016, pp. 3432–3440. 10 / 29
11 / 29
√ t)
11 / 29
12 / 29
1
2
13 / 29
14 / 29
nt
nt
14 / 29
nt
nt
14 / 29
nt
nt
14 / 29
15 / 29
nt+1
nt+1
15 / 29
nt+1
nt+1
nt+1
nt+1
15 / 29
16 / 29
nt+1
5Persi Diaconis and David Freedman. “Iterated random functions”. In: SIAM Review 41.1 (1999), pp. 45–76.
17 / 29
nt+1
5Persi Diaconis and David Freedman. “Iterated random functions”. In: SIAM Review 41.1 (1999), pp. 45–76.
17 / 29
nt+1
5Persi Diaconis and David Freedman. “Iterated random functions”. In: SIAM Review 41.1 (1999), pp. 45–76.
17 / 29
nt+1
5Persi Diaconis and David Freedman. “Iterated random functions”. In: SIAM Review 41.1 (1999), pp. 45–76.
17 / 29
nt+1
5Persi Diaconis and David Freedman. “Iterated random functions”. In: SIAM Review 41.1 (1999), pp. 45–76.
17 / 29
18 / 29
18 / 29
βt(gt)) = 1
nt+1
βt(gt))
18 / 29
βt(gt)) = 1
nt+1
βt(gt))
19 / 29
βt(gt)) = 1
nt+1
βt(gt))
19 / 29
t) computations: cost + fixed point iterations
20 / 29
t) computations: cost + fixed point iterations
20 / 29
ηt b1/2
t
21 / 29
6Aad W. Van der Vaart. Asymptotic statistics. Cambridge University Press, 2000.
22 / 29
6Aad W. Van der Vaart. Asymptotic statistics. Cambridge University Press, 2000.
22 / 29
√ b) approximate solution
6Aad W. Van der Vaart. Asymptotic statistics. Cambridge University Press, 2000.
22 / 29
f,g∈C(X)α, f + β, g − α ⊗ β, exp(f ⊕ g − C) + 1
23 / 29
f,g∈C(X)α, f + β, g − α ⊗ β, exp(f ⊕ g − C) + 1
µ,ν∈M+(X) KL(α|µ) + KL(β|ν) + µ ⊗ ν, exp(−C) − 1
23 / 29
f,g∈C(X)α, f + β, g − α ⊗ β, exp(f ⊕ g − C) + 1
µ,ν∈M+(X) KL(α|µ) + KL(β|ν) + µ ⊗ ν, exp(−C) − 1
23 / 29
f,g∈C(X)α, f + β, g − α ⊗ β, exp(f ⊕ g − C) + 1
µ,ν∈M+(X) KL(α|µ) + KL(β|ν) + µ ⊗ ν, exp(−C) − 1
23 / 29
24 / 29
nt
nt
25 / 29
26 / 29
27 / 29
nt
nt
28 / 29
29 / 29
◮ Mensch, Arthur and Gabriel Peyr´
from sample streams”. In: arXiv preprint arXiv:2003.01415 (2020). ◮ Cuturi, M. “Sinkhorn Distances: Lightspeed Computation of Optimal Transport”. In: Advances in Neural Information Processing Systems. 2013. ◮ Diaconis, Persi and David Freedman. “Iterated random functions”. In: SIAM Review 41.1 (1999), pp. 45–76. ◮ Genevay, A., M. Cuturi, G. Peyr´ e, and F. Bach. “Stochastic Optimization for Large-scale Optimal Transport”. In: Advances in Neural Information Processing
◮ Rockafellar, R. T. “Monotone operators associated with saddle-functions and minimax problems”. In: Proceedings of Symposia in Pure Mathematics. Vol. 18.1. 1970,
◮ Sinkhorn, Richard. “A relationship between arbitrary positive matrices and doubly stochastic matrices”. In: The Annals of Mathematical Statistics 35 (1964),
◮ Van der Vaart, Aad W. Asymptotic statistics. Cambridge University Press, 2000.
1 / 1