[PPT] - Asymptotic behavior of Multiscaled Gradient Dynamics. Applications PowerPoint Presentation

SLIDE 1

Sixi` emes journ´ ees Franco-Chiliennes d’Optimisation Universit´ e de Toulon 19-21 mai 2008

Asymptotic behavior of Multiscaled Gradient Dynamics. Applications to Coupled systems, Games and PDE’s.

Hedy ATTOUCH (Joint work with M.-O. Czarnecki)

SLIDE 2

SETTING

(MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0. Φ + β(t)Ψ ↑ Φ + δC as t → +∞: (MAG)= “Multiscale Asymptotic Gradient” system. Claim: Under ad hoc conditions on β, Φ, Ψ, z(t) → z∞ ∈ argminCΦ as t → +∞. Motivation: Dynamic and Algorithmic approach to Optimization and Potential Games: min {f(x) + g(y) : Ax − By = 0} .

1

SLIDE 3

COUPLED GRADIENT SYSTEMS

(MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0.

x(t) + ∂f(x(t)) + β(t)At(Ax(t) − By(t)) ∋ 0 ˙ y(t) + ∂g(y(t)) + β(t)Bt(By(t) − Ax(t)) ∋ 0 Claim: z(t) = (x(t), y(t)) → z∞ = (x∞, y∞) where (x∞, y∞) is a solution of min {f(x) + g(y) : Ax − By = 0}.

2

SLIDE 4

EXAMPLE 1: DECOMPOSITION OF DOMAINS IN PDE’s. Ω1 Ω2 Γ Dirichlet problem on Ω: h ∈ L2(Ω) given, find z : Ω → IR solution of −∆z = h on Ω z = 0 on ∂Ω Variational formulation: min

z1 ∈ X1, z2 ∈ X2, [z] = 0

min {f1(z1) + f2(z2) : z1 ∈ X1, z2 ∈ X2, A1(z1) − A2(z2) = 0} . Ai : H1(Ωi) → Z = L2(Γ) is the trace operator, i = 1, 2.

3

SLIDE 5

EXAMPLE 2: POTENTIAL GAMES, BEST RESPONSE DYNAMICS Static loss functions of players 1 and 2:    F : (ξ, η) ∈ X × Y → F(ξ, η) = f(ξ) + βΨ(ξ, η) G : (ξ, η) ∈ X × Y → G(ξ, η) = g(η) + µΨ(ξ, η). Best reply dynamic with cost to change, (players 1 and 2 play alternatively): zk = (xk, yk) − → (xk+1, yk) − → zk+1 = (xk+1, yk+1) k = 0, 1, ... xk+1 = argmin{f(ξ) + βkΨ(ξ, yk) + α

yk+1 = argmin{g(η) + βkΨ(xk+1, η) + ν

Corresponding continuous dynamical system (MAGS): z(t) = (x(t), y(t))

x(t) + ∂f(x(t)) + β(t)∇xΨ(x(t), y(t)) ∋ 0 ˙ y(t) + ∂g(y(t)) + β(t)∇yΨ(x(t), y(t)) ∋ 0 β(t) → +∞ as t → +∞ = increasing weight of the cooperative behaviour aspects.

4

SLIDE 6

2.1 β(t) → +∞. 2.2 ǫ(t) → 0. 2.3 Links with Passty theorem.

3.1 β(t) → +∞: the general case. 3.2 β(t) → +∞: the strongly monotone case. 3.3 β(t) → +∞: the finite dimensional case. 3.4 ǫ(t) → 0.

4.1 domain decomposition for PDE’s. 4.2 potential games and best response dynamics.

5

SLIDE 7

MULTISCALE FEATURES. SLOW-FAST DYNAMICS 1. (MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0 is the combination of two dynamics:

˙ z(t) + ∂Φ(z(t)) ∋ 0.

˙ z(t) + β(t)∂Ψ(z(t)) ∋ 0. Change of time scaling in (2): take t = τ(s) and set z(τ(s)) = w(s). (2) ⇔

w(s) + ∂Ψ(w(s)) ∋ 0. Take ˙ τ(s)β(τ(s)) = 1, i.e., τ(s) β(ξ)dξ = s. Assume +∞ β(ξ)dξ = +∞, then (2) ⇔ ˙ w(s) + ∂Ψ(w(s)) ∋ 0.

6

SLIDE 8

MULTISCALE FEATURES. SLOW-FAST DYNAMICS 2. (MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0. Change of time scaling: take t = τ(s) and set z(τ(s)) = w(s), ǫ(s) =

Equivalent system with ǫ(s) → 0 as s → +∞, +∞ ǫ(s)ds = +∞. ˙ w(s) + ǫ(s)∂Φ(w(s)) + ∂Ψ(w(s)) ∋ 0. From ˙ τ(s)β(τ(s)) = 1 , ˙ τ(s) =

+∞ ǫ(s)ds = lims→+∞τ(s) = +∞. Classical situation: Φ(w) = 1

Att.-Cominetti, Att.-Czarnecki, Cabot, Combettes-Hirstoaga, Peypouquet.

7

SLIDE 9

ERGODIC CONVERGENCE RESULTS: β(t) → +∞ (MAG) ˙ z(t) + A(z(t)) + β(t)∂Ψ(z(t)) ∋ 0.

Ψ∗= Fenchel conjugate of Ψ, σC= support function of C, NC= normal cone to C. Theorem 1 [A.-C.] Let us assume that,

A + NC is a maximal monotone operator and S := (A + NC)−1(0) = ∅ closed convex set.

∀p ∈ NC +∞ β(t)

Then,

t

β(t)Ψ(z(t))dt < +∞.

8

SLIDE 10

Interpretation of the condition (H1) ∀p ∈ NC +∞ β(t)

Ψ∗(z) = 1

(H1) ⇔ +∞

Theorem 1 ⇔ Baillon-Brezis ergodic convergence theorem for ˙ z(t) + A(z(t)) ∋ 0 with A maximal monotone operator.

9

SLIDE 11

ERGODIC CONVERGENCE RESULTS: ǫ(t) → 0 Equivalent system with ǫ(s) → 0 as s → +∞, +∞ ǫ(s)ds = +∞. ˙ w(s) + ǫ(s)A(w(s)) + ∂Ψ(w(s)) ∋ 0.

Theorem 2 [A.-C.] Let us assume that,

A + NC is a maximal monotone operator and S := (A + NC)−1(0) = ∅.

∀p ∈ NC +∞ [Ψ∗(ǫ(s)p) − σC(ǫ(s)p)] ds < +∞. Then, w − lims→+∞ 1 s

s w(τ)ǫ(τ)dτ = w∞ exists with w∞ ∈ S.

10

SLIDE 12

LINKS WITH PASSTY THEOREM

˙ w(s) + ǫ(s)M(w(s)) + ∂Ψ(w(s)) ∋ 0.

x(s) + ǫ(s)A(x(s)) + x(s) − y(s) ∋ 0 ˙ y(t) + ǫ(s)B(y(s)) + y(s) − x(s) ∋ 0 Discrete version:

yk+1 − yk + ǫ(sk)B(yk+1) + yk − xk+1 ∋ 0 yk+1 = (I + ǫkB)−1(I + ǫkA)−1yk Theorem [Passty, JMMA, 1979]: Suppose (ǫk)k∈II N ∈ l2(II N) \ l1(II N), then zn =

1 ǫk

n

11

SLIDE 13

FROM ERGODIC CONVERGENCE TO CONVERGENCE: β(t) → +∞ Take A = ∂Φ a subdifferential operator, and use energy estimates. (MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0. Theorem 3 [A.-C.] Let us assume

β : IR+ → IR+ is a smooth (C1) increasing function and there exists some positive constant k > 0 such that for t large enough: ˙ β(t) ≤ kβ(t). Then,

exists with z∞ ∈ S.

12

SLIDE 14

STRONG CONVERGENCE RESULTS (MAG) ˙ z(t) + A(z(t)) + β(t)∂Ψ(z(t)) ∋ 0.

∃α > 0 such that Au − Av, u − v ≥ αu − v2 ∀u, v ∈ H.

Theorem 4 [A.-C.] Let us assume that A is a strongly monotone operator and

A + NC is a maximal monotone operator.

Then,

13

SLIDE 15

CONVERGENCE RESULTS: THE FINITE DIMENSIONAL CASE (MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0. Equivalent system with ǫ(s) → 0 as s → +∞, +∞ ǫ(s)ds = +∞ : ˙ w(s) + ǫ(s)∂

From Baillon and Cominetti, J. Funct. Analysis (2001) dist(w(s), S) tends to 0 as s → +∞. Combining with the Fejer monotonicity property (valid under (H1) ) + Opial’s lemma ⇒:

14

SLIDE 16

(MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0. Theorem 5 [A.-C.] Let us assume that

∀p ∈ NC +∞ β(t)

Then,

exists with z∞ ∈ S.

15

SLIDE 17

CONVERGENCE RESULTS: ǫ(t) → 0 Equivalent system with ǫ(s) → 0 as s → +∞, +∞ ǫ(s)ds = +∞ : ˙ w(s) + ǫ(s)∂Φ(w(s)) + ∂Ψ(w(s)) ∋ 0.

Theorem 6 [A.-C.] Let us assume that,

∂Φ + NC is a maximal monotone operator and S := (∂Φ + NC)−1(0) = ∅.

∀p ∈ NC +∞ [Ψ∗(ǫ(s)p) − σC(ǫ(s)p)] ds < +∞.

There exists some k > 0 such that for s large enough − ˙

Then, weak − limt→+∞ w(t) = w∞ exists with w∞ ∈ S. limt→+∞ Ψ(w(t)) = 0, limt→+∞ Φ(w(t)) = infCΦ.

16

SLIDE 18

DECOMPOSITION OF DOMAINS IN PDE’s. Ω1 Ω2 Γ Dirichlet problem on Ω: h ∈ L2(Ω) given, find u : Ω → IR solution of −∆u = h on Ω u = 0 on ∂Ω Variational formulation: min

v1 ∈ X1, v2 ∈ X2, [v] = 0

min {f1(v1) + f2(v2) : v1 ∈ X1, v2 ∈ X2, A1(v1) − A2(v2) = 0} . Ai : H1(Ωi) → Z = L2(Γ) is the trace operator, i = 1, 2.

17

SLIDE 19

Continuous dynamical system:                −∆∂u1

− ∆∂u2

Discrete version: Alternating Algorithm with Dirichlet-Neumann transmission conditions: (u1,k, u2,k) → (u1,k+1, u2,k) → (u1,k+1, u2,k+1) with βk → +∞.        −(1 + α)∆u1,k+1 = h1 − α∆u1,k on Ω1 (1 + α)

+ βku1,k+1 = βku2,k + α

u1,k+1 = 0 on ∂Ω1 ∩ ∂Ω          −(1 + α)∆u2,k+1 = h2 − α∆u2,k on Ω2 (1 + α)

+ βku2,k+1 = βku1,k+1 + α

u2,k+1 = 0 on ∂Ω2 ∩ ∂Ω

18

SLIDE 20

POTENTIAL GAMES AND BEST RESPONSE DYNAMICS Static loss functions of players 1 and 2:    F : (ξ, η) ∈ X × Y → F(ξ, η) = f(ξ) + βΨ(ξ, η) G : (ξ, η) ∈ X × Y → G(ξ, η) = g(η) + µΨ(ξ, η). Best reply dynamic with cost to change, (players 1 and 2 play alternatively): (xk, yk) − → (xk+1, yk) − → (xk+1, yk+1) k = 0, 1, ... xk+1 = argmin{f(ξ) + βkΨ(ξ, yk) + α

yk+1 = argmin{g(η) + βkΨ(xk+1, η) + ν

Corresponding continuous dynamical system (MAGS):

x(t) + ∂f(x(t)) + β(t)∇xΨ(x(t), y(t)) ∋ 0 ˙ y(t) + ∂g(y(t)) + β(t)∇yΨ(x(t), y(t)) ∋ 0 β(t) → +∞ as t → +∞ = increasing weight of the collective behaviour aspects.

19

SLIDE 21

PERSPECTIVES

β(t) ≤ kβ(t) optimal? Examples, counterexamples.

projection...)?

synchronization.

20

SLIDE 22

REFERENCES

equations, Differential Equations, 1994.

mation with the steepest descent method, J. Differential Equations, 1996.

with non-isolated equilibria, J. Differential Equations, 2002.

coupled convex minimization problems, Journal of Convex Analysis, 2008.

ezis, Une remarque sur le comportement asymptotique des semi-groupes non lin´ eaires, Houston J. Math, 1976.

equations, J. Funct. Analysis, 2001.

21

SLIDE 23

Journal of Functional Analysis, 1975.

minimization, ESAIM Control Calc. Var., 2004.

Nonlinear Analysis, TMA., 2008.

linear evolution equations J. Differential Equations, 1986.

esolution de probl` emes d’´ equilibre, de point fixe et d’inclusion monotone, th` ese Universit´ e Paris 6, 2006.

emes d’´ evolution et applications en optimisa- tion, th` ese Universit´ e Paris 6 et Universit´ e du Chili, 2007.

22

Sixi` emes journ´ ees Franco-Chiliennes d’Optimisation Universit´ e de Toulon 19-21 mai 2008

Asymptotic behavior of Multiscaled Gradient Dynamics. Applications to Coupled systems, Games and PDE’s.

Hedy ATTOUCH (Joint work with M.-O. Czarnecki)

SETTING

COUPLED GRADIENT SYSTEMS

(MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0.

x(t) + ∂f(x(t)) + β(t)At(Ax(t) − By(t)) ∋ 0 ˙ y(t) + ∂g(y(t)) + β(t)Bt(By(t) − Ax(t)) ∋ 0 Claim: z(t) = (x(t), y(t)) → z∞ = (x∞, y∞) where (x∞, y∞) is a solution of min {f(x) + g(y) : Ax − By = 0}.

EXAMPLE 1: DECOMPOSITION OF DOMAINS IN PDE’s. Ω1 Ω2 Γ Dirichlet problem on Ω: h ∈ L2(Ω) given, find z : Ω → IR solution of −∆z = h on Ω z = 0 on ∂Ω Variational formulation: min

z1 ∈ X1, z2 ∈ X2, [z] = 0

min {f1(z1) + f2(z2) : z1 ∈ X1, z2 ∈ X2, A1(z1) − A2(z2) = 0} . Ai : H1(Ωi) → Z = L2(Γ) is the trace operator, i = 1, 2.

yk+1 = argmin{g(η) + βkΨ(xk+1, η) + ν

Corresponding continuous dynamical system (MAGS): z(t) = (x(t), y(t))

x(t) + ∂f(x(t)) + β(t)∇xΨ(x(t), y(t)) ∋ 0 ˙ y(t) + ∂g(y(t)) + β(t)∇yΨ(x(t), y(t)) ∋ 0 β(t) → +∞ as t → +∞ = increasing weight of the cooperative behaviour aspects.

CONTENTS

2.1 β(t) → +∞. 2.2 ǫ(t) → 0. 2.3 Links with Passty theorem.

3.1 β(t) → +∞: the general case. 3.2 β(t) → +∞: the strongly monotone case. 3.3 β(t) → +∞: the finite dimensional case. 3.4 ǫ(t) → 0.

4.1 domain decomposition for PDE’s. 4.2 potential games and best response dynamics.

MULTISCALE FEATURES. SLOW-FAST DYNAMICS 1. (MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0 is the combination of two dynamics:

˙ z(t) + ∂Φ(z(t)) ∋ 0.

˙ z(t) + β(t)∂Ψ(z(t)) ∋ 0. Change of time scaling in (2): take t = τ(s) and set z(τ(s)) = w(s). (2) ⇔

w(s) + ∂Ψ(w(s)) ∋ 0. Take ˙ τ(s)β(τ(s)) = 1, i.e., τ(s) β(ξ)dξ = s. Assume +∞ β(ξ)dξ = +∞, then (2) ⇔ ˙ w(s) + ∂Ψ(w(s)) ∋ 0.

MULTISCALE FEATURES. SLOW-FAST DYNAMICS 2. (MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0. Change of time scaling: take t = τ(s) and set z(τ(s)) = w(s), ǫ(s) =

Equivalent system with ǫ(s) → 0 as s → +∞, +∞ ǫ(s)ds = +∞. ˙ w(s) + ǫ(s)∂Φ(w(s)) + ∂Ψ(w(s)) ∋ 0. From ˙ τ(s)β(τ(s)) = 1 , ˙ τ(s) =

+∞ ǫ(s)ds = lims→+∞τ(s) = +∞. Classical situation: Φ(w) = 1

Att.-Cominetti, Att.-Czarnecki, Cabot, Combettes-Hirstoaga, Peypouquet.

ERGODIC CONVERGENCE RESULTS: β(t) → +∞ (MAG) ˙ z(t) + A(z(t)) + β(t)∂Ψ(z(t)) ∋ 0.

Ψ∗= Fenchel conjugate of Ψ, σC= support function of C, NC= normal cone to C. Theorem 1 [A.-C.] Let us assume that,

A + NC is a maximal monotone operator and S := (A + NC)−1(0) = ∅ closed convex set.

∀p ∈ NC +∞ β(t)

Then,

t

β(t)Ψ(z(t))dt < +∞.

Interpretation of the condition (H1) ∀p ∈ NC +∞ β(t)

Ψ∗(z) = 1

(H1) ⇔ +∞

Theorem 1 ⇔ Baillon-Brezis ergodic convergence theorem for ˙ z(t) + A(z(t)) ∋ 0 with A maximal monotone operator.

ERGODIC CONVERGENCE RESULTS: ǫ(t) → 0 Equivalent system with ǫ(s) → 0 as s → +∞, +∞ ǫ(s)ds = +∞. ˙ w(s) + ǫ(s)A(w(s)) + ∂Ψ(w(s)) ∋ 0.

Theorem 2 [A.-C.] Let us assume that,

A + NC is a maximal monotone operator and S := (A + NC)−1(0) = ∅.

∀p ∈ NC +∞ [Ψ∗(ǫ(s)p) − σC(ǫ(s)p)] ds < +∞. Then, w − lims→+∞ 1 s

s w(τ)ǫ(τ)dτ = w∞ exists with w∞ ∈ S.

LINKS WITH PASSTY THEOREM

˙ w(s) + ǫ(s)M(w(s)) + ∂Ψ(w(s)) ∋ 0.

x(s) + ǫ(s)A(x(s)) + x(s) − y(s) ∋ 0 ˙ y(t) + ǫ(s)B(y(s)) + y(s) − x(s) ∋ 0 Discrete version:

yk+1 − yk + ǫ(sk)B(yk+1) + yk − xk+1 ∋ 0 yk+1 = (I + ǫkB)−1(I + ǫkA)−1yk Theorem [Passty, JMMA, 1979]: Suppose (ǫk)k∈II N ∈ l2(II N) \ l1(II N), then zn =

n

FROM ERGODIC CONVERGENCE TO CONVERGENCE: β(t) → +∞ Take A = ∂Φ a subdifferential operator, and use energy estimates. (MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0. Theorem 3 [A.-C.] Let us assume

β : IR+ → IR+ is a smooth (C1) increasing function and there exists some positive constant k > 0 such that for t large enough: ˙ β(t) ≤ kβ(t). Then,

exists with z∞ ∈ S.

STRONG CONVERGENCE RESULTS (MAG) ˙ z(t) + A(z(t)) + β(t)∂Ψ(z(t)) ∋ 0.

∃α > 0 such that Au − Av, u − v ≥ αu − v2 ∀u, v ∈ H.

Theorem 4 [A.-C.] Let us assume that A is a strongly monotone operator and

A + NC is a maximal monotone operator.

Then,

CONVERGENCE RESULTS: THE FINITE DIMENSIONAL CASE (MAG) ˙ z(t) + ∂Φ(z(t)) + β(t)∂Ψ(z(t)) ∋ 0. Equivalent system with ǫ(s) → 0 as s → +∞, +∞ ǫ(s)ds = +∞ : ˙ w(s) + ǫ(s)∂

From Baillon and Cominetti, J. Funct. Analysis (2001) dist(w(s), S) tends to 0 as s → +∞. Combining with the Fejer monotonicity property (valid under (H1) ) + Opial’s lemma ⇒: