A Generic Mean Field Model for Optimization in Large-scale - - PowerPoint PPT Presentation

a generic mean field model for optimization in large
SMART_READER_LITE
LIVE PREVIEW

A Generic Mean Field Model for Optimization in Large-scale - - PowerPoint PPT Presentation

A Generic Mean Field Model for Optimization in Large-scale Stochastic Systems and Applications in Scheduling Nicolas Gast Bruno Gaujal Grenoble University Knoxville, May 13th-15th 2009 N. Gast (LIG) Mean Field Optimization Knoxville 2009 1


slide-1
SLIDE 1

A Generic Mean Field Model for Optimization in Large-scale Stochastic Systems and Applications in Scheduling

Nicolas Gast Bruno Gaujal

Grenoble University

Knoxville, May 13th-15th 2009

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 1 / 20

slide-2
SLIDE 2

Introduction

Mean field has been introduced by physicists to study systems of interacting objects. For example, the movement of particles in the air: First solution: the microscopic description The system is represented by the states of each particle. Many equations for each possible collision: impossible to solve exactly. Second solution (better!): macroscopic equations We are interested by the average behavior of the system: The system is described by its temperature. Deterministic equation. The transition from microscopic description to macroscopic equations is called the mean field approximation.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 2 / 20

slide-3
SLIDE 3

Mean Field in Computer Science

More recently, Mean field has been used to analyze performance of communication systems. The objects are the users in the system. For example: Performance of TCP [Baccelli, McDonald, Reynier [02]] Reputation Systems [Le Boudec et al. [07]] 802.11 [Bordenave, McDonald, Prouti` ere [05]] . . . In many example, it can be shown that when the number of users grows, the average behavior of the system becomes deterministic.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 3 / 20

slide-4
SLIDE 4

Mean Field in Computer Science

More recently, Mean field has been used to analyze performance of communication systems. The objects are the users in the system. For example: Performance of TCP [Baccelli, McDonald, Reynier [02]] Reputation Systems [Le Boudec et al. [07]] 802.11 [Bordenave, McDonald, Prouti` ere [05]] . . . In many example, it can be shown that when the number of users grows, the average behavior of the system becomes deterministic. Aim of this talk Show that mean field can also be used for optimization problem. Study a general framwork for wich we can prove the results.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 3 / 20

slide-5
SLIDE 5

Example of mean field model

Example – Consider the following brokering problem: . . . S on/off sources tasks a1

t

ad

t

Broker . . . µ1 µ1 P1 processors E1 . . . µd µd Pd processors Ed . . . Stochastic system Objects are sources+Processors: There are S + P1 + · · · + Pd objects The state of an object is active or inactive (random) Evolution of state is markovian

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 4 / 20

slide-6
SLIDE 6

Example of mean field model

Example – Consider the following brokering problem: . . . S on/off sources tasks a1

t

ad

t

Broker . . . µ1 µ1 P1 processors E1 . . . µd µd Pd processors Ed . . . Stochastic system Objects are sources+Processors: There are S + P1 + · · · + Pd objects The state of an object is active or inactive (random) Evolution of state is markovian Mean field limit We scale S and Pi by N. We are interested in: (number of tasks sent)/N. (available processors in cluster i)/N.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 4 / 20

slide-7
SLIDE 7

Main result

Stochastic system N

  • bjects.

Optimal system Compute optimal policy (hard) Remark: the purpose of this talk is not to solve the previous example but to study a general framework for optimization in stochastic systems.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 5 / 20

slide-8
SLIDE 8

Main result

Stochastic system N

  • bjects.

Optimal system Mean Field Limit Compute optimal policy (hard) N → ∞

1

Compute the mean field limit (constructive definition).

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 5 / 20

slide-9
SLIDE 9

Main result

Stochastic system N

  • bjects.

Optimal system Mean Field Limit Optimal mean field Compute optimal policy (hard) N → ∞ Deterministic

  • ptimization

(Easier)

1

Compute the mean field limit (constructive definition).

2

Solve the deterministic problem.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 5 / 20

slide-10
SLIDE 10

Main result

Stochastic system N

  • bjects.

Optimal system Mean Field Limit Optimal mean field Compute optimal policy (hard) N → ∞ Deterministic

  • ptimization

(Easier) We can exchange the limits Our results The optimal stochastic system also converges. More precisely, when N grows:

1

The optimal reward converges.

2

The optimal policy also converges.

3

The speed of convergence is O( √ N) (CLT theorem).

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 5 / 20

slide-11
SLIDE 11

1

Theoretical Results

2

A (simple) example

3

Conclusion

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 6 / 20

slide-12
SLIDE 12

An optimization problem

N objects evolving in a finite state space. Environment E(t) at time t (E(t) ∈ Rd)  

X1(0)

. . .

XN(0) E(0)

   

X1(1)

. . .

XN(1) E(1)

   

X1(T)

. . .

XN(T) E(T)

  random( 0) random( 1) random( T−1)

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 7 / 20

slide-13
SLIDE 13

An optimization problem

N objects evolving in a finite state space. Environment E(t) at time t (E(t) ∈ Rd)  

X1(0)

. . .

XN(0) E(0)

   

X1(1)

. . .

XN(1) E(1)

   

X1(T)

. . .

XN(T) E(T)

  random( 0) random( 1) random( T−1) Mean field assumption We define the Population mix M(t) – The ith component (M(t))i is the proportion of objects in state i at time t.

1

E(t + 1) only depends on the population mix M(t).

2

The evolution of an object depends on E(t) but is independent of the

  • ther objects.
  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 7 / 20

slide-14
SLIDE 14

An optimization problem

Controller, chooses an action a ∈ A a0 a1 aT−1  

X1(0)

. . .

XN(0) E(0)

   

X1(1)

. . .

XN(1) E(1)

   

X1(T)

. . .

XN(T) E(T)

  random(a0) random(a1) random(aT−1) The controller can change the dynamics of the system.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 7 / 20

slide-15
SLIDE 15

An optimization problem

Controller, chooses an action a ∈ A a0 a1 aT−1  

X1(0)

. . .

XN(0) E(0)

   

X1(1)

. . .

XN(1) E(1)

   

X1(T)

. . .

XN(T) E(T)

  random(a0) random(a1) random(aT−1) Cost(X,E) Cost(X,E) + . . . + The controller can change the dynamics of the system. Goal Find a policy to maximize: finite-time expected cost or expected discounted cost Technical assumption: Mean field evolution Action set compact Continuous parameters

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 7 / 20

slide-16
SLIDE 16

Optimal cost convergence

V ∗N – optimal cost for the system of size N. v∗ – optimal cost for the deterministic limit. a∗

0a∗ 1a∗ 2 . . . – optimal actions for the deterministic limit.

Theorem (Convergence of the optimal cost) Under technical assumptions, for both discounted and finite-time cost: limN→∞ V ∗N = v∗ limN→∞ V ∗N = V N

a∗

0 a∗ 1 a∗ 2 ...

(a.s.) In particular, this shows that: Optimal cost converges Static policy a∗ is asymptotically optimal

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 8 / 20

slide-17
SLIDE 17

A central limit-like theorem

The convergence speed is in O(1/ √ N): Theorem (CLT for the evolution of objects) Under technical assumptions, if the actions taken by the controller are fixed, then there exists a Gaussian variable Gt s.t: √ N

  • MN

t − mt

Law − − → Gt The covariance of Gt can be effectively computed.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 9 / 20

slide-18
SLIDE 18

A central limit-like theorem

The convergence speed is in O(1/ √ N): Theorem (CLT for the evolution of objects) Under technical assumptions, if the actions taken by the controller are fixed, then there exists a Gaussian variable Gt s.t: √ N

  • MN

t − mt

Law − − → Gt The covariance of Gt can be effectively computed. Theorem (CLT for cost) Under technical assumptions, when N goes to infinity: √ N

  • V ∗N

T

− V N

a∗

  • ≤st

β + γG0∞ √ N

  • V ∗N

T

− v∗

T

  • ≤st

β′ + γ′G0∞

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 9 / 20

slide-19
SLIDE 19

1

Theoretical Results

2

A (simple) example

3

Conclusion

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 10 / 20

slide-20
SLIDE 20

A simple resource allocation problem

Aim of the example Illustrate the framework by a concrete example When does N(= S + P1 + · · · + Pd) becomes large enough for the approximation to apply?

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 11 / 20

slide-21
SLIDE 21

A simple resource allocation problem

Aim of the example Illustrate the framework by a concrete example When does N(= S + P1 + · · · + Pd) becomes large enough for the approximation to apply? . . . S on/off sources tasks a1

t

ad

t

Broker . . . µ1 µ1 P1 processors E1 . . . µd µd Pd processors Ed . . . Stochastic arrivals Stochastic availability: failure,. . . Optimize the total completion time = T

t=0

d

i=1 Ei(t).

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 11 / 20

slide-22
SLIDE 22

Optimal policy: stochastic and limit case

The stochastic system is hard to solve

1

This problem is a multidimensional restless bandit problem

◮ Known to be hard ◮ Existence of heuristics (Index policies) 2

In practice in such systems [EGEE]

◮ Use of heuristics (JSQ)

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 12 / 20

slide-23
SLIDE 23

Optimal policy: stochastic and limit case

The stochastic system is hard to solve

1

This problem is a multidimensional restless bandit problem

◮ Known to be hard ◮ Existence of heuristics (Index policies) 2

In practice in such systems [EGEE]

◮ Use of heuristics (JSQ)

Using our framework: compute optimal mean field The problem becomes: Find an allocation to minimize the idle time of processors. All variable are in Rd. The optimal policy can be computed by a greedy algorithm.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 12 / 20

slide-24
SLIDE 24

Computing the optimal policy π∗

Time t 1 2 3 4 5 6 Arrival of packets 9 1 1 7 6 6 Queue 1 I I Queue 2 Queue 3 I I Optimal allocation Grey = “off” processors. I : initial packets. P0: packets arrived at time 0.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 13 / 20

slide-25
SLIDE 25

Computing the optimal policy π∗

Time t 1 2 3 4 5 6 Arrival of packets 9 1 1 7 6 6 Queue 1 I P0 P0 I P0 P0 P0 Queue 2 P0 Queue 3 I P0 I P0 P0 Optimal allocation 5 1 3 Grey = “off” processors. I : initial packets. P0: packets arrived at time 0.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 13 / 20

slide-26
SLIDE 26

Computing the optimal policy π∗

Time t 1 2 3 4 5 6 Arrival of packets 9 1 1 7 6 6 Queue 1 I P0 P0 I P0 P0 P0 Queue 2 P0 Queue 3 I P0 P1 I P0 P0 Optimal allocation 5 . 1 . 3 1 Grey = “off” processors. I : initial packets. P0: packets arrived at time 0.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 13 / 20

slide-27
SLIDE 27

Computing the optimal policy π∗

Time t 1 2 3 4 5 6 Arrival of packets 9 1 1 7 6 6 Queue 1 I P0 P0 P3 I P0 P0 P0 Queue 2 P0 Queue 3 I P0 P1 I P0 P0 Optimal allocation 5 . . 1 1 . . . 3 1 . . Grey = “off” processors. I : initial packets. P0: packets arrived at time 0.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 13 / 20

slide-28
SLIDE 28

Computing the optimal policy π∗

Time t 1 2 3 4 5 6 Arrival of packets 9 1 1 7 6 6 Queue 1 I P0 P0 P3 P4 P4 I P0 P0 P4 P0 P4 P4 Queue 2 P0 Queue 3 I P0 P1 P4 I P0 P4 P0 Optimal allocation 5 . . 1 5 1 . . . . 3 1 . . 2 Grey = “off” processors. I : initial packets. P0: packets arrived at time 0.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 13 / 20

slide-29
SLIDE 29

Computing the optimal policy π∗

Time t 1 2 3 4 5 6 Arrival of packets 9 1 1 7 6 6 Queue 1 I P0 P0 P3 P4 P4 I P0 P0 P4 P5 P0 P4 P4 Queue 2 P0 P5 P5 Queue 3 I P0 P1 P4 P5 I P0 P4 P5 P0 P5 Optimal allocation 5 . . 1 5 1 1 . . . . 2 3 1 . . 2 3 Grey = “off” processors. I : initial packets. P0: packets arrived at time 0.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 13 / 20

slide-30
SLIDE 30

Computing the optimal policy π∗

Time t 1 2 3 4 5 6 Arrival of packets 9 1 1 7 6 6 Queue 1 I P0 P0 P3 P4 P4 P6 I P0 P0 P4 P5 P0 P4 P4 Queue 2 P0 P5 P6 P5 Queue 3 I P0 P1 P4 P5 P6 I P0 P4 P5 P6 P0 P5 Optimal allocation 5 . . 1 5 1 1 1 . . . . 2 1 3 1 . . 2 3 2 Grey = “off” processors. I : initial packets. P0: packets arrived at time 0.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 13 / 20

slide-31
SLIDE 31

Computing the optimal policy π∗

Time t 1 2 3 4 5 6 Arrival of packets 9 1 1 7 6 6 Queue 1 I P0 P0 P3 P4 P4 P6 I P0 P0 P4 P5 P0 P4 P4 Queue 2 P0 P5 P6 P5 Queue 3 I P0 P1 P4 P5 P6 I P0 P4 P5 P6 P0 P5 Optimal allocation 5 . . 1 5 1 1+2 1 . . . . 2 1 3 1 . . 2 3 2 Grey = “off” processors. I : initial packets. P0: packets arrived at time 0. 2 packets remains at the end.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 13 / 20

slide-32
SLIDE 32

Numerical example

This provides two policies for the initial stochastic system. π∗ : at t, we apply π∗

t (MN t , EN t ) – adaptive policy.

a∗ : we apply a∗

t def

= π∗

t (mt, et) – static policy.

We want to compare V ∗N – optimal cost for the system of size N V N

a∗ – cost when applying a∗

V N

π∗ – cost when applying π∗

V N

JSQ – cost of Join Shortest Queue.

v∗ – cost of the deterministic limit.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 14 / 20

slide-33
SLIDE 33

Cost convergence

Cost

100 200 300 400 500 10 100 1000 10000 Deterministic cost A* policy Pi* policy JSQ weighted JSQ

Size of the system: N

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 15 / 20

slide-34
SLIDE 34

Cost convergence

Cost

100 200 300 400 500 10 100 1000 10000 Deterministic cost A* policy Pi* policy JSQ weighted JSQ

Asymptotically optimal

Size of the system: N

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 15 / 20

slide-35
SLIDE 35

Cost convergence

Cost

100 200 300 400 500 10 100 1000 10000 Deterministic cost A* policy Pi* policy JSQ weighted JSQ

Asymptotically optimal

Pi* Beats JSQ for N>50 A* Beats JSQ for N>200 Size of the system: N

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 15 / 20

slide-36
SLIDE 36

Cost convergence

Cost

100 200 300 400 500 10 100 1000 10000 Deterministic cost A* policy Pi* policy JSQ weighted JSQ

Asymptotically optimal

Pi* Beats JSQ for N>50 A* Beats JSQ for N>200 Pi* Good for small size Size of the system: N

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 15 / 20

slide-37
SLIDE 37

Speed of convergence – central limit theorem

Plot of √ N(V N

π∗ − v∗) and

√ N(V N

a∗ − v∗)

√ N(V N

X − v∗) 200 400 600 800 1000 1200 1400 10 100 1000 10000 A* policy Pi* policy

Size of the system: N

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 16 / 20

slide-38
SLIDE 38

Speed of convergence – central limit theorem

Plot of √ N(V N

π∗ − v∗) and

√ N(V N

a∗ − v∗)

√ N(V N

X − v∗) 200 400 600 800 1000 1200 1400 10 100 1000 10000 A* policy Pi* policy

Limit when N grows

Size of the system: N

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 16 / 20

slide-39
SLIDE 39

1

Theoretical Results

2

A (simple) example

3

Conclusion

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 17 / 20

slide-40
SLIDE 40

Conclusion

Optimal policy of the deterministic limit is asymptotically optimal. Works for low values of N (≈ 100 in the example). To apply this in practice, there are three cases (from best to worse):

1

We can solve the deterministic limit:

◮ apply a∗ or π∗. 2

Design an approximation algorithm for the deterministic system:

◮ also an approximation (asymptotically) for stochastic problem. 3

Use brute force computation:

◮ v ∗

t...T(m, e) = C(m, e) + supa v ∗ t+1...T(φa(m, e))

◮ Compared to the random case, there is no expectation to compute.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 18 / 20

slide-41
SLIDE 41

Conclusion

Optimal policy of the deterministic limit is asymptotically optimal. Works for low values of N (≈ 100 in the example). To apply this in practice, there are three cases (from best to worse):

1

We can solve the deterministic limit:

◮ apply a∗ or π∗. 2

Design an approximation algorithm for the deterministic system:

◮ also an approximation (asymptotically) for stochastic problem. 3

Use brute force computation:

◮ v ∗

t...T(m, e) = C(m, e) + supa v ∗ t+1...T(φa(m, e))

◮ Compared to the random case, there is no expectation to compute.

In general, the stochastic case is impossible to solve and this problem is usually addressed using restricted classes of policies:

◮ With limited information, Static/Adaptative, ...

We showed that this distinction asymptotically collapses.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 18 / 20

slide-42
SLIDE 42

References

Paper corresponding to this talk:

◮ Gast N., Gaujal B. – A Mean Field Approach for Optimization in

Particles Systems and Applications – RR 6877,

http://mescal.imag.fr/membres/nicolas.gast/

Mean field models:

◮ Le Boudec, McDonald, Mudinger – A Generic Mean Field Convergence

Result for Systems of Interacting Objects – QEST 2007

◮ Le Boudec, Bena¨

ım – A Class Of Mean Field Interaction Models for Computer and Communication Systems – Performance Evaluation

Infinite horizon results:

◮ Borkar – Stochastic Approximation: A Dynamical Systems Viewpoint –

Cambridge University Press 2008

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 19 / 20

slide-43
SLIDE 43

Thanks Thank you for your attention.

  • N. Gast (LIG)

Mean Field Optimization Knoxville 2009 20 / 20