Synthesis of Optimal Strategies Using H Y T ECH Patricia Bouyer 1 , - - PowerPoint PPT Presentation

synthesis of optimal strategies using
SMART_READER_LITE
LIVE PREVIEW

Synthesis of Optimal Strategies Using H Y T ECH Patricia Bouyer 1 , - - PowerPoint PPT Presentation

Synthesis of Optimal Strategies Using H Y T ECH Patricia Bouyer 1 , Franck Cassez 2 , Emmanuel Fleury 3 & Kim Guldstrand Larsen 3 1 LSV, ENS-Cachan, France 2 IRCCyN, Nantes, France 3 Comp. Science. Dept., Aalborg University, Danemark Games in


slide-1
SLIDE 1

c

IRCCyN/CNRS

Synthesis of Optimal Strategies Using HYTECH

Patricia Bouyer1, Franck Cassez2, Emmanuel Fleury3 & Kim Guldstrand Larsen3

1 LSV, ENS-Cachan, France 2 IRCCyN, Nantes, France 3 Comp. Science. Dept., Aalborg University, Danemark

Games in Design and Verification Boston, Massachusetts, USA July 18, 2004

http://www.lsv.ens-cachan.fr/aci-cortos/ptga

Synthesis of Optimal Strategies Using HYTECH page 1/35

slide-2
SLIDE 2

c

IRCCyN/CNRS

Contents

  • 1. Context & Related Work
  • 2. Priced Timed Game Automata
  • 3. From Optimal Control to Control

Computing The Optimal Cost Computing Optimal Strategies

  • 4. Implementation using HYTECH

Synthesis of Optimal Strategies Using HYTECH page 2/35

slide-3
SLIDE 3

c

IRCCyN/CNRS

Context

Timed Automata

ℓ0 ℓ1 [y = 0] ℓ2 ℓ3 Goal a2 a3 x ≥ 2; a4 x ≤ 2 ; a1 y := 0 x ≥ 2 ; a5 Timed Automata + Reachability [AD94]

Synthesis of Optimal Strategies Using HYTECH page 3-a/35

slide-4
SLIDE 4

c

IRCCyN/CNRS

Context

Timed Game Automata

ℓ0 ℓ1 [y = 0] ℓ2 ℓ3 Goal u u x ≥ 2; c2 x ≤ 2 ; c1 y := 0 x ≥ 2 ; c2 Timed Automata + Reachability [AD94] Timed Game Automata: Control [MPS95, AMPS98]

Synthesis of Optimal Strategies Using HYTECH page 3-b/35

slide-5
SLIDE 5

c

IRCCyN/CNRS

Context

As soon As Possible in Timed Automata

ℓ0 ℓ1 [y = 0] ℓ2 ℓ3 Goal u u x ≥ 2; a4 1 ≤ x ≤ 2 ; a1 y := 0 x ≥ 2 ; a5 Timed Automata + Reachability [AD94] Timed Game Automata: Control [MPS95, AMPS98] Time Optimal Control (Reachability) [AM99]

Synthesis of Optimal Strategies Using HYTECH page 3-c/35

slide-6
SLIDE 6

c

IRCCyN/CNRS

Context

Reachability in Priced Timed Automata

ℓ0 ℓ1 [y = 0] ℓ2 ℓ3 Goal a2 a3 x ≥ 2; a4

cost = 1

x ≤ 2 ; a1 y := 0 x ≥ 2 ; a5

cost = 7 cost(ℓ2) = 10 cost(ℓ0) = 5 cost(ℓ3) = 1

Timed Automata + Reachability [AD94] Timed Game Automata: Control [MPS95, AMPS98] Time Optimal Control (Reachability) [AM99] Priced (or Weighted) Timed Automata [LBB+01, ALTP01]

Synthesis of Optimal Strategies Using HYTECH page 3-d/35

slide-7
SLIDE 7

c

IRCCyN/CNRS

Context

Priced Timed Game Automata

ℓ0 ℓ1 [y = 0] ℓ2 ℓ3 Goal u u x ≥ 2; c2

cost = 1

x ≤ 2 ; c1 y := 0 x ≥ 2 ; c2

cost = 7 cost(ℓ2) = 10 cost(ℓ0) = 5 cost(ℓ3) = 1

Timed Automata + Reachability [AD94] Timed Game Automata: Control [MPS95, AMPS98] Time Optimal Control (Reachability) [AM99] Priced (or Weighted) Timed Automata [LBB+01, ALTP01]

Synthesis of Optimal Strategies Using HYTECH page 3-e/35

slide-8
SLIDE 8

c

IRCCyN/CNRS

A Simple Example

ℓ0

cost(ℓ0) = 5

ℓ1 [y = 0] ℓ2

cost(ℓ2) = 10

ℓ3

cost(ℓ3) = 1

Goal x ≤ 2; c1 y := 0 u u x ≥ 2; c2

cost = 1

x ≥ 2; c2

cost = 7

Model = Game = Controller vs. Environment What is the best cost whatever the environment does ?

Synthesis of Optimal Strategies Using HYTECH page 4-a/35

slide-9
SLIDE 9

c

IRCCyN/CNRS

A Simple Example

ℓ0

cost(ℓ0) = 5

ℓ1 [y = 0] ℓ2

cost(ℓ2) = 10

ℓ3

cost(ℓ3) = 1

Goal x ≤ 2; c1 y := 0 u u x ≥ 2; c2

cost = 1

x ≥ 2; c2

cost = 7

What is the best cost whatever the environment does ? inf

0≤t≤2 max{5t + 10(2 − t) + 1, 5t + (2 − t) + 7}

Synthesis of Optimal Strategies Using HYTECH page 4-b/35

slide-10
SLIDE 10

c

IRCCyN/CNRS

A Simple Example

ℓ0

cost(ℓ0) = 5

ℓ1 [y = 0] ℓ2

cost(ℓ2) = 10

ℓ3

cost(ℓ3) = 1

Goal x ≤ 2; c1 y := 0 u u x ≥ 2; c2

cost = 1

x ≥ 2; c2

cost = 7

What is the best cost whatever the environment does ? inf

0≤t≤2 max{5t+10(2−t)+1, 5t+(2−t)+7} at t = 4

3 inf = 141 3

Synthesis of Optimal Strategies Using HYTECH page 4-c/35

slide-11
SLIDE 11

c

IRCCyN/CNRS

A Simple Example

ℓ0

cost(ℓ0) = 5

ℓ1 [y = 0] ℓ2

cost(ℓ2) = 10

ℓ3

cost(ℓ3) = 1

Goal x ≤ 2; c1 y := 0 u u x ≥ 2; c2

cost = 1

x ≥ 2; c2

cost = 7

What is the best cost whatever the environment does ? = ⇒ 141

3 at t = 4 3

Synthesis of Optimal Strategies Using HYTECH page 4-d/35

slide-12
SLIDE 12

c

IRCCyN/CNRS

A Simple Example

ℓ0

cost(ℓ0) = 5

ℓ1 [y = 0] ℓ2

cost(ℓ2) = 10

ℓ3

cost(ℓ3) = 1

Goal x ≤ 2; c1 y := 0 u u x ≥ 2; c2

cost = 1

x ≥ 2; c2

cost = 7

What is the best cost whatever the environment does ? = ⇒ 141

3 at t = 4 3

Is there a strategy to achieve this optimal cost ?

Yes: because see later

Synthesis of Optimal Strategies Using HYTECH page 4-e/35

slide-13
SLIDE 13

c

IRCCyN/CNRS

A Simple Example

ℓ0

cost(ℓ0) = 5

ℓ1 [y = 0] ℓ2

cost(ℓ2) = 10

ℓ3

cost(ℓ3) = 1

Goal x ≤ 2; c1 y := 0 u u x ≥ 2; c2

cost = 1

x ≥ 2; c2

cost = 7

What is the best cost whatever the environment does ? = ⇒ 141

3 at t = 4 3

Is there a strategy to achieve this optimal cost ?

Yes: because see later

Can we compute such a strategy ?

Yes: in ℓ0, x < 4

3 wait then do c1; in ℓ2,3 do c2 when x ≥ 2

Synthesis of Optimal Strategies Using HYTECH page 4-f/35

slide-14
SLIDE 14

c

IRCCyN/CNRS

Optimal Control Problems

ℓ0

cost(ℓ0) = 5

ℓ1 [y = 0] ℓ2

cost(ℓ2) = 10

ℓ3

cost(ℓ3) = 1

Goal x ≤ 2; c1 y := 0 u u x ≥ 2; c2

cost = 1

x ≥ 2; c2

cost = 7

Can we find algorithms for these problems on PTGA:

  • 1. Compute the optimal cost
  • 2. Decide if there is an optimal strategy
  • 3. Compute an optimal strategy (if ∃)

Synthesis of Optimal Strategies Using HYTECH page 4-g/35

slide-15
SLIDE 15

c

IRCCyN/CNRS

Related Work

La Torre et al. [LTMM02] (IFIP TCS’02)

  • Acyclic Priced Timed Game Automata
  • Recursive definition of optimal cost

[= ⇒ La Torre et al. Def.]

  • Computation of the infimum of the optimal cost

OptCost = 2 could be 2 or 2 + ε

  • No strategy synthesis

Synthesis of Optimal Strategies Using HYTECH page 5-a/35

slide-16
SLIDE 16

c

IRCCyN/CNRS

Related Work

La Torre et al. [LTMM02] (IFIP TCS’02)

Acyclic Games, infimum, no strategy synthesis

Alur et al. [ABM04] (ICALP’04)

  • bounded optimality: optimal cost within k steps
  • complexity bound: exponential in k and #states of the

PTGA

  • no bound for the more general optimal problem
  • Computation of the infimum of the optimal cost
  • no strategy synthesis

Synthesis of Optimal Strategies Using HYTECH page 5-b/35

slide-17
SLIDE 17

c

IRCCyN/CNRS

Related Work

La Torre et al. [LTMM02] (IFIP TCS’02)

Acyclic Games, infimum, no strategy synthesis

Alur et al. [ABM04] (ICALP’04)

bounded optimality, complexity bound, infimum, no strategy synthesis

Our work [BCFL04]:

  • Run-based definition of optimal cost
  • We can decide whether ∃ an optimal strategy
  • We can synthesize an optimal strategy (if ∃)
  • We can prove structural properties of optimal strategies
  • Applies to Linear Hybrid Game (Automata)

Synthesis of Optimal Strategies Using HYTECH page 5-c/35

slide-18
SLIDE 18

c

IRCCyN/CNRS

Contents

  • 1. Context & Related Work
  • 2. Priced Timed Game Automata
  • 3. From Optimal Control to Control

Computing The Optimal Cost Computing Optimal Strategies

  • 4. Implementation using HYTECH

Synthesis of Optimal Strategies Using HYTECH page 6/35

slide-19
SLIDE 19

c

IRCCyN/CNRS

Priced Timed Game Automata

A Timed Game Automaton (PTGA) G is a tuple (L, ℓ0, Act, X,

E, inv, cost) where: L is a finite set of locations; ℓ0 ∈ L is the initial location; Act = Actc ∪ Actu is the set of actions (partitioned into

controllable and uncontrollable actions);

X is a finite set of real-valued clocks; E ⊆ L × B(X) × Act × 2X × L is a finite set of transitions; inv : L − → B(X) associates to each location its invariant;

Synthesis of Optimal Strategies Using HYTECH page 7-a/35

slide-20
SLIDE 20

c

IRCCyN/CNRS

Priced Timed Game Automata

A Priced Timed Game Automaton (PTGA) G is a tuple

(L, ℓ0, Act, X, E, inv, cost) where: L is a finite set of locations; E ⊆ L × B(X) × Act × 2X × L is a finite set of transitions; Priced Version: cost : L ∪ E − → N associates to each

location a cost rate and to each discrete transition a cost value.

[= ⇒ Example]

Synthesis of Optimal Strategies Using HYTECH page 7-b/35

slide-21
SLIDE 21

c

IRCCyN/CNRS

Priced Timed Game Automata

A Priced Timed Game Automaton (PTGA) G is a tuple

(L, ℓ0, Act, X, E, inv, cost) where: L is a finite set of locations; E ⊆ L × B(X) × Act × 2X × L is a finite set of transitions; Priced Version: cost : L ∪ E − → N associates to each

location a cost rate and to each discrete transition a cost value.

[= ⇒ Example]

we assume that PTGA are deterministic w.r.t. controllable

actions (+ time-deterministic)

A reachability PTGA (RPTGA) = PTGA with distinguished

set of states Goal ⊆ L.

Synthesis of Optimal Strategies Using HYTECH page 7-c/35

slide-22
SLIDE 22

c

IRCCyN/CNRS

Configurations, Runs, Costs

configuration: (ℓ, v) in L × RX

≥0

step: ti = (ℓi, vi) αi − → (ℓi+1, vi+1)

  • αi ∈ R>0 =

⇒ ℓi+1 = ℓi ∧ vi+1 = vi + αi αi ∈ Act = ⇒ ∃(ℓi, g, αi, Y, ℓi+1) ∈ E ∧ vi | = g ∧ vi+1 = vi[Y ] run ρ = t0t1t2 · · · tn−1 · · · finite or infinite sequence of ti cost of a transition:

  • Cost(ti) = αi.cost(ℓi) if αi ∈ R>0

Cost(ti) = cost((ℓi, g, αi, Y, ℓi+1)) if αi ∈ Act if ρ finite Cost(ρ) =

0≤i≤n−1 Cost(ti)

winning run if States(ρ) ∩ Goal = ∅

Synthesis of Optimal Strategies Using HYTECH page 8/35

slide-23
SLIDE 23

c

IRCCyN/CNRS

Strategies

strategy f over G = partial function from Runs(G) to Actc ∪ {λ}. Outcome((ℓ, v), f) (outcomes) of f from configuration (ℓ, v) = a subset of Runs((ℓ, v), G)

[= ⇒ Formal Definition of Outcome]

Synthesis of Optimal Strategies Using HYTECH page 9-a/35

slide-24
SLIDE 24

c

IRCCyN/CNRS

Strategies

ℓ0 ℓ1 [y = 0] ℓ2 ℓ3 Goal x ≤ 2; c1 y := 0 u u x ≥ 2; c2 x ≥ 2; c2

Example:

         f(ℓ0, x < 4

3) = λ

f(ℓ0, 4

3 ≤ x ≤ 2) = c1

f(ℓ1, −) undefined f(ℓ2, x < 2) = λ f(ℓ2, x ≥ 2) = c2 f(ℓ3, x < 2) = λ f(ℓ3, x ≥ 2) = c2

Synthesis of Optimal Strategies Using HYTECH page 9-b/35

slide-25
SLIDE 25

c

IRCCyN/CNRS

Strategies

strategy f over G = partial function from Runs(G) to Actc ∪ {λ}. Outcome((ℓ, v), f) = outcomes of f from configuration (ℓ, v);

[= ⇒ Formal Definition of Outcome]

a strategy f is winning from (ℓ, v) if Outcome((ℓ, v), f) ⊆ WinRuns((ℓ, v), G) The cost of f from (ℓ, v) is Cost((ℓ, v), f) = sup{Cost(ρ) | ρ ∈ Outcome((ℓ, v), f)}

Synthesis of Optimal Strategies Using HYTECH page 9-c/35

slide-26
SLIDE 26

c

IRCCyN/CNRS

(Formal) Optimal Control Problems

Optimal Cost Computation Problem: compute the optimal cost

  • ne can expect from s0 = (ℓ0,

0) OptCost(s0, G) = inf{Cost(s0, f) | f ∈ WinStrat(s0, G)}

Optimal Strategy Existence Problem: determine whether the

  • ptimal cost can actually be reached

∃?f ∈ WinStrat(s0, G) s.t. Cost(s0, f) = OptCost(s0, G)

Optimal Strategy Synthesis Problem: in case an optimal strategy

exists, compute a witness.

Synthesis of Optimal Strategies Using HYTECH page 10-a/35

slide-27
SLIDE 27

c

IRCCyN/CNRS

(Formal) Optimal Control Problems

Optimal Cost Computation Problem: compute the optimal cost

  • ne can expect from s0 = (ℓ0,

0) OptCost(s0, G) = inf{Cost(s0, f) | f ∈ WinStrat(s0, G)}

Optimal Strategy Existence Problem: determine whether the

  • ptimal cost can actually be reached

∃?f ∈ WinStrat(s0, G) s.t. Cost(s0, f) = OptCost(s0, G)

Optimal Strategy Synthesis Problem: in case an optimal strategy

exists, compute a witness. Relation to La Torre et al. work [LTMM02] (acyclic game): Theorem 1: OptCost(s0, G) = O(s0)

[= ⇒ Definition of O(q)]

Synthesis of Optimal Strategies Using HYTECH page 10-b/35

slide-28
SLIDE 28

c

IRCCyN/CNRS

Example: No Optimal Strategy

ℓ0

cost(ℓ0) = 1

ℓ1

cost(ℓ1) = 2

Goal x < 1; c x = 1; c x < 1 x ≤ 1 define fε with 0 < ε < 1 by:

in ℓ0: f(ℓ0, x < 1 − ε) = λ, f(ℓ0, 1 − ε ≤ x < 1) = c in ℓ1: f(ℓ1, x ≤ 1) = c

Cost(fε) = 1 + ε. there are RPTGA for which no optimal strategy exists In this case there is a family of strategies fε such that |Cost((ℓ0, 0), fε) − OptCost((ℓ0, 0), G)| < ε new problem: given ε, compute such an fε strategy.

Synthesis of Optimal Strategies Using HYTECH page 11/35

slide-29
SLIDE 29

c

IRCCyN/CNRS

Contents

  • 1. Context & Related Work
  • 2. Priced Timed Game Automata
  • 3. From Optimal Control to Control

Computing The Optimal Cost Computing Optimal Strategies

  • 4. Implementation using HYTECH

Synthesis of Optimal Strategies Using HYTECH page 12/35

slide-30
SLIDE 30

c

IRCCyN/CNRS

From Optimal Control to Control

ℓ0 ℓ1 [y = 0] ℓ2 Goal ℓ3 x ≤ 2; c1 y := 0 u u

cost(ℓ2) = 10 cost(ℓ0) = 5 cost(ℓ3) = 1 cost(ℓ2) = 10

x ≥ 2; c2

cost = 1

x ≥ 2; c2

cost = 7

A RPTGA A

Synthesis of Optimal Strategies Using HYTECH page 13-a/35

slide-31
SLIDE 31

c

IRCCyN/CNRS

From Optimal Control to Control

ℓ0 ℓ1 [y = 0] ℓ2 Goal ℓ3 x ≤ 2; c1 y := 0 u u

dCost dt

= −5

dCost dt

= −10

dCost dt

= −1 x ≥ 2; c2 Cost′ = Cost − 1 x ≥ 2; c2 Cost′ = Cost − 7 A Linear Hybrid Game Automaton H Reachability Game for H with goal = Goal ∧ Cost ≥ 0

Optimal Cost for RPTGA ⇐

⇒ Reachability Control on LHA

Synthesis of Optimal Strategies Using HYTECH page 13-b/35

slide-32
SLIDE 32

c

IRCCyN/CNRS

From Optimal Control to Control

ℓ0 ℓ1 [y = 0] ℓ2 Goal ℓ3 x ≤ 2; c1 y := 0 u u

dCost dt

= −5

dCost dt

= −10

dCost dt

= −1 x ≥ 2; c2 Cost′ = Cost − 1 x ≥ 2; c2 Cost′ = Cost − 7

Assume ∃ semi-algorithm CompWin s.t. WH = CompWin(H) and WH = largest set of winning states Theorem 2: If CompWin terminates for H then:

it terminates for A and WA

def

= CompWin(A) = ∃Cost.WH (q, c) ∈ WH ⇐ ⇒ ∃f ∈ WinStrat(q, WA) with Cost(q, f) ≤ c

Synthesis of Optimal Strategies Using HYTECH page 13-c/35

slide-33
SLIDE 33

c

IRCCyN/CNRS

Known Results for Reachability Games

Controllable Predecessors [MPS95, DAHM01] π(X) = Predt

  • X ∪ cPred(X), uPred(X)
  • [=

⇒ Formal Def. of π]

W (largest) set of winning states, goal = X0 W = µX.X0 ∪ π(X)

Synthesis of Optimal Strategies Using HYTECH page 14-a/35

slide-34
SLIDE 34

c

IRCCyN/CNRS

Known Results for Reachability Games

Controllable Predecessors [MPS95, DAHM01] π(X) = Predt

  • X ∪ cPred(X), uPred(X)
  • [=

⇒ Formal Def. of π]

W (largest) set of winning states, goal = X0 W = µX.X0 ∪ π(X) π preserves Cost upward-closed sets π(R ∧ Cost ≻ h) = R′ ∧ Cost ≻′ h′ semi-algorithm CompWin (preserves upwards closure) result of CompWin of the form ∪n∈N((ℓn, Rn ∧ Cost ≻n hn))

where hn is a piece-wise affine function

Synthesis of Optimal Strategies Using HYTECH page 14-b/35

slide-35
SLIDE 35

c

IRCCyN/CNRS

Contents

  • 1. Context & Related Work
  • 2. Priced Timed Game Automata
  • 3. From Optimal Control to Control

Computing The Optimal Cost Computing Optimal Strategies

  • 4. Implementation using HYTECH

Synthesis of Optimal Strategies Using HYTECH page 15/35

slide-36
SLIDE 36

c

IRCCyN/CNRS

Computing the Optimal Cost for PTGA

  • 1. ∃ semi-algorithm CompWin for LHG
  • 2. W = CompWin(H, Goal ∧ Cost ≥ 0)
  • 3. W0 = W ∩ (ℓ0,

0)

  • 4. projection on Cost: ∃(All \ {Cost}).W0

if Cost ≥ k, OptCost = k and ∃ an optimal strategy if Cost > k, OptCost = k and ∃ a family of sub-optimal

strategies Semi-algorithm for Priced Timed Hybrid Automata Termination ???

Synthesis of Optimal Strategies Using HYTECH page 16/35

slide-37
SLIDE 37

c

IRCCyN/CNRS

Termination for RPTGA

A a RPTGA s.t. non-zeno cost: ∃κ s.t. every cycle in the

region automaton has cost at least κ

A is bounded i.e. all clocks in A are bounded

Theorem 4 CompWin terminates for H, where H is the LHG associated with A

[= ⇒ Sketch of the Proof]

Synthesis of Optimal Strategies Using HYTECH page 17-a/35

slide-38
SLIDE 38

c

IRCCyN/CNRS

Termination for RPTGA

A a RPTGA s.t. non-zeno cost: ∃κ s.t. every cycle in the

region automaton has cost at least κ

A is bounded i.e. all clocks in A are bounded

Theorem 4 CompWin terminates for H, where H is the LHG associated with A

[= ⇒ Sketch of the Proof]

Non zeno cost really needed ? Complexity ???

Synthesis of Optimal Strategies Using HYTECH page 17-b/35

slide-39
SLIDE 39

c

IRCCyN/CNRS

Contents

  • 1. Context & Related Work
  • 2. Priced Timed Game Automata
  • 3. From Optimal Control to Control

Computing The Optimal Cost Computing Optimal Strategies

  • 4. Implementation using HYTECH

Synthesis of Optimal Strategies Using HYTECH page 18/35

slide-40
SLIDE 40

c

IRCCyN/CNRS

Optimal Strategy Synthesis

S algorithm for synthetizing strategies for reachability

timed games ? see [BCFL04] . . .

use S on the LHG H: strategies are cost-dependent

Theorem 5 If S terminates and ∃ an optimal strategy we can compute a witness (cost-dependent)

Synthesis of Optimal Strategies Using HYTECH page 19-a/35

slide-41
SLIDE 41

c

IRCCyN/CNRS

Optimal Strategy Synthesis

S algorithm for synthetizing strategies for reachability

timed games ? see [BCFL04] . . .

use S on the LHG H: strategies are cost-dependent

Theorem 5 If S terminates and ∃ an optimal strategy we can compute a witness (cost-dependent)

assume a RPTGA A is bounded, non zeno cost W is the set of winning states in the LHG H W = ∪n∈N((ℓn, Rn ∧ Cost ≥ hn)) (hn piece-wise lin. aff.)

Theorem 6 [State-based Strategies] Let WA = CompWin(A).

∃f state-based s.t. ∀(ℓ, v) ∈ WA Cost((ℓ, v), f) = OptCost(ℓ, v)

Synthesis of Optimal Strategies Using HYTECH page 19-b/35

slide-42
SLIDE 42

c

IRCCyN/CNRS

Synthesis of Cost-Dependent Strategies

for LHG winning states = fixed point of π operator W0 = Goal and Wi+1 = Predt

  • Wi ∪ cPred(Wi), uPred(Wi)
  • Synthesis of Optimal Strategies Using HYTECH

page 20-a/35

slide-43
SLIDE 43

c

IRCCyN/CNRS

Synthesis of Cost-Dependent Strategies

for LHG winning states = fixed point of π operator W0 = Goal and Wi+1 = Predt

  • Wi ∪ cPred(Wi), uPred(Wi)
  • synthesis of cost-dependent (state-based on LHG)

strategy:

  • assume fi is a winning, state-based strategy on Wi
  • compute Wi+1 = π(Wi) and let Y = Wi+1 \ Wi

Synthesis of Optimal Strategies Using HYTECH page 20-b/35

slide-44
SLIDE 44

c

IRCCyN/CNRS

Synthesis of Cost-Dependent Strategies

for LHG winning states = fixed point of π operator W0 = Goal and Wi+1 = Predt

  • Wi ∪ cPred(Wi), uPred(Wi)
  • synthesis of cost-dependent (state-based on LHG)

strategy:

  • assume fi is a winning, state-based strategy on Wi
  • compute Wi+1 = π(Wi) and let Y = Wi+1 \ Wi
  • on Wi define fi+1 = fi

Synthesis of Optimal Strategies Using HYTECH page 20-c/35

slide-45
SLIDE 45

c

IRCCyN/CNRS

Synthesis of Cost-Dependent Strategies

for LHG winning states = fixed point of π operator W0 = Goal and Wi+1 = Predt

  • Wi ∪ cPred(Wi), uPred(Wi)
  • synthesis of cost-dependent (state-based on LHG)

strategy:

  • assume fi is a winning, state-based strategy on Wi
  • compute Wi+1 = π(Wi) and let Y = Wi+1 \ Wi
  • on Wi define fi+1 = fi
  • on Yc = cPred(Wi) ∩ Y define fi+1 = {some c action}

Synthesis of Optimal Strategies Using HYTECH page 20-d/35

slide-46
SLIDE 46

c

IRCCyN/CNRS

Synthesis of Cost-Dependent Strategies

for LHG winning states = fixed point of π operator W0 = Goal and Wi+1 = Predt

  • Wi ∪ cPred(Wi), uPred(Wi)
  • synthesis of cost-dependent (state-based on LHG)

strategy:

  • assume fi is a winning, state-based strategy on Wi
  • compute Wi+1 = π(Wi) and let Y = Wi+1 \ Wi
  • on Wi define fi+1 = fi
  • on Yc = cPred(Wi) ∩ Y define fi+1 = {some c action}
  • on Yt = Y \ Yc define fi+1 = {λ}

Synthesis of Optimal Strategies Using HYTECH page 20-e/35

slide-47
SLIDE 47

c

IRCCyN/CNRS

Synthesis of Cost-Dependent Strategies

synthesis of cost-dependent (state-based on LHG)

strategy:

  • assume fi is a winning, state-based strategy on Wi
  • compute Wi+1 = π(Wi) and let Y = Wi+1 \ Wi
  • on Wi define fi+1 = fi
  • on Yc = cPred(Wi) ∩ Y define fi+1 = {some c action}
  • on Yt = Y \ Yc define fi+1 = {λ}

Problem ? ℓ0 Goal 0 < x < 1; c1

Synthesis of Optimal Strategies Using HYTECH page 20-f/35

slide-48
SLIDE 48

c

IRCCyN/CNRS

Synthesis of Cost-Dependent Strategies

synthesis of cost-dependent (state-based on LHG)

strategy:

  • assume fi is a winning, state-based strategy on Wi
  • compute Wi+1 = π(Wi) and let Y = Wi+1 \ Wi
  • on Wi define fi+1 = fi
  • on Yc = cPred(Wi) ∩ Y define fi+1 = {some c action}
  • on Yt = Y \ Yc define fi+1 = {λ}

Problem ? ℓ0 Goal 0 < x < 1; c1

  • W1 = {Goal} ∪ {(ℓ0, 0 ≤ x < 1)} and Y = (ℓ0, 0 ≤ x < 1)
  • f1(ℓ0, 0 < x < 1) = {c1} and f1(ℓ0, x = 0) = {λ}
  • blocking strategy

Synthesis of Optimal Strategies Using HYTECH page 20-g/35

slide-49
SLIDE 49

c

IRCCyN/CNRS

Synthesis of Cost-Dependent Strategies

synthesis of cost-dependent (state-based on LHG)

strategy:

  • assume fi is a winning, state-based strategy on Wi
  • compute Wi+1 = π(Wi) and let Y = Wi+1 \ Wi
  • on Wi define fi+1 = fi
  • on Yc = cPred(Wi) ∩ Y define fi+1 = {some c action}
  • on Yt = Y \ Yc define fi+1 = {λ}

Problem ? ℓ0 Goal 0 < x < 1; c1

  • Choose ε > 0
  • f1(ℓ0, ε ≤ x < 1) = {c1} and f1(ℓ0, 0 ≤ x < ε) = {λ}
  • new winning, state-based strategy

Synthesis of Optimal Strategies Using HYTECH page 20-h/35

slide-50
SLIDE 50

c

IRCCyN/CNRS

Synthesis of Cost-Dependent Strategies

synthesis of cost-dependent (state-based on LHG)

strategy:

  • assume fi is a winning, state-based strategy on Wi
  • compute Wi+1 = π(Wi) and let Y = Wi+1 \ Wi
  • on Wi define fi+1 = fi
  • on Yc = cPred(Wi) ∩ Y define fi+1 = {some c action}
  • on Yt = Y \ Yc define fi+1 = {λ}

Computation of a winning state-based strategy:

  • if guards of u actions are strict and guards on c actions

are large then fi+1 is winning (Yt is future-open)

  • otherwise fi+1 can be altered to be made winning
  • consequence: if π∗(W0) = Wk for some k ∈ N there is a

winning state-based (cost-dependent) strategy

Synthesis of Optimal Strategies Using HYTECH page 20-i/35

slide-51
SLIDE 51

c

IRCCyN/CNRS

Optimal Cost-Independent Strategy

compute a cost-dependent winning strategy f; f(q, cost) ∈ Actc ∪ {λ} Optimal cost-independent winning strategy f∗:

  • take the best action in each state: f∗(q) = e if
  • 1. e = f(q, cost)
  • 2. ∀e′ = e, f(q, cost′) = e′ =

⇒ cost′ ≥ cost result: under strictness assumptions, we can build a

uniform optimal strategy i.e. optimal in each state (non blocking)

[= ⇒ Algorithm & HYTECH ]

Synthesis of Optimal Strategies Using HYTECH page 21/35

slide-52
SLIDE 52

c

IRCCyN/CNRS

No Optimal Cost-Independent Strategy

ℓ0

cost(ℓ0) = 2

x < 1 ℓ1

cost(ℓ1) = 1

Goal x < 1;u y := x := 0 y > 0; c2 Optimal cost is 2 An optimal winning cost-dependent strategy f: f(ℓ1, −, cost < 2) = λ and f(ℓ1, −, cost = 2) = c2

assume u taken at time (1 − δ0):

Cost(f, (ℓ0, 0)) = 2 · (1 − δ0) + δ1

and according to f we have δ1 = 2 · δ2

Synthesis of Optimal Strategies Using HYTECH page 22-a/35

slide-53
SLIDE 53

c

IRCCyN/CNRS

No Optimal Cost-Independent Strategy

ℓ0

cost(ℓ0) = 2

x < 1 ℓ1

cost(ℓ1) = 1

Goal x < 1;u y := x := 0 y > 0; c2 Optimal cost is 2 assume ∃ f∗ cost-independent: f∗ must wait in ℓ1 at least ε

assume u taken at time (1 − δ):

Cost(f∗, (ℓ0, 0)) = 2 · (1 − δ) + ε

Take δ = ε

4: Cost(f∗, (ℓ0, 0)) = 2 + ε 2 and OptCost(f∗) = 2 + ε

Synthesis of Optimal Strategies Using HYTECH page 22-b/35

slide-54
SLIDE 54

c

IRCCyN/CNRS

Contents

  • 1. Context & Related Work
  • 2. Priced Timed Game Automata
  • 3. From Optimal Control to Control

Computing The Optimal Cost Computing Optimal Strategies

  • 4. Implementation using HYTECH

Synthesis of Optimal Strategies Using HYTECH page 23/35

slide-55
SLIDE 55

c

IRCCyN/CNRS

Experiment

computation of optimal cost and optimal strategies (if ∃)

implemented in HYTECH (Demo ?)

a cyclic example:

[= ⇒ See the strategy]

Antenna 1 lowx

cost = 1

highx

cost = 10

Goalx jamx?; x := 0 x ≥ 5; y := 0 x ≥ 10;

cost = 7

jamx?; x := 0 Antenna 2 lowy

cost = 2

highy

cost = 20

Goaly jamy?; y := 0 y ≥ 2; x := 0 y ≥ 10;

cost = 1

jamy?; y := 0 Jammer X Y x > 6; jamy! y > 6; jamx! x > 6; jamx! y > 6; jamy!

Synthesis of Optimal Strategies Using HYTECH page 24/35

slide-56
SLIDE 56

c

IRCCyN/CNRS

Optimal Strategy for the Mobile Phone

Optimal cost is 109

Synthesis of Optimal Strategies Using HYTECH page 25/35

slide-57
SLIDE 57

c

IRCCyN/CNRS

Conclusion & Future Work

Current State of Our Work

Semi-algorithm for computing the optimal cost for LHG in case it terminates:

  • decide if ∃ optimal strategy
  • compute an optimal strategy

Implementation in HYTECH

Open Problems

Optimal Control – Decidability issues (non zeno cost) maximal class for which CompWin terminates

Future Work

compute fε strategies safety games . . .

Synthesis of Optimal Strategies Using HYTECH page 26/35

slide-58
SLIDE 58

c

IRCCyN/CNRS

References (1)

[AD94]

  • R. Alur and D. Dill. A theory of timed automata. Theoretical Computer Science

(TCS), 126(2):183–235, 1994. 3 [ALTP01]

  • R. Alur, S. La Torre, and G. J. Pappas. Optimal paths in weighted timed automata. In
  • Proc. 4th Int. Work. Hybrid Systems: Computation and Control (HSCC’01), volume

2034 of LNCS, pages 49–62. Springer, 2001. 3 [ABM04]

  • R. Alur, M, Bernadsky, and P

. Madhusudan. Optimal reachability in weighted timed

  • games. In Proc. 31st International Colloquium on Automata, Languages and

Programming (ICALP’04), Lecture Notes in Computer Science. Springer, 2004. To

  • appear. 5

[AM99]

  • E. Asarin and O. Maler. As soon as possible: Time optimal control for timed
  • automata. In Proc. 2nd Int. Work. Hybrid Systems: Computation and Control

(HSCC’99), volume 1569 of LNCS, pages 19–30. Springer, 1999. 3 [AMPS98] E. Asarin, O. Maler, A. Pnueli, and J. Sifakis. Controller synthesis for timed

  • automata. In Proc. IFAC Symposium on System Structure and Control, pages

469–474. Elsevier Science, 1998. 3 [BCFL04] P . Bouyer, F. Cassez, E. Fleury, and K. G. Larsen. Optimal strategies in priced timed game automata. BRICS Report Series, Basic Research In Computer Science, Denmark, February 2004. 5, 19

Synthesis of Optimal Strategies Using HYTECH page 27/35

slide-59
SLIDE 59

c

IRCCyN/CNRS

References (2)

[DAHM01] L. De Alfaro, T. A. Henzinger, and R. Majumdar. Symbolic algorithms for infinite-state

  • games. In Proc. 12th International Conference on Concurrency Theory

(CONCUR’01), volume 2154 of LNCS, pages 536–550. Springer, 2001. 14 [LBB+01]

  • K. G. Larsen, G. Behrmann, E. Brinksma, A. Fehnker, T. Hune, P

. Pettersson, and

  • J. Romijn. As cheap as possible: Efficient cost-optimal reachability for priced timed
  • automata. In Proc. 13th International Conference on Computer Aided Verification

(CAV’01), volume 2102 of LNCS, pages 493–505. Springer, 2001. 3 [LTMM02] S. La Torre, S. Mukhopadhyay, and A. Murano. Optimal-reachability and control for acyclic weighted timed automata. In Proc. 2nd IFIP International Conference on Theoretical Computer Science (TCS 2002), volume 223 of IFIP Conference Proceedings, pages 485–497. Kluwer, 2002. 5, 10 [MPS95]

  • O. Maler, A. Pnueli, and J. Sifakis. On the synthesis of discrete controllers for timed
  • systems. In Proc. 12th Annual Symposium on Theoretical Aspects of Computer

Science (STACS’95), volume 900, pages 229–242. Springer, 1995. 3, 14

Synthesis of Optimal Strategies Using HYTECH page 28/35

slide-60
SLIDE 60

c

IRCCyN/CNRS

Recursive Definition of Optimal Cost

Let G be a RPTG. Let O be the function from Q to R≥0 ∪ {+∞} that is the least fixed point of the following functional: O(q)? q

t,p

− − → q′ max                           

Synthesis of Optimal Strategies Using HYTECH page 29-a/35

slide-61
SLIDE 61

c

IRCCyN/CNRS

Recursive Definition of Optimal Cost

Let G be a RPTG. Let O be the function from Q to R≥0 ∪ {+∞} that is the least fixed point of the following functional: O(q)? q

t,p

− − → q′ max                            min               min

q′ c,p′

− − →q′′

c∈Actc

p + p′ + O(q′′)        , p + O(q′)        Controllable actions in q′

Synthesis of Optimal Strategies Using HYTECH page 29-b/35

slide-62
SLIDE 62

c

IRCCyN/CNRS

Recursive Definition of Optimal Cost

Let G be a RPTG. Let O be the function from Q to R≥0 ∪ {+∞} that is the least fixed point of the following functional: O(q)? q

t,p

− − → q′ max                            min               min

q′ c,p′

− − →q′′

c∈Actc

p + p′ + O(q′′)        , p + O(q′)        sup

q

t′,p′

− − − →q′′

t′≤t

max

q′′ u,p′′

− − − →q′′′

u∈Actu

p′ + p′′ + O(q′′′) Controllable actions in q′ Uncontrollable actions before q′

Synthesis of Optimal Strategies Using HYTECH page 29-c/35

slide-63
SLIDE 63

c

IRCCyN/CNRS

Recursive Definition of Optimal Cost

Let G be a RPTG. Let O be the function from Q to R≥0 ∪ {+∞} that is the least fixed point of the following functional: O(q) = inf

q

t,p

− − →q′

t∈R≥0

max                            min               min

q′ c,p′

− − →q′′

c∈Actc

p + p′ + O(q′′)        , p + O(q′)        sup

q

t′,p′

− − − →q′′

t′≤t

max

q′′ u,p′′

− − − →q′′′

u∈Actu

p′ + p′′ + O(q′′′) Controllable actions in q′ Uncontrollable actions before q′ Minimize over t

Synthesis of Optimal Strategies Using HYTECH page 29-d/35

slide-64
SLIDE 64

c

IRCCyN/CNRS

Outcome

Let G = (L, ℓ0, Act, X, E, inv, cost) be a (R)PTGA and f a strategy over G. The outcome Outcome((ℓ, v), f) of f from configuration (ℓ, v) in G is the subset of Runs((ℓ, v), G) defined inductively by: (ℓ, v) ∈ Outcome((ℓ, v), f), if ρ ∈ Outcome((ℓ, v), f) then ρ′ = ρ

e

− − → (ℓ′, v′) ∈ Outcome((ℓ, v), f) if ρ′ ∈ Runs((ℓ, v), G) and one of the following three conditions hold:

  • 1. e ∈ Actu,
  • 2. e ∈ Actc and e = f(ρ),
  • 3. e ∈ R≥0 and ∀0 ≤ e′ < e, ∃(ℓ′′, v′′) ∈ (L × RX

≥0) s.t. last(ρ) e′

− − → (ℓ′′, v′′) ∧ f(ρ

e′

− − → (ℓ′′, v′′)) = λ. an infinite run ρ is in ∈ Outcome((ℓ, v), f) if all the finite prefixes of ρ are in Outcome((ℓ, v), f).

[= ⇒ Back to Strategies]

Synthesis of Optimal Strategies Using HYTECH page 30/35

slide-65
SLIDE 65

c

IRCCyN/CNRS

A Tricky Example

ℓ0

cost(ℓ0) = 1

x ≤ 1 ℓ1

cost(ℓ1) = 2

x ≤ 1 Goal y > 0; u ; y := 0 y > 0; c ; y := 0 x = 1; c

Synthesis of Optimal Strategies Using HYTECH page 31-a/35

slide-66
SLIDE 66

c

IRCCyN/CNRS

A Tricky Example

ℓ0

cost(ℓ0) = 1

x ≤ 1 ℓ1

cost(ℓ1) = 2

x ≤ 1 Goal y > 0; u ; y := 0 y > 0; c ; y := 0 x = 1; c what is the optimal cost? Is there an optimal strategy?

Synthesis of Optimal Strategies Using HYTECH page 31-b/35

slide-67
SLIDE 67

c

IRCCyN/CNRS

A Tricky Example

ℓ0

cost(ℓ0) = 1

x ≤ 1 ℓ1

cost(ℓ1) = 2

x ≤ 1 Goal y > 0; u ; y := 0 y > 0; c ; y := 0 x = 1; c what is the optimal cost? Is there an optimal strategy? . . . assume you start with 2 . . . start with less than 2 (2 − ǫ)

Synthesis of Optimal Strategies Using HYTECH page 31-c/35

slide-68
SLIDE 68

c

IRCCyN/CNRS

π Operator

(Un)Controllable Predecessors Preda(X) = {q ∈ Q | q

a

− − → q′, q′ ∈ X} cPred(X) =

c∈Actc Predc(X)

uPred(X) =

u∈Actu Predu(X)

Safe Time Predecessors Predt(X, Y ) = {q ∈ Q | ∃δ ∈ R≥0 | q

δ

− → q′, q′ ∈ X ∧ Post[0,δ](q) ⊆ Y } Post[0,δ](q) = {q′ ∈ Q | ∃t ∈ [0, δ] | q

t

− → q′} π Operator (uncontrollable actions “cannot win”): π(X) = Predt

  • X ∪ cPred(X), uPred(X)
  • Synthesis of Optimal Strategies Using HYTECH

page 32-a/35

slide-69
SLIDE 69

c

IRCCyN/CNRS

π Operator

(Un)Controllable Predecessors Preda(X) = {q ∈ Q | q

a

− − → q′, q′ ∈ X} cPred(X) =

c∈Actc Predc(X)

uPred(X) =

u∈Actu Predu(X)

Safe Time Predecessors Predt(X, Y ) = {q ∈ Q | ∃δ ∈ R≥0 | q

δ

− → q′, q′ ∈ X ∧ Post[0,δ](q) ⊆ Y } Post[0,δ](q) = {q′ ∈ Q | ∃t ∈ [0, δ] | q

t

− → q′} π′: uncontrollable actions sometimes can win: π′(X) = π(X) ∪ Predt

  • uPred(X) ∩ STOP, uPred(X)
  • Synthesis of Optimal Strategies Using HYTECH

page 32-b/35

slide-70
SLIDE 70

c

IRCCyN/CNRS

π Operator

(Un)Controllable Predecessors Preda(X) = {q ∈ Q | q

a

− − → q′, q′ ∈ X} cPred(X) =

c∈Actc Predc(X)

uPred(X) =

u∈Actu Predu(X)

Safe Time Predecessors Predt(X, Y ) = {q ∈ Q | ∃δ ∈ R≥0 | q

δ

− → q′, q′ ∈ X ∧ Post[0,δ](q) ⊆ Y } Post[0,δ](q) = {q′ ∈ Q | ∃t ∈ [0, δ] | q

t

− → q′} π′′: uncontrollable actions bound to happen win: π′′(X) = π(X)∪Predt

  • Inv ∩ Predt(uPred(X) ∩ Inv), uPred(X)
  • Synthesis of Optimal Strategies Using HYTECH

page 32-c/35

slide-71
SLIDE 71

c

IRCCyN/CNRS

Termination Criterion for RPTGA

R is a (bounded) region of the region automaton (RA) every cycle in the RA costs at least κ m1

ℓ,R

M1

ℓ,R

cost ≥ f1

ℓ,R =↑f1 ℓ,R

κ κ κ κ κ κ

cost

R

[= ⇒ Back to Termination]

Synthesis of Optimal Strategies Using HYTECH page 33-a/35

slide-72
SLIDE 72

c

IRCCyN/CNRS

Termination Criterion for RPTGA

R is a (bounded) region of the region automaton (RA) every cycle in the RA costs at least κ m1

ℓ,R

M1

ℓ,R

cost ≥ f1

ℓ,R =↑f1 ℓ,R

κ f2

ℓ,R

m2

ℓ,R

κ κ κ κ κ

cost

R

[= ⇒ Back to Termination]

Synthesis of Optimal Strategies Using HYTECH page 33-b/35

slide-73
SLIDE 73

c

IRCCyN/CNRS

Termination Criterion for RPTGA

R is a (bounded) region of the region automaton (RA) every cycle in the RA costs at least κ m1

ℓ,R

M1

ℓ,R

cost ≥ f1

ℓ,R =↑f1 ℓ,R

κ f2

ℓ,R

m2

ℓ,R

κ κ κ κ κ m7

ℓ,R

↑f7

ℓ,R

cost

R

[= ⇒ Back to Termination]

Synthesis of Optimal Strategies Using HYTECH page 33-c/35

slide-74
SLIDE 74

c

IRCCyN/CNRS

Optimal Cost-Independent Strategy

ℓ0 (5) ℓ1 [y = 0] ℓ2 (10) ℓ3 (1) Goal x ≤ 2; c1; y := 0 u u x ≥ 2; c2 ; (1) x ≥ 2; c2 ; (7)

Synthesis of Optimal Strategies Using HYTECH page 34-a/35

slide-75
SLIDE 75

c

IRCCyN/CNRS

Optimal Cost-Independent Strategy

4 3

141

3

7 x cost ℓ0 (5) ℓ1 [y = 0] ℓ2 (10) ℓ3 (1) Goal x ≤ 2; c1; y := 0 u u x ≥ 2; c2 ; (1) x ≥ 2; c2 ; (7)

Synthesis of Optimal Strategies Using HYTECH page 34-b/35

slide-76
SLIDE 76

c

IRCCyN/CNRS

Optimal Cost-Independent Strategy

4 3

141

3

7 x cost W [λ]

4

ℓ0 (5) ℓ1 [y = 0] ℓ2 (10) ℓ3 (1) Goal x ≤ 2; c1; y := 0 u u x ≥ 2; c2 ; (1) x ≥ 2; c2 ; (7)

Synthesis of Optimal Strategies Using HYTECH page 34-c/35

slide-77
SLIDE 77

c

IRCCyN/CNRS

Optimal Cost-Independent Strategy

4 3

141

3

7 x cost W [λ]

4

W [c1]

4

ℓ0 (5) ℓ1 [y = 0] ℓ2 (10) ℓ3 (1) Goal x ≤ 2; c1; y := 0 u u x ≥ 2; c2 ; (1) x ≥ 2; c2 ; (7)

Synthesis of Optimal Strategies Using HYTECH page 34-d/35

slide-78
SLIDE 78

c

IRCCyN/CNRS

Optimal Cost-Independent Strategy

4 3

141

3

7 x cost λ c1

  • ℓ0 (5)

ℓ1 [y = 0] ℓ2 (10) ℓ3 (1) Goal x ≤ 2; c1; y := 0 u u x ≥ 2; c2 ; (1) x ≥ 2; c2 ; (7)

Synthesis of Optimal Strategies Using HYTECH page 34-e/35

slide-79
SLIDE 79

c

IRCCyN/CNRS

Optimal Cost-Independent Strategy

tagged sets: keep information how to win on Wi+1

  • compute Wi+1 = π(Wi) and let Y = Wi+1 \ Wi
  • W [c]

i+1 can reach Wi doing a c

  • W [λ]

i+1 can reach Wi or cPred(Wi) by time-elapsing

  • ptimal state-based strategy:
  • on W [c]

i+1 ≤ W [λ] i+1 do c

  • on W [λ]

i+1 < W [c] i+1 do λ

Synthesis of Optimal Strategies Using HYTECH page 34-f/35

slide-80
SLIDE 80

c

IRCCyN/CNRS

How-To Cost-Independent Strategy

ℓ0

cost(ℓ0) = 2

ℓ1

cost(ℓ1) = 1

ℓ2

cost(ℓ2) = 3

Goal x < 1;u y := 0 x < 1;u y > 0; c1 x = 1; c2 Optimal cost is 3

Synthesis of Optimal Strategies Using HYTECH page 35-a/35

slide-81
SLIDE 81

c

IRCCyN/CNRS

How-To Cost-Independent Strategy

ℓ0

cost(ℓ0) = 2

ℓ1

cost(ℓ1) = 1

ℓ2

cost(ℓ2) = 3

Goal x < 1;u y := 0 x < 1;u y > 0; c1 x = 1; c2 Optimal cost is 3 Optimal move in (ℓ1, y > 0) = c1, in (ℓ1, 0) = λ

Synthesis of Optimal Strategies Using HYTECH page 35-b/35

slide-82
SLIDE 82

c

IRCCyN/CNRS

How-To Cost-Independent Strategy

ℓ0

cost(ℓ0) = 2

ℓ1

cost(ℓ1) = 1

ℓ2

cost(ℓ2) = 3

Goal x < 1;u y := 0 x < 1;u y > 0; c1 x = 1; c2 Optimal cost is 3 Optimal move in (ℓ1, y > 0) = c1, in (ℓ1, 0) = λ Optimal strategy: f∗(ℓ1, 0 < y < 1

2) = λ, in (ℓ1, y ≥ 1 2) = c1

f∗(ℓ2, x < 1) = λ and f∗(ℓ2, x ≥ 1) = c2

Synthesis of Optimal Strategies Using HYTECH page 35-c/35