Approachability Theory and Differential Games Sylvain Sorin - - PowerPoint PPT Presentation

approachability theory and differential games
SMART_READER_LITE
LIVE PREVIEW

Approachability Theory and Differential Games Sylvain Sorin - - PowerPoint PPT Presentation

Approachability Theory and Differential Games Sylvain Sorin UPMC-Paris 6 Ecole Polytechnique sorin@math.jussieu.fr Sixi` emes Journ ees Franco-Chiliennes dOptimisation 19-21 mai 2008 Universit e du Sud Toulon-Var Sylvain Sorin


slide-1
SLIDE 1

Approachability Theory and Differential Games

Sylvain Sorin

UPMC-Paris 6 Ecole Polytechnique sorin@math.jussieu.fr

Sixi` emes Journ´ ees Franco-Chiliennes d’Optimisation 19-21 mai 2008 Universit´ e du Sud Toulon-Var

Sylvain Sorin ATDG

slide-2
SLIDE 2

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

This is a joint work with S. As Soulaimani and M. Quincampoix

Sylvain Sorin ATDG

slide-3
SLIDE 3

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

1

The model

2

Preliminaries

3

Weak approachability and differential games with fixed duration

4

Approachability and B-sets

5

Approachability and qualitative differential games

6

On strategies in the differential games and the repeated games

Sylvain Sorin ATDG

slide-4
SLIDE 4

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Approachability theory

: Blackwell (1956) Given an I × J matrix A with coefficients in I Rk, a two-person infinitely repeated game form G is defined as follows. At each stage n = 1, 2, ..., each player chooses an element in his set of actions: in ∈ I for Player 1 (resp. jn ∈ J for Player 2), the corresponding outcome is gn = Ainjn ∈ I Rk and the couple of actions (in, jn) is announced to both players. gn = 1

n

n

m=1 gm is the average outcome up to stage n.

The aim of Player 1 is that gn approaches a target set C ⊂ I Rk. Approachability: generalization of max-min level in a (one shot) game with real payoff where C = [v, +∞).

Sylvain Sorin ATDG

slide-5
SLIDE 5

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Approachability theory

: Blackwell (1956) Given an I × J matrix A with coefficients in I Rk, a two-person infinitely repeated game form G is defined as follows. At each stage n = 1, 2, ..., each player chooses an element in his set of actions: in ∈ I for Player 1 (resp. jn ∈ J for Player 2), the corresponding outcome is gn = Ainjn ∈ I Rk and the couple of actions (in, jn) is announced to both players. gn = 1

n

n

m=1 gm is the average outcome up to stage n.

The aim of Player 1 is that gn approaches a target set C ⊂ I Rk. Approachability: generalization of max-min level in a (one shot) game with real payoff where C = [v, +∞).

Sylvain Sorin ATDG

slide-6
SLIDE 6

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Approachability theory

: Blackwell (1956) Given an I × J matrix A with coefficients in I Rk, a two-person infinitely repeated game form G is defined as follows. At each stage n = 1, 2, ..., each player chooses an element in his set of actions: in ∈ I for Player 1 (resp. jn ∈ J for Player 2), the corresponding outcome is gn = Ainjn ∈ I Rk and the couple of actions (in, jn) is announced to both players. gn = 1

n

n

m=1 gm is the average outcome up to stage n.

The aim of Player 1 is that gn approaches a target set C ⊂ I Rk. Approachability: generalization of max-min level in a (one shot) game with real payoff where C = [v, +∞).

Sylvain Sorin ATDG

slide-7
SLIDE 7

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Approachability theory

: Blackwell (1956) Given an I × J matrix A with coefficients in I Rk, a two-person infinitely repeated game form G is defined as follows. At each stage n = 1, 2, ..., each player chooses an element in his set of actions: in ∈ I for Player 1 (resp. jn ∈ J for Player 2), the corresponding outcome is gn = Ainjn ∈ I Rk and the couple of actions (in, jn) is announced to both players. gn = 1

n

n

m=1 gm is the average outcome up to stage n.

The aim of Player 1 is that gn approaches a target set C ⊂ I Rk. Approachability: generalization of max-min level in a (one shot) game with real payoff where C = [v, +∞).

Sylvain Sorin ATDG

slide-8
SLIDE 8

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Approachability theory

: Blackwell (1956) Given an I × J matrix A with coefficients in I Rk, a two-person infinitely repeated game form G is defined as follows. At each stage n = 1, 2, ..., each player chooses an element in his set of actions: in ∈ I for Player 1 (resp. jn ∈ J for Player 2), the corresponding outcome is gn = Ainjn ∈ I Rk and the couple of actions (in, jn) is announced to both players. gn = 1

n

n

m=1 gm is the average outcome up to stage n.

The aim of Player 1 is that gn approaches a target set C ⊂ I Rk. Approachability: generalization of max-min level in a (one shot) game with real payoff where C = [v, +∞).

Sylvain Sorin ATDG

slide-9
SLIDE 9

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Approachability theory

: Blackwell (1956) Given an I × J matrix A with coefficients in I Rk, a two-person infinitely repeated game form G is defined as follows. At each stage n = 1, 2, ..., each player chooses an element in his set of actions: in ∈ I for Player 1 (resp. jn ∈ J for Player 2), the corresponding outcome is gn = Ainjn ∈ I Rk and the couple of actions (in, jn) is announced to both players. gn = 1

n

n

m=1 gm is the average outcome up to stage n.

The aim of Player 1 is that gn approaches a target set C ⊂ I Rk. Approachability: generalization of max-min level in a (one shot) game with real payoff where C = [v, +∞).

Sylvain Sorin ATDG

slide-10
SLIDE 10

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Hn = (I × J)n is the set of possible histories at stage n + 1 and H∞ = (I × J)∞ be the set of plays. Σ (resp. T ) is the set of strategies of Player 1 (resp. Player 2): mappings from H = ∪n≥0Hn to the sets of mixed actions U = ∆(I) (probabilities on I) (resp. V = ∆(J)). At stage n, given the history hn−1 ∈ Hn−1, Player 1 chooses an action in I according to the probability distribution σ(hn−1) ∈ U (and similarly for Player 2). A couple (σ, τ) of strategies induces a probability on H∞ and Eσ,τ denotes the corresponding expectation.

Sylvain Sorin ATDG

slide-11
SLIDE 11

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Hn = (I × J)n is the set of possible histories at stage n + 1 and H∞ = (I × J)∞ be the set of plays. Σ (resp. T ) is the set of strategies of Player 1 (resp. Player 2): mappings from H = ∪n≥0Hn to the sets of mixed actions U = ∆(I) (probabilities on I) (resp. V = ∆(J)). At stage n, given the history hn−1 ∈ Hn−1, Player 1 chooses an action in I according to the probability distribution σ(hn−1) ∈ U (and similarly for Player 2). A couple (σ, τ) of strategies induces a probability on H∞ and Eσ,τ denotes the corresponding expectation.

Sylvain Sorin ATDG

slide-12
SLIDE 12

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Hn = (I × J)n is the set of possible histories at stage n + 1 and H∞ = (I × J)∞ be the set of plays. Σ (resp. T ) is the set of strategies of Player 1 (resp. Player 2): mappings from H = ∪n≥0Hn to the sets of mixed actions U = ∆(I) (probabilities on I) (resp. V = ∆(J)). At stage n, given the history hn−1 ∈ Hn−1, Player 1 chooses an action in I according to the probability distribution σ(hn−1) ∈ U (and similarly for Player 2). A couple (σ, τ) of strategies induces a probability on H∞ and Eσ,τ denotes the corresponding expectation.

Sylvain Sorin ATDG

slide-13
SLIDE 13

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Hn = (I × J)n is the set of possible histories at stage n + 1 and H∞ = (I × J)∞ be the set of plays. Σ (resp. T ) is the set of strategies of Player 1 (resp. Player 2): mappings from H = ∪n≥0Hn to the sets of mixed actions U = ∆(I) (probabilities on I) (resp. V = ∆(J)). At stage n, given the history hn−1 ∈ Hn−1, Player 1 chooses an action in I according to the probability distribution σ(hn−1) ∈ U (and similarly for Player 2). A couple (σ, τ) of strategies induces a probability on H∞ and Eσ,τ denotes the corresponding expectation.

Sylvain Sorin ATDG

slide-14
SLIDE 14

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

In the general theory of repeated two main approaches have been studied:

  • asympotic analysis: study of the limit of values of finitely

repeated games or discounted games as the expected length goes to ∞. This amounts to consider finer and finer time discretizations of a continuous time game played on [0, 1],

  • or uniform analysis through robustness properties of

strategies: they should be approximately optimal in any sufficiently long game. In our framework this leads to the 2 following notions:

Sylvain Sorin ATDG

slide-15
SLIDE 15

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

In the general theory of repeated two main approaches have been studied:

  • asympotic analysis: study of the limit of values of finitely

repeated games or discounted games as the expected length goes to ∞. This amounts to consider finer and finer time discretizations of a continuous time game played on [0, 1],

  • or uniform analysis through robustness properties of

strategies: they should be approximately optimal in any sufficiently long game. In our framework this leads to the 2 following notions:

Sylvain Sorin ATDG

slide-16
SLIDE 16

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

In the general theory of repeated two main approaches have been studied:

  • asympotic analysis: study of the limit of values of finitely

repeated games or discounted games as the expected length goes to ∞. This amounts to consider finer and finer time discretizations of a continuous time game played on [0, 1],

  • or uniform analysis through robustness properties of

strategies: they should be approximately optimal in any sufficiently long game. In our framework this leads to the 2 following notions:

Sylvain Sorin ATDG

slide-17
SLIDE 17

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

In the general theory of repeated two main approaches have been studied:

  • asympotic analysis: study of the limit of values of finitely

repeated games or discounted games as the expected length goes to ∞. This amounts to consider finer and finer time discretizations of a continuous time game played on [0, 1],

  • or uniform analysis through robustness properties of

strategies: they should be approximately optimal in any sufficiently long game. In our framework this leads to the 2 following notions:

Sylvain Sorin ATDG

slide-18
SLIDE 18

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

In the general theory of repeated two main approaches have been studied:

  • asympotic analysis: study of the limit of values of finitely

repeated games or discounted games as the expected length goes to ∞. This amounts to consider finer and finer time discretizations of a continuous time game played on [0, 1],

  • or uniform analysis through robustness properties of

strategies: they should be approximately optimal in any sufficiently long game. In our framework this leads to the 2 following notions:

Sylvain Sorin ATDG

slide-19
SLIDE 19

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Definition A nonempty closed set C in I Rk is weakly approachable by Player 1 in G if, for every ε > 0, there exists N ∈ I N such that for any n ≥ N there is a strategy σ = σ(n, ε) of Player 1 such that, for any strategy τ of Player 2 Eσ,τ(dC(gn)) ≤ ε. where dC stands for the distance to C. If vn is the value of the n-stage game with payoff −E(dC(¯ gn)), weak-approachability means vn → 0.

Sylvain Sorin ATDG

slide-20
SLIDE 20

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Definition A nonempty closed set C in I Rk is weakly approachable by Player 1 in G if, for every ε > 0, there exists N ∈ I N such that for any n ≥ N there is a strategy σ = σ(n, ε) of Player 1 such that, for any strategy τ of Player 2 Eσ,τ(dC(gn)) ≤ ε. where dC stands for the distance to C. If vn is the value of the n-stage game with payoff −E(dC(¯ gn)), weak-approachability means vn → 0.

Sylvain Sorin ATDG

slide-21
SLIDE 21

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The uniform notion is Definition A nonempty closed set C in I Rk is approachable by Player 1 in G if, for every ε > 0, there exists a strategy σ = σ(ε) of Player 1 and N ∈ I N such that, for any strategy τ of Player 2 and any n ≥ N Eσ,τ(dC(gn)) ≤ ε. Asymptotically the average outcome remains close in expectation to the target C, uniformly with respect to the

  • pponent’s behavior.

Dual notion : excludability.

Sylvain Sorin ATDG

slide-22
SLIDE 22

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The uniform notion is Definition A nonempty closed set C in I Rk is approachable by Player 1 in G if, for every ε > 0, there exists a strategy σ = σ(ε) of Player 1 and N ∈ I N such that, for any strategy τ of Player 2 and any n ≥ N Eσ,τ(dC(gn)) ≤ ε. Asymptotically the average outcome remains close in expectation to the target C, uniformly with respect to the

  • pponent’s behavior.

Dual notion : excludability.

Sylvain Sorin ATDG

slide-23
SLIDE 23

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

1

The model

2

Preliminaries

3

Weak approachability and differential games with fixed duration

4

Approachability and B-sets

5

Approachability and qualitative differential games

6

On strategies in the differential games and the repeated games

Sylvain Sorin ATDG

slide-24
SLIDE 24

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

2.1 The “expected deterministic” repeated game form G∗ Alternative two-person infinitely repeated game associated, as the previous one, to the matrix A. At each stage n = 1, 2, ..., Player 1 (resp. Player 2) chooses un ∈ U = ∆(I) (resp. vn ∈ V = ∆(J)), the outcome is g∗

n = unAvn and (un, vn) is announced.

Accordingly, a strategy σ∗ for Player 1 in G∗ is a map from H∗ =∪n≥0H∗

n to U where H∗ n = (U × V)n. A strategy τ ∗ for Player 2

is defined similarly. A couple of strategies induces a play {(un, vn)} and a sequence

  • f outcomes {g∗

n}, and g∗ n = 1 n

n

m=1 g∗ m denotes the average

  • utcome up to stage n.

G∗ is the game played in “mixed moves” or in distribution. Weak ∗approachability, v∗

n and ∗approachability are defined similarly.

Sylvain Sorin ATDG

slide-25
SLIDE 25

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

2.1 The “expected deterministic” repeated game form G∗ Alternative two-person infinitely repeated game associated, as the previous one, to the matrix A. At each stage n = 1, 2, ..., Player 1 (resp. Player 2) chooses un ∈ U = ∆(I) (resp. vn ∈ V = ∆(J)), the outcome is g∗

n = unAvn and (un, vn) is announced.

Accordingly, a strategy σ∗ for Player 1 in G∗ is a map from H∗ =∪n≥0H∗

n to U where H∗ n = (U × V)n. A strategy τ ∗ for Player 2

is defined similarly. A couple of strategies induces a play {(un, vn)} and a sequence

  • f outcomes {g∗

n}, and g∗ n = 1 n

n

m=1 g∗ m denotes the average

  • utcome up to stage n.

G∗ is the game played in “mixed moves” or in distribution. Weak ∗approachability, v∗

n and ∗approachability are defined similarly.

Sylvain Sorin ATDG

slide-26
SLIDE 26

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

2.1 The “expected deterministic” repeated game form G∗ Alternative two-person infinitely repeated game associated, as the previous one, to the matrix A. At each stage n = 1, 2, ..., Player 1 (resp. Player 2) chooses un ∈ U = ∆(I) (resp. vn ∈ V = ∆(J)), the outcome is g∗

n = unAvn and (un, vn) is announced.

Accordingly, a strategy σ∗ for Player 1 in G∗ is a map from H∗ =∪n≥0H∗

n to U where H∗ n = (U × V)n. A strategy τ ∗ for Player 2

is defined similarly. A couple of strategies induces a play {(un, vn)} and a sequence

  • f outcomes {g∗

n}, and g∗ n = 1 n

n

m=1 g∗ m denotes the average

  • utcome up to stage n.

G∗ is the game played in “mixed moves” or in distribution. Weak ∗approachability, v∗

n and ∗approachability are defined similarly.

Sylvain Sorin ATDG

slide-27
SLIDE 27

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

2.1 The “expected deterministic” repeated game form G∗ Alternative two-person infinitely repeated game associated, as the previous one, to the matrix A. At each stage n = 1, 2, ..., Player 1 (resp. Player 2) chooses un ∈ U = ∆(I) (resp. vn ∈ V = ∆(J)), the outcome is g∗

n = unAvn and (un, vn) is announced.

Accordingly, a strategy σ∗ for Player 1 in G∗ is a map from H∗ =∪n≥0H∗

n to U where H∗ n = (U × V)n. A strategy τ ∗ for Player 2

is defined similarly. A couple of strategies induces a play {(un, vn)} and a sequence

  • f outcomes {g∗

n}, and g∗ n = 1 n

n

m=1 g∗ m denotes the average

  • utcome up to stage n.

G∗ is the game played in “mixed moves” or in distribution. Weak ∗approachability, v∗

n and ∗approachability are defined similarly.

Sylvain Sorin ATDG

slide-28
SLIDE 28

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

2.1 The “expected deterministic” repeated game form G∗ Alternative two-person infinitely repeated game associated, as the previous one, to the matrix A. At each stage n = 1, 2, ..., Player 1 (resp. Player 2) chooses un ∈ U = ∆(I) (resp. vn ∈ V = ∆(J)), the outcome is g∗

n = unAvn and (un, vn) is announced.

Accordingly, a strategy σ∗ for Player 1 in G∗ is a map from H∗ =∪n≥0H∗

n to U where H∗ n = (U × V)n. A strategy τ ∗ for Player 2

is defined similarly. A couple of strategies induces a play {(un, vn)} and a sequence

  • f outcomes {g∗

n}, and g∗ n = 1 n

n

m=1 g∗ m denotes the average

  • utcome up to stage n.

G∗ is the game played in “mixed moves” or in distribution. Weak ∗approachability, v∗

n and ∗approachability are defined similarly.

Sylvain Sorin ATDG

slide-29
SLIDE 29

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

2.2 Differential games Consider zero-sum differential games of the kind ˙ x = f(x, u, v) where x is the state and u, v the moves. Assume        (i) U, V are compact subsets of I Rk, (ii) f : I Rk × U × V → I Rk is continuous, (iii) f(., u, v) is a l- Lipschitz map for all (u, v) ∈ U × V, (iv) U is convex, and f is affine in u. (1)

Sylvain Sorin ATDG

slide-30
SLIDE 30

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

2.2 Differential games Consider zero-sum differential games of the kind ˙ x = f(x, u, v) where x is the state and u, v the moves. Assume        (i) U, V are compact subsets of I Rk, (ii) f : I Rk × U × V → I Rk is continuous, (iii) f(., u, v) is a l- Lipschitz map for all (u, v) ∈ U × V, (iv) U is convex, and f is affine in u. (1)

Sylvain Sorin ATDG

slide-31
SLIDE 31

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

2.2 Differential games Consider zero-sum differential games of the kind ˙ x = f(x, u, v) where x is the state and u, v the moves. Assume        (i) U, V are compact subsets of I Rk, (ii) f : I Rk × U × V → I Rk is continuous, (iii) f(., u, v) is a l- Lipschitz map for all (u, v) ∈ U × V, (iv) U is convex, and f is affine in u. (1)

Sylvain Sorin ATDG

slide-32
SLIDE 32

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Sets of controls : U = {u : [0, +∞) → U; u is measurable} and similarly V. Induced dynamics with x0 ∈ I Rk and (u, v) ∈ U × V: ˙ x(t) = f(x(t), u(t), v(t)) for almost every t ≥ 0 x(0) = x0. (2) In addition Isaacs condition, namely : for any ζ ∈ I Rk sup

v∈V

inf

u∈Uζ, f(x, u, v) = inf u∈U sup v∈V

ζ, f(x, u, v). (3)

Sylvain Sorin ATDG

slide-33
SLIDE 33

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Sets of controls : U = {u : [0, +∞) → U; u is measurable} and similarly V. Induced dynamics with x0 ∈ I Rk and (u, v) ∈ U × V: ˙ x(t) = f(x(t), u(t), v(t)) for almost every t ≥ 0 x(0) = x0. (2) In addition Isaacs condition, namely : for any ζ ∈ I Rk sup

v∈V

inf

u∈Uζ, f(x, u, v) = inf u∈U sup v∈V

ζ, f(x, u, v). (3)

Sylvain Sorin ATDG

slide-34
SLIDE 34

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Sets of controls : U = {u : [0, +∞) → U; u is measurable} and similarly V. Induced dynamics with x0 ∈ I Rk and (u, v) ∈ U × V: ˙ x(t) = f(x(t), u(t), v(t)) for almost every t ≥ 0 x(0) = x0. (2) In addition Isaacs condition, namely : for any ζ ∈ I Rk sup

v∈V

inf

u∈Uζ, f(x, u, v) = inf u∈U sup v∈V

ζ, f(x, u, v). (3)

Sylvain Sorin ATDG

slide-35
SLIDE 35

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

1

The model

2

Preliminaries

3

Weak approachability and differential games with fixed duration

4

Approachability and B-sets

5

Approachability and qualitative differential games

6

On strategies in the differential games and the repeated games

Sylvain Sorin ATDG

slide-36
SLIDE 36

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

(Vieille, 1992) The aim is to obtain a good average outcome at stage n. First consider the game G∗. Then use as state variable the cumulative payoff and consider the differential game Λ of fixed duration played on [0, 1] starting from x(0) = 0 with dynamics: ˙ x(t) = u(t)Av(t) and payoff −dC(x(1)). The state variable is x(t) = t

0 gsds with gs being the payoff at

time s. G∗

n appears then as a discrete time approximation of Λ.

Let Φ(t, x) be the value of the game played on [t, 1] starting from x ( i.e. with total outcome x + 1

t gsds).

Sylvain Sorin ATDG

slide-37
SLIDE 37

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

(Vieille, 1992) The aim is to obtain a good average outcome at stage n. First consider the game G∗. Then use as state variable the cumulative payoff and consider the differential game Λ of fixed duration played on [0, 1] starting from x(0) = 0 with dynamics: ˙ x(t) = u(t)Av(t) and payoff −dC(x(1)). The state variable is x(t) = t

0 gsds with gs being the payoff at

time s. G∗

n appears then as a discrete time approximation of Λ.

Let Φ(t, x) be the value of the game played on [t, 1] starting from x ( i.e. with total outcome x + 1

t gsds).

Sylvain Sorin ATDG

slide-38
SLIDE 38

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

(Vieille, 1992) The aim is to obtain a good average outcome at stage n. First consider the game G∗. Then use as state variable the cumulative payoff and consider the differential game Λ of fixed duration played on [0, 1] starting from x(0) = 0 with dynamics: ˙ x(t) = u(t)Av(t) and payoff −dC(x(1)). The state variable is x(t) = t

0 gsds with gs being the payoff at

time s. G∗

n appears then as a discrete time approximation of Λ.

Let Φ(t, x) be the value of the game played on [t, 1] starting from x ( i.e. with total outcome x + 1

t gsds).

Sylvain Sorin ATDG

slide-39
SLIDE 39

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

(Vieille, 1992) The aim is to obtain a good average outcome at stage n. First consider the game G∗. Then use as state variable the cumulative payoff and consider the differential game Λ of fixed duration played on [0, 1] starting from x(0) = 0 with dynamics: ˙ x(t) = u(t)Av(t) and payoff −dC(x(1)). The state variable is x(t) = t

0 gsds with gs being the payoff at

time s. G∗

n appears then as a discrete time approximation of Λ.

Let Φ(t, x) be the value of the game played on [t, 1] starting from x ( i.e. with total outcome x + 1

t gsds).

Sylvain Sorin ATDG

slide-40
SLIDE 40

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

(Vieille, 1992) The aim is to obtain a good average outcome at stage n. First consider the game G∗. Then use as state variable the cumulative payoff and consider the differential game Λ of fixed duration played on [0, 1] starting from x(0) = 0 with dynamics: ˙ x(t) = u(t)Av(t) and payoff −dC(x(1)). The state variable is x(t) = t

0 gsds with gs being the payoff at

time s. G∗

n appears then as a discrete time approximation of Λ.

Let Φ(t, x) be the value of the game played on [t, 1] starting from x ( i.e. with total outcome x + 1

t gsds).

Sylvain Sorin ATDG

slide-41
SLIDE 41

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

(Vieille, 1992) The aim is to obtain a good average outcome at stage n. First consider the game G∗. Then use as state variable the cumulative payoff and consider the differential game Λ of fixed duration played on [0, 1] starting from x(0) = 0 with dynamics: ˙ x(t) = u(t)Av(t) and payoff −dC(x(1)). The state variable is x(t) = t

0 gsds with gs being the payoff at

time s. G∗

n appears then as a discrete time approximation of Λ.

Let Φ(t, x) be the value of the game played on [t, 1] starting from x ( i.e. with total outcome x + 1

t gsds).

Sylvain Sorin ATDG

slide-42
SLIDE 42

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem 1) Φ(x, t) is the unique viscosity solution of d dt Φ(x, t) + valU×V∇Φ(x, t), uAv = 0 with Φ(x, 1) = −dC(x). 2) lim v∗

n = Φ(0, 0)

Sylvain Sorin ATDG

slide-43
SLIDE 43

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The last step is to relate the values in G∗

n and in Gn.

Theorem lim v∗

n = lim vn

Consider an optimal strategy in G∗

n.

Each stage m in this game will correspond to a block of L stages in GLn where player 1 will play i.i.d. the prescribed strategy in G∗ and will define inductively y∗

m as the empirical

distribution of moves of Player 2 during this block. Corollary Every set is weakly approachable or excludable.

Sylvain Sorin ATDG

slide-44
SLIDE 44

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The last step is to relate the values in G∗

n and in Gn.

Theorem lim v∗

n = lim vn

Consider an optimal strategy in G∗

n.

Each stage m in this game will correspond to a block of L stages in GLn where player 1 will play i.i.d. the prescribed strategy in G∗ and will define inductively y∗

m as the empirical

distribution of moves of Player 2 during this block. Corollary Every set is weakly approachable or excludable.

Sylvain Sorin ATDG

slide-45
SLIDE 45

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The last step is to relate the values in G∗

n and in Gn.

Theorem lim v∗

n = lim vn

Consider an optimal strategy in G∗

n.

Each stage m in this game will correspond to a block of L stages in GLn where player 1 will play i.i.d. the prescribed strategy in G∗ and will define inductively y∗

m as the empirical

distribution of moves of Player 2 during this block. Corollary Every set is weakly approachable or excludable.

Sylvain Sorin ATDG

slide-46
SLIDE 46

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The last step is to relate the values in G∗

n and in Gn.

Theorem lim v∗

n = lim vn

Consider an optimal strategy in G∗

n.

Each stage m in this game will correspond to a block of L stages in GLn where player 1 will play i.i.d. the prescribed strategy in G∗ and will define inductively y∗

m as the empirical

distribution of moves of Player 2 during this block. Corollary Every set is weakly approachable or excludable.

Sylvain Sorin ATDG

slide-47
SLIDE 47

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The last step is to relate the values in G∗

n and in Gn.

Theorem lim v∗

n = lim vn

Consider an optimal strategy in G∗

n.

Each stage m in this game will correspond to a block of L stages in GLn where player 1 will play i.i.d. the prescribed strategy in G∗ and will define inductively y∗

m as the empirical

distribution of moves of Player 2 during this block. Corollary Every set is weakly approachable or excludable.

Sylvain Sorin ATDG

slide-48
SLIDE 48

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

1

The model

2

Preliminaries

3

Weak approachability and differential games with fixed duration

4

Approachability and B-sets

5

Approachability and qualitative differential games

6

On strategies in the differential games and the repeated games

Sylvain Sorin ATDG

slide-49
SLIDE 49

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The main notion was introduced by Blackwell: Definition A closed set C in I Rk is a B-set for Player 1 (given A), if for any z / ∈ C, there exists y ∈ πC(z) and a mixed action u = ˆ u(z) in U = ∆(I) such that the hyperplane through y orthogonal to the segment [yz] separates z from uAV: uAv − y, z − y ≤ 0, ∀v ∈ V. where πC(z) denotes the set of closest points to z in C.

Sylvain Sorin ATDG

slide-50
SLIDE 50

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The main notion was introduced by Blackwell: Definition A closed set C in I Rk is a B-set for Player 1 (given A), if for any z / ∈ C, there exists y ∈ πC(z) and a mixed action u = ˆ u(z) in U = ∆(I) such that the hyperplane through y orthogonal to the segment [yz] separates z from uAV: uAv − y, z − y ≤ 0, ∀v ∈ V. where πC(z) denotes the set of closest points to z in C.

Sylvain Sorin ATDG

slide-51
SLIDE 51

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The basic Blackwell’s result is : Theorem Let C be a B-set for Player 1. Then it is approachable in G and ∗approachable in G∗ by that player. An approchability strategy is given by σ(hn) = ˆ u(¯ gn) (resp. σ∗(h∗

n) = ˆ

u(¯ g∗

n)).

The proof for approachability is Proposition 8 in [1]. The other

  • ne is a simple adaptation where the outcome ¯

gn is replaced by ¯ g∗

n.

  • Remark. The previous Proposition implies that a B-set remains

approachable (resp. ∗approachable) in the game where the

  • nly information of Player 1 after stage n is the current outcome

gn (resp. g∗

n) rather than the complete previous history hn (resp.

h∗

n). (natural state variable)

Sylvain Sorin ATDG

slide-52
SLIDE 52

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The basic Blackwell’s result is : Theorem Let C be a B-set for Player 1. Then it is approachable in G and ∗approachable in G∗ by that player. An approchability strategy is given by σ(hn) = ˆ u(¯ gn) (resp. σ∗(h∗

n) = ˆ

u(¯ g∗

n)).

The proof for approachability is Proposition 8 in [1]. The other

  • ne is a simple adaptation where the outcome ¯

gn is replaced by ¯ g∗

n.

  • Remark. The previous Proposition implies that a B-set remains

approachable (resp. ∗approachable) in the game where the

  • nly information of Player 1 after stage n is the current outcome

gn (resp. g∗

n) rather than the complete previous history hn (resp.

h∗

n). (natural state variable)

Sylvain Sorin ATDG

slide-53
SLIDE 53

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The basic Blackwell’s result is : Theorem Let C be a B-set for Player 1. Then it is approachable in G and ∗approachable in G∗ by that player. An approchability strategy is given by σ(hn) = ˆ u(¯ gn) (resp. σ∗(h∗

n) = ˆ

u(¯ g∗

n)).

The proof for approachability is Proposition 8 in [1]. The other

  • ne is a simple adaptation where the outcome ¯

gn is replaced by ¯ g∗

n.

  • Remark. The previous Proposition implies that a B-set remains

approachable (resp. ∗approachable) in the game where the

  • nly information of Player 1 after stage n is the current outcome

gn (resp. g∗

n) rather than the complete previous history hn (resp.

h∗

n). (natural state variable)

Sylvain Sorin ATDG

slide-54
SLIDE 54

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

An important consequence of this property is Theorem A convex set C is either approachable or excludable. A further result due to Spinat [11] characterizes minimal approachable sets: Theorem A set C is approachable iff it contains a B-set.

Sylvain Sorin ATDG

slide-55
SLIDE 55

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

1

The model

2

Preliminaries

3

Weak approachability and differential games with fixed duration

4

Approachability and B-sets

5

Approachability and qualitative differential games

6

On strategies in the differential games and the repeated games

Sylvain Sorin ATDG

slide-56
SLIDE 56

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

To study ∗approachability, we introduce an allternative differential game Γ where both the dynamics and the payoff differs from the previous differential game Λ. The aim is to control the average payoff hence the discrete dynamics on the state variable is of the form ¯ gn+1 − ¯ gn = 1 n + 1(gn+1 − ¯ gn) Continuous counterpart is γ(u, v)(t) = 1

t

t

0 u(s)Av(s)ds.

Change of variable x(s) = γ(es) leads to ˙ x(t) = u(t)Av(t) − x(t). (4) which is the dynamics of a differential game Γ with f(x, u, v) = uAv − x

Sylvain Sorin ATDG

slide-57
SLIDE 57

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

To study ∗approachability, we introduce an allternative differential game Γ where both the dynamics and the payoff differs from the previous differential game Λ. The aim is to control the average payoff hence the discrete dynamics on the state variable is of the form ¯ gn+1 − ¯ gn = 1 n + 1(gn+1 − ¯ gn) Continuous counterpart is γ(u, v)(t) = 1

t

t

0 u(s)Av(s)ds.

Change of variable x(s) = γ(es) leads to ˙ x(t) = u(t)Av(t) − x(t). (4) which is the dynamics of a differential game Γ with f(x, u, v) = uAv − x

Sylvain Sorin ATDG

slide-58
SLIDE 58

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

To study ∗approachability, we introduce an allternative differential game Γ where both the dynamics and the payoff differs from the previous differential game Λ. The aim is to control the average payoff hence the discrete dynamics on the state variable is of the form ¯ gn+1 − ¯ gn = 1 n + 1(gn+1 − ¯ gn) Continuous counterpart is γ(u, v)(t) = 1

t

t

0 u(s)Av(s)ds.

Change of variable x(s) = γ(es) leads to ˙ x(t) = u(t)Av(t) − x(t). (4) which is the dynamics of a differential game Γ with f(x, u, v) = uAv − x

Sylvain Sorin ATDG

slide-59
SLIDE 59

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

To study ∗approachability, we introduce an allternative differential game Γ where both the dynamics and the payoff differs from the previous differential game Λ. The aim is to control the average payoff hence the discrete dynamics on the state variable is of the form ¯ gn+1 − ¯ gn = 1 n + 1(gn+1 − ¯ gn) Continuous counterpart is γ(u, v)(t) = 1

t

t

0 u(s)Av(s)ds.

Change of variable x(s) = γ(es) leads to ˙ x(t) = u(t)Av(t) − x(t). (4) which is the dynamics of a differential game Γ with f(x, u, v) = uAv − x

Sylvain Sorin ATDG

slide-60
SLIDE 60

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

In addition the aim of Player 1 is to stay in a certain set C. We introduce the following definitions: Definition A map α : V → U is a nonanticipative strategy if, for any t ≥ 0 and for any v1 and v2 of V, which coincide almost everywhere

  • n [0, t] of [0, +∞), the images α(v1) and α(v2) coincide also

almost everywhere on [0, t]. M(V, U) is the set of nonanticipative strategies from V to U.

Sylvain Sorin ATDG

slide-61
SLIDE 61

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

In addition the aim of Player 1 is to stay in a certain set C. We introduce the following definitions: Definition A map α : V → U is a nonanticipative strategy if, for any t ≥ 0 and for any v1 and v2 of V, which coincide almost everywhere

  • n [0, t] of [0, +∞), the images α(v1) and α(v2) coincide also

almost everywhere on [0, t]. M(V, U) is the set of nonanticipative strategies from V to U.

Sylvain Sorin ATDG

slide-62
SLIDE 62

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

In addition the aim of Player 1 is to stay in a certain set C. We introduce the following definitions: Definition A map α : V → U is a nonanticipative strategy if, for any t ≥ 0 and for any v1 and v2 of V, which coincide almost everywhere

  • n [0, t] of [0, +∞), the images α(v1) and α(v2) coincide also

almost everywhere on [0, t]. M(V, U) is the set of nonanticipative strategies from V to U.

Sylvain Sorin ATDG

slide-63
SLIDE 63

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Definition A non-empty closed set C in I Rk is a discriminating domain for Player 1, given f if: ∀x ∈ C, ∀p ∈ NPC(x), sup

v∈V

inf

u∈Uf(x, u, v), p ≤ 0,

(5) where NPC(x) is the set of proximal normals to C at x NPC(x) = {p ∈ I RK; dC(x + p) = p} The interpretation is that, at any boundary point x ∈ C, Player 1 can react to any control of Player 2 in order to keep the trajectory in the half space facing a proximal normal p.

Sylvain Sorin ATDG

slide-64
SLIDE 64

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Definition A non-empty closed set C in I Rk is a discriminating domain for Player 1, given f if: ∀x ∈ C, ∀p ∈ NPC(x), sup

v∈V

inf

u∈Uf(x, u, v), p ≤ 0,

(5) where NPC(x) is the set of proximal normals to C at x NPC(x) = {p ∈ I RK; dC(x + p) = p} The interpretation is that, at any boundary point x ∈ C, Player 1 can react to any control of Player 2 in order to keep the trajectory in the half space facing a proximal normal p.

Sylvain Sorin ATDG

slide-65
SLIDE 65

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Definition A non-empty closed set C in I Rk is a discriminating domain for Player 1, given f if: ∀x ∈ C, ∀p ∈ NPC(x), sup

v∈V

inf

u∈Uf(x, u, v), p ≤ 0,

(5) where NPC(x) is the set of proximal normals to C at x NPC(x) = {p ∈ I RK; dC(x + p) = p} The interpretation is that, at any boundary point x ∈ C, Player 1 can react to any control of Player 2 in order to keep the trajectory in the half space facing a proximal normal p.

Sylvain Sorin ATDG

slide-66
SLIDE 66

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The following theorem, due to Cardaliaguet [2], states that Player 1 can ensure remaining in a discriminating domain as soon as he knows, at each time t, Player 2’s control up to time t. Theorem Assume that f satisfies conditions (1), and that C is a closed subset of I

  • Rk. Then C is a discriminating domain if and only if

for every x0 belonging to C, there exists a nonanticipative strategy α ∈ M(V, U), such that for any v ∈ V, the solution x[x0, α(v), v](t) remains in C for every t ≥ 0. We shall say that such a strategy α preserves the set C.

Sylvain Sorin ATDG

slide-67
SLIDE 67

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The following theorem, due to Cardaliaguet [2], states that Player 1 can ensure remaining in a discriminating domain as soon as he knows, at each time t, Player 2’s control up to time t. Theorem Assume that f satisfies conditions (1), and that C is a closed subset of I

  • Rk. Then C is a discriminating domain if and only if

for every x0 belonging to C, there exists a nonanticipative strategy α ∈ M(V, U), such that for any v ∈ V, the solution x[x0, α(v), v](t) remains in C for every t ≥ 0. We shall say that such a strategy α preserves the set C.

Sylvain Sorin ATDG

slide-68
SLIDE 68

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem Let f(x, u, v) = uAv − x. A closed set C ⊂ I Rk is a discriminating domain for Player 1, if and only if C is a B-set for Player 1. First condition: start from z, x = πC(z), there exists u such that for all v uAv − x, z − x ≤ 0. Second condition: start from x and p ∈ NPC(x), then sup

v∈V

inf

u∈Uf(x, u, v), p ≤ 0.

Sylvain Sorin ATDG

slide-69
SLIDE 69

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem Let f(x, u, v) = uAv − x. A closed set C ⊂ I Rk is a discriminating domain for Player 1, if and only if C is a B-set for Player 1. First condition: start from z, x = πC(z), there exists u such that for all v uAv − x, z − x ≤ 0. Second condition: start from x and p ∈ NPC(x), then sup

v∈V

inf

u∈Uf(x, u, v), p ≤ 0.

Sylvain Sorin ATDG

slide-70
SLIDE 70

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem Let f(x, u, v) = uAv − x. A closed set C ⊂ I Rk is a discriminating domain for Player 1, if and only if C is a B-set for Player 1. First condition: start from z, x = πC(z), there exists u such that for all v uAv − x, z − x ≤ 0. Second condition: start from x and p ∈ NPC(x), then sup

v∈V

inf

u∈Uf(x, u, v), p ≤ 0.

Sylvain Sorin ATDG

slide-71
SLIDE 71

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

It is easy to deduce that starting from any point, not necessarily in C one has: Theorem If a closed set C ⊂ I Rk is a B-set for Player 1, there exists a nonanticipative strategy of player 1 in Γ, α ∈ M(V, U), such that for every v ∈ V ∀t ≥ 1 dC(x[α(v), v](t)) ≤ Me−t. (6)

Sylvain Sorin ATDG

slide-72
SLIDE 72

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem A closed set C is ∗approachable for Player 1 in G∗ if and only if it contains a B-set for Player 1 (given A). The direct part follows from Blackwell’s proof. To obtain the converse implication, the proof follows several steps: First, we construct a map Ψ from strategies of Player 1 in G∗ to nonanticipative strategies in Γ. In particular given ε > 0 and a strategy σε that ε-approaches C in G∗, we define its image αε = Ψ(σε). The next step consists in proving that the trajectories in the differential game Γ compatible with αε approach asymptotically C + ε¯ B.

Sylvain Sorin ATDG

slide-73
SLIDE 73

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem A closed set C is ∗approachable for Player 1 in G∗ if and only if it contains a B-set for Player 1 (given A). The direct part follows from Blackwell’s proof. To obtain the converse implication, the proof follows several steps: First, we construct a map Ψ from strategies of Player 1 in G∗ to nonanticipative strategies in Γ. In particular given ε > 0 and a strategy σε that ε-approaches C in G∗, we define its image αε = Ψ(σε). The next step consists in proving that the trajectories in the differential game Γ compatible with αε approach asymptotically C + ε¯ B.

Sylvain Sorin ATDG

slide-74
SLIDE 74

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem A closed set C is ∗approachable for Player 1 in G∗ if and only if it contains a B-set for Player 1 (given A). The direct part follows from Blackwell’s proof. To obtain the converse implication, the proof follows several steps: First, we construct a map Ψ from strategies of Player 1 in G∗ to nonanticipative strategies in Γ. In particular given ε > 0 and a strategy σε that ε-approaches C in G∗, we define its image αε = Ψ(σε). The next step consists in proving that the trajectories in the differential game Γ compatible with αε approach asymptotically C + ε¯ B.

Sylvain Sorin ATDG

slide-75
SLIDE 75

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem A closed set C is ∗approachable for Player 1 in G∗ if and only if it contains a B-set for Player 1 (given A). The direct part follows from Blackwell’s proof. To obtain the converse implication, the proof follows several steps: First, we construct a map Ψ from strategies of Player 1 in G∗ to nonanticipative strategies in Γ. In particular given ε > 0 and a strategy σε that ε-approaches C in G∗, we define its image αε = Ψ(σε). The next step consists in proving that the trajectories in the differential game Γ compatible with αε approach asymptotically C + ε¯ B.

Sylvain Sorin ATDG

slide-76
SLIDE 76

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem A closed set C is ∗approachable for Player 1 in G∗ if and only if it contains a B-set for Player 1 (given A). The direct part follows from Blackwell’s proof. To obtain the converse implication, the proof follows several steps: First, we construct a map Ψ from strategies of Player 1 in G∗ to nonanticipative strategies in Γ. In particular given ε > 0 and a strategy σε that ε-approaches C in G∗, we define its image αε = Ψ(σε). The next step consists in proving that the trajectories in the differential game Γ compatible with αε approach asymptotically C + ε¯ B.

Sylvain Sorin ATDG

slide-77
SLIDE 77

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Theorem A closed set C is ∗approachable for Player 1 in G∗ if and only if it contains a B-set for Player 1 (given A). The direct part follows from Blackwell’s proof. To obtain the converse implication, the proof follows several steps: First, we construct a map Ψ from strategies of Player 1 in G∗ to nonanticipative strategies in Γ. In particular given ε > 0 and a strategy σε that ε-approaches C in G∗, we define its image αε = Ψ(σε). The next step consists in proving that the trajectories in the differential game Γ compatible with αε approach asymptotically C + ε¯ B.

Sylvain Sorin ATDG

slide-78
SLIDE 78

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Then, we show that the ω-limit set of any trajectory compatible with some α ∈ M(V, U) is a nonempty compact discriminating domain for f. Explicitely, let D(α) =

  • θ≥0

cl{x[x0, α(w), w](t); t ≥ θ, w ∈ V}. (where cl is the closure operator). Lemma D(α) is a nonempty compact discriminating domain for Player 1 given f.

Sylvain Sorin ATDG

slide-79
SLIDE 79

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Then, we show that the ω-limit set of any trajectory compatible with some α ∈ M(V, U) is a nonempty compact discriminating domain for f. Explicitely, let D(α) =

  • θ≥0

cl{x[x0, α(w), w](t); t ≥ θ, w ∈ V}. (where cl is the closure operator). Lemma D(α) is a nonempty compact discriminating domain for Player 1 given f.

Sylvain Sorin ATDG

slide-80
SLIDE 80

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Then, we show that the ω-limit set of any trajectory compatible with some α ∈ M(V, U) is a nonempty compact discriminating domain for f. Explicitely, let D(α) =

  • θ≥0

cl{x[x0, α(w), w](t); t ≥ θ, w ∈ V}. (where cl is the closure operator). Lemma D(α) is a nonempty compact discriminating domain for Player 1 given f.

Sylvain Sorin ATDG

slide-81
SLIDE 81

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Recall that Spinat [11] proved that a closed set C is approachable in G if and only if it contains a B-set, hence we deduce the following corollary. Corollary Approachability and ∗approachability coincide.

Sylvain Sorin ATDG

slide-82
SLIDE 82

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

1

The model

2

Preliminaries

3

Weak approachability and differential games with fixed duration

4

Approachability and B-sets

5

Approachability and qualitative differential games

6

On strategies in the differential games and the repeated games

Sylvain Sorin ATDG

slide-83
SLIDE 83

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

First introduce a new notion of strategies crucial for time discretization. Definition A map δ : V → U is a nonanticipative strategy with delay (NAD) if there exits a sequence of times 0 < t1 < t2 < .. < tn < .. going to ∞ with the following property : For every control v1, v2 ∈ U such that v1(s) = v2(s) for almost every s ∈ [0, ti] then δ(v1)(s) = δ(v2)(s) for almost every s ∈ [0, ti+1]. Denote by Md(V, U) the set of such nonanticipative strategies with delay from V to U.

Sylvain Sorin ATDG

slide-84
SLIDE 84

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

First introduce a new notion of strategies crucial for time discretization. Definition A map δ : V → U is a nonanticipative strategy with delay (NAD) if there exits a sequence of times 0 < t1 < t2 < .. < tn < .. going to ∞ with the following property : For every control v1, v2 ∈ U such that v1(s) = v2(s) for almost every s ∈ [0, ti] then δ(v1)(s) = δ(v2)(s) for almost every s ∈ [0, ti+1]. Denote by Md(V, U) the set of such nonanticipative strategies with delay from V to U.

Sylvain Sorin ATDG

slide-85
SLIDE 85

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

First introduce a new notion of strategies crucial for time discretization. Definition A map δ : V → U is a nonanticipative strategy with delay (NAD) if there exits a sequence of times 0 < t1 < t2 < .. < tn < .. going to ∞ with the following property : For every control v1, v2 ∈ U such that v1(s) = v2(s) for almost every s ∈ [0, ti] then δ(v1)(s) = δ(v2)(s) for almost every s ∈ [0, ti+1]. Denote by Md(V, U) the set of such nonanticipative strategies with delay from V to U.

Sylvain Sorin ATDG

slide-86
SLIDE 86

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

We establish a link between preserving NA strategies in the differential game Γ and approachability strategies in the repeated game G. The idea of the construction is the following: a) Given a NA strategy α, show that it can be approximated in term of range by a NAD strategy ¯ α. b) When applied to α preserving C (hence approaching C),

  • btain a NAD strategy ¯

α approaching C. c) This NAD strategy ¯ α produces an ∗approachability strategy in the repeated game G∗. d) Finally ∗approachability strategies in G∗ induce approachability strategies in G.

Sylvain Sorin ATDG

slide-87
SLIDE 87

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

We establish a link between preserving NA strategies in the differential game Γ and approachability strategies in the repeated game G. The idea of the construction is the following: a) Given a NA strategy α, show that it can be approximated in term of range by a NAD strategy ¯ α. b) When applied to α preserving C (hence approaching C),

  • btain a NAD strategy ¯

α approaching C. c) This NAD strategy ¯ α produces an ∗approachability strategy in the repeated game G∗. d) Finally ∗approachability strategies in G∗ induce approachability strategies in G.

Sylvain Sorin ATDG

slide-88
SLIDE 88

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

We establish a link between preserving NA strategies in the differential game Γ and approachability strategies in the repeated game G. The idea of the construction is the following: a) Given a NA strategy α, show that it can be approximated in term of range by a NAD strategy ¯ α. b) When applied to α preserving C (hence approaching C),

  • btain a NAD strategy ¯

α approaching C. c) This NAD strategy ¯ α produces an ∗approachability strategy in the repeated game G∗. d) Finally ∗approachability strategies in G∗ induce approachability strategies in G.

Sylvain Sorin ATDG

slide-89
SLIDE 89

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

We establish a link between preserving NA strategies in the differential game Γ and approachability strategies in the repeated game G. The idea of the construction is the following: a) Given a NA strategy α, show that it can be approximated in term of range by a NAD strategy ¯ α. b) When applied to α preserving C (hence approaching C),

  • btain a NAD strategy ¯

α approaching C. c) This NAD strategy ¯ α produces an ∗approachability strategy in the repeated game G∗. d) Finally ∗approachability strategies in G∗ induce approachability strategies in G.

Sylvain Sorin ATDG

slide-90
SLIDE 90

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

We establish a link between preserving NA strategies in the differential game Γ and approachability strategies in the repeated game G. The idea of the construction is the following: a) Given a NA strategy α, show that it can be approximated in term of range by a NAD strategy ¯ α. b) When applied to α preserving C (hence approaching C),

  • btain a NAD strategy ¯

α approaching C. c) This NAD strategy ¯ α produces an ∗approachability strategy in the repeated game G∗. d) Finally ∗approachability strategies in G∗ induce approachability strategies in G.

Sylvain Sorin ATDG

slide-91
SLIDE 91

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Step a. Range associated to a nonanticipative strategy α ∈ M(V, U): R(α, t) = cl{y ∈ I Rk ∃v ∈ V, y = x[x0, α(v), v](t)}. The next result is due to Cardaliaguet ([4]) and is inspired by the ”extremal aiming” method of Krasowkii and Subbotin [9], and is very much in the spirit of proximal normals and approachability.

Sylvain Sorin ATDG

slide-92
SLIDE 92

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Step a. Range associated to a nonanticipative strategy α ∈ M(V, U): R(α, t) = cl{y ∈ I Rk ∃v ∈ V, y = x[x0, α(v), v](t)}. The next result is due to Cardaliaguet ([4]) and is inspired by the ”extremal aiming” method of Krasowkii and Subbotin [9], and is very much in the spirit of proximal normals and approachability.

Sylvain Sorin ATDG

slide-93
SLIDE 93

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Proposition Consider the differential game (2). For any ε > 0, T > 0 and any nonanticipative strategy α ∈ M(V, U), there exists some nonanticipative strategy with delay α ∈ Md(V, U) such that, for all t ∈ [0, T] and all v ∈ V: dR(α,t)(x[x0, α(v), v](t)) ≤ ε. Assume that xk does not belong to R(α, tk). Then there exists some control vk ∈ V such that yk := x[t0, x0, α(vk), vk](tk) is an approximate closest point to xk in R(α, tk). Note pk := xk − yk and take uk ∈ U such that sup

v∈V

< f(xk, uk, v), pk >= inf

u∈U sup v∈V

< f(xk, u, v), pk >= Ak . (7) In words, uk is optimal in the local game at xk in direction pk.

Sylvain Sorin ATDG

slide-94
SLIDE 94

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Proposition Consider the differential game (2). For any ε > 0, T > 0 and any nonanticipative strategy α ∈ M(V, U), there exists some nonanticipative strategy with delay α ∈ Md(V, U) such that, for all t ∈ [0, T] and all v ∈ V: dR(α,t)(x[x0, α(v), v](t)) ≤ ε. Assume that xk does not belong to R(α, tk). Then there exists some control vk ∈ V such that yk := x[t0, x0, α(vk), vk](tk) is an approximate closest point to xk in R(α, tk). Note pk := xk − yk and take uk ∈ U such that sup

v∈V

< f(xk, uk, v), pk >= inf

u∈U sup v∈V

< f(xk, u, v), pk >= Ak . (7) In words, uk is optimal in the local game at xk in direction pk.

Sylvain Sorin ATDG

slide-95
SLIDE 95

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Proposition Consider the differential game (2). For any ε > 0, T > 0 and any nonanticipative strategy α ∈ M(V, U), there exists some nonanticipative strategy with delay α ∈ Md(V, U) such that, for all t ∈ [0, T] and all v ∈ V: dR(α,t)(x[x0, α(v), v](t)) ≤ ε. Assume that xk does not belong to R(α, tk). Then there exists some control vk ∈ V such that yk := x[t0, x0, α(vk), vk](tk) is an approximate closest point to xk in R(α, tk). Note pk := xk − yk and take uk ∈ U such that sup

v∈V

< f(xk, uk, v), pk >= inf

u∈U sup v∈V

< f(xk, u, v), pk >= Ak . (7) In words, uk is optimal in the local game at xk in direction pk.

Sylvain Sorin ATDG

slide-96
SLIDE 96

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Proposition Consider the differential game (2). For any ε > 0, T > 0 and any nonanticipative strategy α ∈ M(V, U), there exists some nonanticipative strategy with delay α ∈ Md(V, U) such that, for all t ∈ [0, T] and all v ∈ V: dR(α,t)(x[x0, α(v), v](t)) ≤ ε. Assume that xk does not belong to R(α, tk). Then there exists some control vk ∈ V such that yk := x[t0, x0, α(vk), vk](tk) is an approximate closest point to xk in R(α, tk). Note pk := xk − yk and take uk ∈ U such that sup

v∈V

< f(xk, uk, v), pk >= inf

u∈U sup v∈V

< f(xk, u, v), pk >= Ak . (7) In words, uk is optimal in the local game at xk in direction pk.

Sylvain Sorin ATDG

slide-97
SLIDE 97

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The next result relies explicitely on the specific form (4) of the dynamics f in Γ and extends the approximation from a compact interval to to I R+. Proposition Fix x0 ∈ I

  • Rk. For any ε > 0 and any nonanticipative strategy

α ∈ M(V, U) in the game Γ, there is some nonanticipative strategy with delay α ∈ Md(V, U) such that, for all t ≥ 0 and all v ∈ V: dR(α,t)(x[x0, α(v), v](t)) ≤ ε.

Sylvain Sorin ATDG

slide-98
SLIDE 98

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The next result relies explicitely on the specific form (4) of the dynamics f in Γ and extends the approximation from a compact interval to to I R+. Proposition Fix x0 ∈ I

  • Rk. For any ε > 0 and any nonanticipative strategy

α ∈ M(V, U) in the game Γ, there is some nonanticipative strategy with delay α ∈ Md(V, U) such that, for all t ≥ 0 and all v ∈ V: dR(α,t)(x[x0, α(v), v](t)) ≤ ε.

Sylvain Sorin ATDG

slide-99
SLIDE 99

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

In particular, step b) Proposition Let C be a B-set. For any ε > 0 there is some nonanticipative strategy with delay α ∈ Md(V, U) in the game Γ and some T such that for any v in V dC(γ[ α(v), v](t)) ≤ ε, ∀t ≥ T.

Sylvain Sorin ATDG

slide-100
SLIDE 100

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

step c) Proposition For any ε > 0 and any nonanticipative strategy α ∈ M(V, U) preserving C in the game Γ, there is some nonanticipative strategy with delay α ∈ Md(V, U) that induces an ε-approachability strategy σ∗ for C in G∗. Idea is to use the delay to define a strategy that depens only on the past moves.

Sylvain Sorin ATDG

slide-101
SLIDE 101

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The last step d) is Proposition Given σ∗ a strategy that ∗approach C up to ε > 0 in the game G∗, there exists σ a strategy that approach C up to 2ε in the game G.

Sylvain Sorin ATDG

slide-102
SLIDE 102

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

The last step d) is Proposition Given σ∗ a strategy that ∗approach C up to ε > 0 in the game G∗, there exists σ a strategy that approach C up to 2ε in the game G.

Sylvain Sorin ATDG

slide-103
SLIDE 103

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Blackwell D. (1956) An analog of the minmax theorem for vector payoffs, Pacific Journal of Mathematics, 6, 1-8. Cardaliaguet P . (1996) A differential game with two players and one target, SIAM J. Control and Optimization, 34, 1441-1460. Cardaliaguet P . (1997) Nonsmooth semi-permeable barriers, Isaacs equation, and application to a differential game with one target and two players, Applied Mathematics and Optimisation, 36, 125-146. Cardaliaguet P . (1997) Nonzero-sum differential games revisited, Working paper (unpublished). Cardaliaguet P ., M. Quincampoix and P . Saint-Pierre (2007) Differential games through viability theory: Old and recent

Sylvain Sorin ATDG

slide-104
SLIDE 104

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

results, Jorgensen S., M. Quincampoix and T. Vincent (Eds.), Advances in Dynamic Games Theory, Annals of International Society of Dynamic Games, Birkhauser, 3-36. Elliot N.J. and N.J. Kalton (1972) The existence of value in differential games of pursuit and evasion, J. Differential Equations, 12, 504-523. Evans L.C. and Souganidis P .E. (1984) Differential games and representation formulas for solutions of Hamilton-Jacobi equations, Indiana Univ. Math. J., 33, 773-797. Hou T.-F. (1971) Approachability in a two-person game, The Annals of Mathematical Statistics, 42, 735-744.

Sylvain Sorin ATDG

slide-105
SLIDE 105

The model Preliminaries Weak approachability and differential games with fixed duration Approachability and B-sets Approachability and qualitative differential games On strategies in the differential games and the repeated games

Krasovskii N.N. and A.I. Subbotin (1988) Game-Theorical Control Problems, Springer. Roxin E. (1969) The axiomatic approach in differential games, J. Optim. Theory Appl., 3, 153-163. Spinat X. (2002) A necessary and sufficient condition for approachability, Mathematics of Operations Research, 27, 31-44. Varaiya P . (1967) The existence of solution to a differential game, SIAM J. Control Optim., 5, 153-162. Vieille N. (1992) Weak approachability, Mathematics of Operations Research, 17, 781-791.

Sylvain Sorin ATDG