Satisfiability Bounds for -Regular Properties in Bounded-Parameter - - PowerPoint PPT Presentation

satisfiability bounds for regular properties in bounded
SMART_READER_LITE
LIVE PREVIEW

Satisfiability Bounds for -Regular Properties in Bounded-Parameter - - PowerPoint PPT Presentation

Satisfiability Bounds for -Regular Properties in Bounded-Parameter Markov Decision Processes M. Weininger T. Meggendorfer J. Kretinsky Satisfiability Bounds for -Regular Properties in Bounded-Parameter Markov Decision Processes M.


slide-1
SLIDE 1

Satisfiability Bounds for ω-Regular Properties in Bounded-Parameter Markov Decision Processes

  • M. Weininger T. Meggendorfer J. Kretinsky
slide-2
SLIDE 2

Satisfiability Bounds for ω-Regular Properties in Bounded-Parameter Markov Decision Processes

  • M. Weininger T. Meggendorfer J. Kretinsky
slide-3
SLIDE 3

Bounded-Parameter Markov Decision Process

Station Broken Valley Hills Probe

0.8 0.2 0.5 0.5

slide-4
SLIDE 4

Bounded-Parameter Markov Decision Process

Station Broken Valley Hills Probe

[0.1, 1] [0, 0.5] [0.1, 0.5] [0.2, 0.8]

Station

slide-5
SLIDE 5

Bounded-Parameter Markov Decision Process

Station Broken Valley Hills Probe

[0.1, 1] [0, 0.5] [0.2, 0.8]

Station Broken Valley Hills Probe

0.8 0.2 0.5 0.5 [0.1, 0.5]

Station Station

slide-6
SLIDE 6

Bounded-Parameter Markov Decision Process

Station Broken Valley Hills Probe

[0.1, 1] [0, 0.5] [0.2, 0.8]

Station Broken Valley Hills Probe

1 0.5 0.5 [0.1, 0.5]

Station Station

slide-7
SLIDE 7

Satisfiability bounds for ω-Regular Properties

Station Broken Valley Hills Probe

[0.1, 1] [0, 0.5] [0.2, 0.8] [0.1, 0.5]

Station

slide-8
SLIDE 8

Satisfiability bounds for ω-Regular Properties

Station Broken Valley Hills Probe

[0.1, 1] [0, 0.5] [0.2, 0.8]

“Eventually take a probe” F (Probe) “Always take a probe in the future and bring it to the station” G (F (Probe) ∧ Probe ⇒ X (Station))

[0.1, 0.5]

Station

slide-9
SLIDE 9

Satisfiability bounds for ω-Regular Properties

Station Broken Valley Hills Probe

[0.1, 1] [0, 0.5] [0.2, 0.8]

Find optimal controller 𝓜 ≤ ℙ(System ⊨ Property) ≤ 𝓥 “Eventually take a probe” F (Probe) “Always take a probe in the future and bring it to the station” G (F (Probe) ∧ Probe ⇒ X (Station))

[0.1, 0.5]

Station

slide-10
SLIDE 10

Semantics of the intervals

𝓜: Adversarial Environment 𝓥: Design choice

slideshare.net/jefffarias9 letsgetsciencey.com/best-microscope-for-kids/

slide-11
SLIDE 11

Semantics of the intervals

𝓜: Adversarial Environment 𝓥: Design choice

slideshare.net/jefffarias9 letsgetsciencey.com/best-microscope-for-kids/

slide-12
SLIDE 12

Resolving intervals in ”Design choice” setting

Broken Hills

[0.1, 1] [0, 0.5]

Station

slide-13
SLIDE 13

Resolving intervals in ”Design choice” setting

Broken Hills

[0.1, 1] [0, 0.5]

Station Broken Hills Design Choice Station Station

slide-14
SLIDE 14

Resolving intervals in ”Design choice” setting

Broken Hills

[0.1, 1] [0, 0.5]

Station Broken Hills Design Choice Station Station

1

slide-15
SLIDE 15

Resolving intervals in ”Design choice” setting

Broken Hills

[0.1, 1] [0, 0.5]

Station Broken Hills Design Choice

0.5 0.5

Station Station

1

slide-16
SLIDE 16

Resolving intervals in ”Design choice” setting

Broken Hills

[0.1, 1] [0, 0.5]

Station Broken Hills Design Choice

0.5 0.5 0.3127 0.6983

Station Station

1

slide-17
SLIDE 17

Resolving intervals in ”Design choice” setting

Broken Hills

[0.1, 1] [0, 0.5]

Station Broken Hills Design Choice

0.5 0.5 0.3127 0.6983

Station Station

1

...

slide-18
SLIDE 18

Resolving intervals in ”Design choice” setting

Broken Hills

[0.1, 1] [0, 0.5]

Station Broken Hills Design Choice

0.5 0.5 1

Station Station

slide-19
SLIDE 19

Resolving intervals in ”Design choice” setting

Broken Hills

[0.1, 1] [0, 0.5]

Station Broken Hills Design Choice

0.5 0.5 1

Station Station

Basic Feasible Solutions [HM18]

slide-20
SLIDE 20

Resolving intervals in ”Design choice” setting

Broken Hills

[0.1, 1] [0, 0.5]

Station Broken Hills Design Choice

0.5 0.5 1

Station Station

Basic Feasible Solutions [HM18] Solving MDP e.g. [Put94] yields controller and probability

slide-21
SLIDE 21

Idea in short

1. New state for every action 2. Basic feasible solutions as its actions 3. Solve MDP

bpMDP MDP

Design choice

slide-22
SLIDE 22

Idea in short

1. New state for every action (other player!) 2. Basic feasible solutions as its actions 3. Solve MDP Stochastic Game

bpMDP MDP SG

Design choice A d v e r s a r i a l

slide-23
SLIDE 23

Idea in short

1. New state for every action (other player!) 2. Basic feasible solutions as its actions 3. Solve MDP Stochastic Game

bpMDP MDP SG

Design choice A d v e r s a r i a l

Solving SG e.g. [CH06] yields controller and probability

slide-24
SLIDE 24

The bigger picture

bpMDP MDP SG

Design choice A d v e r s a r i a l

slide-25
SLIDE 25

The bigger picture

bpMDP MDP IMC SG

Design choice A d v e r s a r i a l

slide-26
SLIDE 26

The bigger picture

bpMDP MDP IMC SG

A d v e r s a r i a l Design choice A d v e r s a r i a l [ H M 1 8 ] E X P

slide-27
SLIDE 27

The bigger picture

bpMDP MDP IMC SG

A d v e r s a r i a l Design choice A d v e r s a r i a l [ H M 1 8 ] E X P EXP E X P

slide-28
SLIDE 28

The bigger picture

bpMDP MDP IMC SG

A d v e r s a r i a l Design choice A d v e r s a r i a l [ H M 1 8 ] E X P

MC

Design choice EXP [DC18] POL E X P

slide-29
SLIDE 29

The bigger picture

bpMDP MDP IMC SG

A d v e r s a r i a l Design choice A d v e r s a r i a l [ H M 1 8 ] E X P

MC

Design choice POL [DC18] POL E X P

slide-30
SLIDE 30

The bigger picture

bpMDP MDP IMC SG

A d v e r s a r i a l Design choice A d v e r s a r i a l [ H M 1 8 ] E X P

MC

Design choice POL [DC18] POL E X P [ D C 1 8 ] P O L

slide-31
SLIDE 31

The bigger picture

bpMDP MDP IMC SG

A d v e r s a r i a l Design choice A d v e r s a r i a l [ H M 1 8 ] E X P

MC

Design choice POL [DC18] POL E X P [ D C 1 8 ] P O L

bpSG

Adversarial Design choice

slide-32
SLIDE 32

Future work

  • Practical implementation (using our previous work)
slide-33
SLIDE 33

Future work

  • Practical implementation (using our previous work)
  • Other imprecisions in system model, e.g. parametrized MDPs
  • Multiple objectives