learning based synthesis of safety controllers
play

Learning-based Synthesis of Safety Controllers Oliver Markgraf 1,2 - PowerPoint PPT Presentation

Learning-based Synthesis of Safety Controllers Oliver Markgraf 1,2 Daniel Neider 1 1 Max Planck Institute for Software Systems 2 Technical University of Kaiserslautern FMCAD 2019, San Jose, California, USA 24 October 2019 Motivation Oliver


  1. Learning-based Synthesis of Safety Controllers Oliver Markgraf 1,2 Daniel Neider 1 1 Max Planck Institute for Software Systems 2 Technical University of Kaiserslautern FMCAD 2019, San Jose, California, USA 24 October 2019

  2. Motivation Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 2

  3. Synthesis of Reactive Controllers Specification + Environment Infinite duration, two-player game over a graph Strategy / Controller Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 2

  4. Safety Games Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  5. Safety Games ◮ Vertices of Player 0 V 0 Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  6. Safety Games ◮ Vertices of Player 0 V 0 , vertices of Player 1 V 1 Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  7. Safety Games ◮ Vertices of Player 0 V 0 , vertices of Player 1 V 1 ◮ Edges E Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  8. Safety Games ◮ Vertices of Player 0 V 0 , vertices of Player 1 V 1 ◮ Edges E ◮ Initial vertices I Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  9. Safety Games ◮ Vertices of Player 0 V 0 , vertices of Player 1 V 1 ◮ Edges E ◮ Initial vertices I ◮ Safe vertices F := Player 0 := Player 1 := Initial := Safe Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  10. Safety Games ◮ Vertices of Player 0 V 0 , vertices of Player 1 V 1 ◮ Edges E ◮ Initial vertices I ◮ Safe vertices F := Player 0 := Player 1 := Initial := Safe Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  11. Safety Games ◮ Vertices of Player 0 V 0 , vertices of Player 1 V 1 ◮ Edges E ◮ Initial vertices I ◮ Safe vertices F := Player 0 := Player 1 := Initial := Safe Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  12. Safety Games ◮ Vertices of Player 0 V 0 , vertices of Player 1 V 1 ◮ Edges E ◮ Initial vertices I ◮ Safe vertices F := Player 0 := Player 1 := Initial := Safe Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  13. Safety Games ◮ Vertices of Player 0 V 0 , vertices of Player 1 V 1 ◮ Edges E ◮ Initial vertices I ◮ Safe vertices F := Player 0 := Player 1 := Initial := Safe Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  14. Safety Games ◮ Successively remove vertices from which a stay inside the safe vertices cannot be enforced := Player 0 := Player 1 := Initial := Safe := Winning region Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  15. Safety Games ◮ Successively remove vertices from which a stay inside the safe vertices cannot be enforced := Player 0 := Player 1 := Initial := Safe := Winning region Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  16. Safety Games ◮ Successively remove vertices from which a stay inside the safe vertices cannot be enforced := Player 0 := Player 1 := Initial := Safe := Winning region Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  17. Safety Games ◮ Successively remove vertices from which a stay inside the safe vertices cannot be enforced := Player 0 := Player 1 := Initial := Safe := Winning region Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  18. Safety Games ◮ Successively remove vertices from which a stay inside the safe vertices cannot be enforced ◮ Winning strategy for Player 0, winning strategy for Player 1 := Player 0 := Player 1 := Initial := Safe := Winning region Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 3

  19. Motivation Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 4

  20. Outline 1. Example encoding of a safety game over Linear Real Arithmetic 2. Solving Safety Games via Learning 3. Evaluation Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 5

  21. 1. Example encoding of a safety game over Linear Real Arithmetic

  22. Safety Games Definition A safety game is a five-tuple G = ( V 0 , V 1 , E , I , F ) consisting of ◮ a set V 0 encoding the vertices of Player 0 ◮ a set V 1 encoding the vertices of Player 1 ◮ a set I encoding the initial vertices ◮ a set F encoding the safe vertices ◮ a relation E ⊆ V × V encoding the edges Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 6

  23. Safety Games Definition A safety game is a five-tuple G = ( V 0 , V 1 , E , I , F ) consisting of ◮ a set V 0 encoding the vertices of Player 0 ◮ a set V 1 encoding the vertices of Player 1 ◮ a set I encoding the initial vertices ◮ a set F encoding the safe vertices ◮ a relation E ⊆ V × V encoding the edges Assumption Each vertex has only a finite number of successors Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 6

  24. Safety Games Over Infinite Game Graphs – Example Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 7

  25. Safety Games Over Infinite Game Graphs – Example Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 7

  26. Safety Games Over Infinite Game Graphs – Example . . . 0 1 2 3 4 5 Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 7

  27. Safety Games Over Infinite Game Graphs – Example . . . 0 1 2 3 4 5 . . . . . . 0 . 27 1 . 27 2 . 27 3 . 27 4 . 27 . . . Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 7

  28. Safety Games Over Infinite Game Graphs – Example . . . . . . 0 . 27 1 . 27 2 . 27 3 . 27 4 . 27 . . . Let x ∈ R be the position of the robot and p ∈ { 0 , 1 } indicate which player is in control of the robot φ V 0 ( x , p ) := p = 0 φ V 1 ( x , p ) := p = 1 Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 8

  29. Safety Games Over Infinite Game Graphs – Example . . . . . . 0 . 27 1 . 27 2 . 27 3 . 27 4 . 27 . . . φ I ( x , p ) := x ≥ 3 ∧ x < 4 ∧ p = 0 φ F ( x , p ) := x ≥ 2 Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 8

  30. Safety Games Over Infinite Game Graphs – Example . . . . . . 0 . 27 1 . 27 2 . 27 3 . 27 4 . 27 . . . Model robot movements φ Move _ Right ( x , p , x ′ , p ′ ) := x ′ = x + 1 ∧ p = 1 − p ′ φ Move _ Left ( x , p , x ′ , p ′ ) := x ′ = x − 1 ∧ p = 1 − p ′ Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 8

  31. Safety Games Over Infinite Game Graphs – Example . . . . . . 0 . 27 1 . 27 2 . 27 3 . 27 4 . 27 . . . Model the edge relation E φ E ( x , p , x ′ , p ′ ) := φ Move _ Right ∨ φ Move _ Left Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 8

  32. Safety Games Over Infinite Game Graphs – Example . . . . . . 0 . 27 1 . 27 2 . 27 3 . 27 4 . 27 . . . Winning set W W = x ≥ 3 Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 8

  33. Winning Sets F I Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 9

  34. Winning Sets W F I Winning Set A W of vertices is a winning set if is satisfies ◮ I ⊆ W ◮ W ⊆ F ◮ E ( { v } ) ∩ W � = ∅ for all v ∈ W ∩ V 0 ( existential closedness ) ◮ E ( { v } ) ⊆ W for all v ∈ W ∩ V 1 ( universal closedness ). Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 9

  35. 2. Solving Safety Games via Learning

  36. Counterexample-Guided Inductive Synthesis Hypothesis H ⊆ V Learner Teacher Counterexample Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 10

  37. Counterexample-Guided Inductive Synthesis Hypothesis H ⊆ V Learner Teacher Counterexample Teacher ◮ implementation based on SMT-solver Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 10

  38. Winning Sets W F I Winning Set A W of vertices is a winning set if is satisfies ◮ I ⊆ W ◮ W ⊆ F ◮ E ( { v } ) ∩ W � = ∅ for all v ∈ W ∩ V 0 ( existential closedness ) ◮ E ( { v } ) ⊆ W for all v ∈ W ∩ V 1 ( universal closedness ). Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 11

  39. Teacher F I Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 12

  40. Teacher F I v Counterexample Let H be the Hypothesis ◮ Positive counterexample: v ∈ I \ H Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 12

  41. Teacher F v I Counterexample Let H be the Hypothesis ◮ Positive counterexample: v ∈ I \ H ◮ Negative counterexample: v ∈ H \ F Oliver Markgraf and Daniel Neider: Learning-based Synthesis of Safety Controllers 12

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend