Game‐Theoretic Learning for Verification and Control
Sanjit A. Seshia
Professor EECS, UC Berkeley
Dagstuhl Seminar March 16, 2017
Game Theoretic Learning for Verification and Control Sanjit A. - - PowerPoint PPT Presentation
Game Theoretic Learning for Verification and Control Sanjit A. Seshia Professor EECS, UC Berkeley Joint work with Dorsa Sadigh, Jon Kotker, Daniel Bundala, Anca Dragan, Alexander Rakhlin, S. Shankar Sastry Dagstuhl Seminar March 16, 2017 Two
Dagstuhl Seminar March 16, 2017
Control: Human Cyber-Physical Systems (e.g. autonomous/semi-autonomous driving) Learning (Synthesizing) Models of Human Behavior
2
Verification: Timing Analysis of Embedded Software Learning (Synthesizing) Model of Platform (how platform impacts a program’s timing behavior)
3
5
that Leverages Effects on Human Actions. In RSS, 2016.
8
9
∗ .
, , ∗ ,
, ,
∗ as optimizing
10
,
,
(a) Features for the boundaries of the road (b) Feature for staying inside the lanes. (c) Features for avoiding
11
, ,
, ,
, ,
Newton): (solve using L-BFGS technique)
, , ∗ ,
y of Human Vehicle x of Autonomous Vehicle
20
Control: Human Cyber-Physical Systems (e.g. autonomous/semi-autonomous driving) Learning (Synthesizing) Models of Human Behavior
21
Verification: Timing Analysis of Embedded Software Learning (Synthesizing) Model of Platform (how platform impacts a program’s timing behavior)
22
Game-Theoretic Learning. In ACM Trans. Embed. Sys., 2012.
2008.
– 23 –
– 24 –
flag!=0 flag!=0 flag=1; (*x)++;
Program CFG unrolled to a DAG
– 25 –
– 26 –
Measurement-based, but minimal instrumentation
Learn Environment Model
Online, randomized algorithm: GameTime
Uses satisfiability modulo theories (SMT) solvers
– 27 –
Complexity Path Space x Platform State Space
Model as a 2-player Game: Tool vs. Platform
Questions:
– 28 –
– 29 –
– 30 –
– 31 –
Tool wins iff its prediction is correct
– 32 –
Mean Perturbation Assumption: x Paths | E [ x . t ] | ≤ max
(exec. time)
– 33 –
GameTime is Efficient
Accurately predicts WCET for complex platforms
Basis paths effectively encode information about
GameTime can accurately estimate the distribution
34
Verification/Analysis Control/Synthesis
beforehand, observe entire system state
rational agent, not actively violating robot’s obj.