SLIDE 1 Safe AI for CPS
André Platzer, Carnegie Mellon University. Joint work with Nathan Fulton.
SLIDE 2
Safety-Critical Systems
"How can we provide people with cyber-physical systems they can bet their lives on?" - Jeannette Wing
SLIDE 3
Safety-Critical Systems
"How can we provide people with cyber-physical systems they can bet their lives on?" - Jeannette Wing
SLIDE 4 This Talk
Ensure the safety of autonomous cyber-physical systems. Best of both worlds: learning together with CPS safety.
- Flexibility of learning
- Guarantees of CPS formal methods
Diametrically opposed: flexibility + adaptability versus predictability + simplicity
- 1. Cyber-Physical Systems with Differential Dynamic Logic
- 2. Sandboxed reinforcement learning is provably safe
- 3. Model-update learning addresses uncertainty with multiple models
SLIDE 5 Airborne Collision Avoidance System ACAS X
- Developed by the FAA to replace the current TCAS in aircraft
- Approximately optimizes an MDP on a grid
- Advisory from lookup tables with 5D interpolation regions
- Identified a safe region per advisory and proved it in KeYmaera X
[Diagram: sequence of advisories with pilot delay]
STTT'17
SLIDE 6 Comparison: ACAS X issues DNC
But CL1500 or no change would not lead to a collision.
[Plot: altitude h (ft) vs. time to crossing (s); intruder path, own path with DNC (following the advisory) vs. path without DNC (no change)]
SLIDE 7
Model-Based Verification
φ
Reinforcement Learning
SLIDE 8 Model-Based Verification
pos < stopSign
Reinforcement Learning
SLIDE 9 Model-Based Verification
pos < stopSign
Reinforcement Learning
ctrl
SLIDE 10 Approach: prove that control software achieves a specification with respect to a model of the physical system.
Model-Based Verification
pos < stopSign
Reinforcement Learning
ctrl
SLIDE 11 Approach: prove that control software achieves a specification with respect to a model of the physical system.
Model-Based Verification
pos < stopSign
Reinforcement Learning
ctrl
SLIDE 12 Benefits:
- Strong safety guarantees
- Automated analysis
Model-Based Verification
φ
Reinforcement Learning
SLIDE 13 Benefits:
- Strong safety guarantees
- Automated analysis
Drawbacks:
- Control policies are typically non-deterministic: they answer “what is safe”, not “what is useful”
Model-Based Verification
φ
Reinforcement Learning
SLIDE 14 Benefits:
- Strong safety guarantees
- Automated analysis
Drawbacks:
- Control policies are typically non-deterministic: they answer “what is safe”, not “what is useful”
Model-Based Verification
φ
Reinforcement Learning
SLIDE 15 Benefits:
- Strong safety guarantees
- Automated analysis
Drawbacks:
- Control policies are typically non-deterministic: they answer “what is safe”, not “what is useful”
Model-Based Verification
φ
Reinforcement Learning
Observe Act
SLIDE 16 Benefits:
- Strong safety guarantees
- Automated analysis
Drawbacks:
- Control policies are typically non-deterministic: they answer “what is safe”, not “what is useful”
Model-Based Verification
φ
Reinforcement Learning
Observe Act
Benefits:
- No need for complete model
- Optimal (effective) policies
SLIDE 17 Benefits:
- Strong safety guarantees
- Automated analysis
Drawbacks:
- Control policies are typically non-deterministic: they answer “what is safe”, not “what is useful”
Model-Based Verification
φ
Reinforcement Learning
Observe Act
Benefits:
- No need for complete model
- Optimal (effective) policies
Drawbacks:
- No strong safety guarantees
- Proofs are obtained and checked by hand
- Formal proofs = decades-long proof development
SLIDE 18 Benefits:
- Strong safety guarantees
- Computational aids (ATP)
Drawbacks:
- Control policies are typically non-deterministic: they answer “what is safe”, not “what is useful”
Model-Based Verification
φ
Reinforcement Learning
Observe Act
Benefits:
- No need for complete model
- Optimal (effective) policies
Drawbacks:
- No strong safety guarantees
- Proofs are obtained and checked by hand
- Formal proofs = decades-long proof development
Goal: Provably correct reinforcement learning
SLIDE 19 Benefits:
- Strong safety guarantees
- Computational aids (ATP)
Drawbacks:
- Control policies are typically non-deterministic: they answer “what is safe”, not “what is useful”
Model-Based Verification
φ
Reinforcement Learning
Observe Act
Benefits:
- No need for complete model
- Optimal (effective) policies
Drawbacks:
- No strong safety guarantees
- Proofs are obtained and checked by hand
- Formal proofs = decades-long proof development
Goal: Provably correct reinforcement learning
- 1. Learn Safety
- 2. Learn a Safe Policy
- 3. Justify claims of safety
SLIDE 20
Part I: Differential Dynamic Logic
Trustworthy Proofs for Hybrid Systems
SLIDE 21 Hybrid Programs
x := t (changes the state x=x0, y=y0, z=z0, … to x=t, y=y0, z=z0, …)
SLIDE 22 Hybrid Programs
a;b (runs a, then b)
x := t (changes the state x=x0, y=y0, z=z0, … to x=t, y=y0, z=z0, …)
SLIDE 23 Hybrid Programs
?P (if P is true: no change; if P is false: terminate)
a;b (runs a, then b)
x := t (changes the state x=x0, y=y0, z=z0, … to x=t, y=y0, z=z0, …)
SLIDE 24 Hybrid Programs
?P (if P is true: no change; if P is false: terminate)
a;b (runs a, then b)
a* (repeats a any number of times)
x := t (changes the state x=x0, y=y0, z=z0, … to x=t, y=y0, z=z0, …)
SLIDE 25 Hybrid Programs
a∪b (nondeterministic choice between a and b)
?P (if P is true: no change; if P is false: terminate)
a;b (runs a, then b)
a* (repeats a any number of times)
x := t (changes the state x=x0, y=y0, z=z0, … to x=t, y=y0, z=z0, …)
SLIDE 26 Hybrid Programs
a∪b (nondeterministic choice between a and b)
?P (if P is true: no change; if P is false: terminate)
a;b (runs a, then b)
a* (repeats a any number of times)
x’=f(x) (continuous evolution: x follows the ODE solution F, passing x=F(0), …, x=F(T), for some duration T)
x := t (changes the state x=x0, y=y0, z=z0, … to x=t, y=y0, z=z0, …)
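Collected in one place, the hybrid program constructs from this build-up (the evolution-domain constraint & Q appears in the models and axioms on later slides):

```latex
a, b \;::=\; x := t \;\mid\; ?P \;\mid\; a;\,b \;\mid\; a \cup b \;\mid\; a^{*} \;\mid\; \{x' = f(x) \;\&\; Q\}
```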
SLIDE 27 Approaching a Stopped Car
Is this property true?
Stopped Car Own Car
[ { {accel ∪ brake}; t:=0; {pos’=vel,vel’=accel,t’=1 & vel≥0 & t≤T} }* ](pos ≤ stoppedCarPos)
SLIDE 28 Approaching a Stopped Car
Assuming we only accelerate when it’s safe to do so, is this property true?
Stopped Car Own Car
[ { {accel ∪ brake}; t:=0; {pos’=vel,vel’=accel,t’=1 & vel≥0 & t≤T} }* ](pos ≤ stoppedCarPos)
SLIDE 29 Approaching a Stopped Car
Stopped Car Own Car safeDistance(pos,vel,stoppedCarPos,B)
If we also assume the system is safe initially:
safeDistance(pos,vel,stoppedCarPos,B) → [ { {accel ∪ brake}; t:=0; {pos’=vel,vel’=accel,t’=1 & vel≥0 & t≤T} }* ](pos ≤ stoppedCarPos)
SLIDE 30 safeDistance(pos,vel,stoppedCarPos,B) → [ { {accel ∪ brake}; t:=0; {pos’=vel,vel’=accel,t’=1 & vel≥0 & t≤T} }* ](pos ≤ stoppedCarPos)
Approaching a Stopped Car
Stopped Car Own Car safeDistance(pos,vel,stoppedCarPos,B)
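For concreteness, a plausible instantiation of safeDistance (an assumed instantiation; the slides keep it symbolic): the car must still be able to brake to a standstill before the stopped car, given braking rate B > 0:

```latex
\mathit{safeDistance}(\mathit{pos}, \mathit{vel}, \mathit{stoppedCarPos}, B)
\;\equiv\;
\mathit{pos} + \frac{\mathit{vel}^2}{2B} \;\le\; \mathit{stoppedCarPos}
```

The “only accelerate when it’s safe” assumption from the previous slide would then additionally budget for up to T seconds of acceleration before braking can begin.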
SLIDE 31
The Fundamental Question
Proofs give strong mathematical evidence of safety. Why would our program not work if we have a proof?
SLIDE 32 The Fundamental Question
Why would our program not work if we have a proof?
- 1. Was the proof correct?
SLIDE 33 The Fundamental Question
Why would our program not work if we have a proof?
- 1. Was the proof correct?
- 2. Was the model accurate enough? (model ≠ reality)
SLIDE 34 The Fundamental Question
Why would our program not work if we have a proof?
- 1. Was the proof correct? KeYmaera X
- 2. Was the model accurate enough?
DI Axiom: [{x'=f&Q}]P ↔ ([?Q]P ← (Q → [{x'=f&Q}](P)'))
Example: [v'=r·p·v²-g, t'=1] v ≥ v₀ - g·t ↔ … ↔ [v':=r·p·v²-g][t':=1] v' ≥ -g·t' ↔ r·p·v²-g ≥ -g ↔ H → r·p ≥ 0
Side derivation: (v ≥ v₀ - g·t)' ↔ … ↔ … ↔ …
dI Tactic: H = r·p ≥ 0 & r·a ≥ 0 & g > 0 & …
[Diagram: Axioms at the base, the KeYmaera X (KyX) core, and QED, supported by ODE & controls tooling and clever Bellerophon programs]
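Spelled out, the differential-induction step in the example above (a reconstruction of the slide fragments):

```latex
\begin{aligned}
&[\{v' = r\,p\,v^2 - g,\ t' = 1\}]\ (v \ge v_0 - g\,t) \\
&\quad \xleftarrow{\ \mathrm{dI}\ }\ [v' := r\,p\,v^2 - g][t' := 1]\ (v' \ge -g\,t')
\ \leftrightarrow\ r\,p\,v^2 - g \ge -g
\ \leftrightarrow\ r\,p\,v^2 \ge 0
\end{aligned}
```

The final arithmetic fact follows from the assumptions H, in particular r·p ≥ 0, since v² is never negative.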
SLIDE 35 The Fundamental Question
Why would our program not work if we have a proof?
- 1. Was the proof correct? KeYmaera X
- 2. Was the model accurate enough? Safe RL
DI Axiom: [{x'=f&Q}]P ↔ ([?Q]P ← (Q → [{x'=f&Q}](P)'))
Example: [v'=r·p·v²-g, t'=1] v ≥ v₀ - g·t ↔ … ↔ [v':=r·p·v²-g][t':=1] v' ≥ -g·t' ↔ r·p·v²-g ≥ -g ↔ H → r·p ≥ 0
Side derivation: (v ≥ v₀ - g·t)' ↔ … ↔ … ↔ …
dI Tactic: H = r·p ≥ 0 & r·a ≥ 0 & g > 0 & …
[Diagram: Axioms at the base, the KeYmaera X (KyX) core, and QED, supported by ODE & controls tooling and clever Bellerophon programs]
SLIDE 36 Part II: Justified Speculative Control
Safe reinforcement learning in partially modeled environments
AAAI 2018
SLIDE 37
Model-Based Verification
Accurate, analyzable models often exist!
{ {?safeAccel;accel ∪ brake ∪ ?safeTurn; turn}; {pos’ = vel, vel’ = acc} }*
SLIDE 38
Model-Based Verification
Accurate, analyzable models often exist!
{ {?safeAccel;accel ∪ brake ∪ ?safeTurn; turn}; {pos’ = vel, vel’ = acc} }*
discrete control; continuous motion
SLIDE 39
Model-Based Verification
Accurate, analyzable models often exist!
{ {?safeAccel;accel ∪ brake ∪ ?safeTurn; turn}; {pos’ = vel, vel’ = acc} }*
discrete, non-deterministic control; continuous motion
SLIDE 40
Model-Based Verification
Accurate, analyzable models often exist!
init → [{ { ?safeAccel;accel ∪ brake ∪ ?safeTurn; turn}; {pos’ = vel, vel’ = acc} }*]pos < stopSign
SLIDE 41
Model-Based Verification
Accurate, analyzable models often exist! Formal verification gives strong safety guarantees.
init → [{ { ?safeAccel;accel ∪ brake ∪ ?safeTurn; turn}; {pos’ = vel, vel’ = acc} }*]pos < stopSign
SLIDE 42 Model-Based Verification
Accurate, analyzable models often exist! Formal verification gives strong safety guarantees.
- Computer-checked proofs of safety specifications
SLIDE 43 Model-Based Verification
Accurate, analyzable models often exist! Formal verification gives strong safety guarantees.
- Computer-checked proofs of safety specifications
- Formal proofs mapping the model to runtime monitors
SLIDE 44
Model-Based Verification Isn’t Enough
Perfect, analyzable models don’t exist!
SLIDE 45
Model-Based Verification Isn’t Enough
Perfect, analyzable models don’t exist!
{ { ?safeAccel;accel ∪ brake ∪ ?safeTurn; turn}; {pos’ = vel, vel’ = acc} }*
How to implement? Only accurate sometimes
SLIDE 46
Model-Based Verification Isn’t Enough
Perfect, analyzable models don’t exist!
{ { ?safeAccel;accel ∪ brake ∪ ?safeTurn; turn}; {dx’=w*y, dy’=-w*x, ...} }*
How to implement? Only accurate sometimes
SLIDE 47 Safe RL Contribution
Justified Speculative Control is an approach toward provably safe reinforcement learning that:
- 1. learns to resolve nondeterminism without sacrificing formal safety results
SLIDE 48 Safe RL Contribution
Justified Speculative Control is an approach toward provably safe reinforcement learning that:
- 1. learns to resolve nondeterminism without sacrificing formal safety results
- 2. allows and directs speculation whenever model mismatches occur
SLIDE 49 Learning to Resolve Non-determinism
Observe & compute reward Act
SLIDE 50 Learning to Resolve Non-determinism
Observe & compute reward
accel ∪ brake ∪ turn
SLIDE 51 Learning to Resolve Non-determinism
Observe & compute reward
{accel,brake,turn}
SLIDE 52 Learning to Resolve Non-determinism
⇨
Observe & compute reward
Policy
{accel,brake,turn}
SLIDE 53 Learning to Resolve Non-determinism
⇨
Observe & compute reward
(safe?) Policy
{accel,brake,turn}
SLIDE 54 Learning to Safely Resolve Non-determinism
⇨
Observe & compute reward
(safe?) Policy
Safety Monitor
Useful to stay safe during learning. Crucial after deployment.
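A minimal sketch of what the safety monitor does to the learner's choice (names such as safety_monitor and the braking fallback are illustrative assumptions, not the paper's API):

```python
def sandboxed_action(state, actions, q_value, safety_monitor):
    """Pick the best-valued action among those the verified monitor accepts.

    safety_monitor(state, action) stands in for the controller monitor
    extracted from the proof; q_value(state, action) is the learner's estimate.
    """
    safe_actions = [a for a in actions if safety_monitor(state, a)]
    if not safe_actions:
        # Assumed verified fallback: in the car models, braking is always safe.
        return "brake"
    return max(safe_actions, key=lambda a: q_value(state, a))
```

For instance, sandboxed_action(s, ["accel", "brake", "turn"], Q, ctrl_monitor) lets the learner optimize freely inside the set of provably safe actions.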
SLIDE 55 Learning to Safely Resolve Non-determinism
⇨
Observe & compute reward
(safe?) Policy
Safety Monitor ≠ “Trust Me”
SLIDE 56 Learning to Safely Resolve Non-determinism
⇨
Observe & compute reward
(safe?) Policy
φ
Use a theorem prover to extract: (init→[{{accel∪brake};ODEs}*](safe))
φ
SLIDE 57 Learning to Safely Resolve Non-determinism
⇨
Observe & compute reward
(safe?) Policy
φ
Use a theorem prover to extract: (init→[{{accel∪brake};ODEs}*](safe))
φ
SLIDE 58 (safe?) Policy
Learning to Safely Resolve Non-determinism
⇨
Observe & compute reward
φ
Main Theorem: If the ODEs are accurate, then
- Our formal proofs transfer from the non-deterministic model to the learned (deterministic) policy
Use a theorem prover to extract: (init→[{{accel∪brake};ODEs}*](safe))
φ
SLIDE 59 (safe?) Policy
Learning to Safely Resolve Non-determinism
⇨
Observe & compute reward
φ
Main Theorem: If the ODEs are accurate, then
- Our formal proofs transfer from the non-deterministic model to the learned (deterministic) policy via the model monitor.
Use a theorem prover to extract: (init→[{{accel∪brake};ODEs}*](safe))
φ
SLIDE 60 What about the physical model?
⇨
Observe & compute reward
φ
Use a theorem prover to extract: (init→[{{accel∪brake};ODEs}*](safe))
φ
(safe?) Policy
{pos’=vel, vel’=acc} ≠ the real physical system
SLIDE 61 What About the Physical Model?
Observe & compute reward {brake, accel, turn}
SLIDE 62 What About the Physical Model?
Observe & compute reward {brake, accel, turn}
Model is accurate.
SLIDE 63 What About the Physical Model?
Observe & compute reward {brake, accel, turn}
Model is accurate.
SLIDE 64 What About the Physical Model?
Observe & compute reward {brake, accel, turn}
Model is accurate.
Model is inaccurate.
SLIDE 65 What About the Physical Model?
Observe & compute reward {brake, accel, turn}
Model is accurate.
Model is inaccurate: obstacle!
SLIDE 66 What About the Physical Model?
Observe & compute reward {brake, accel, turn}
Expected vs. Reality
SLIDE 67 Speculation is Justified
Observe & compute reward {brake, accel, turn}
Expected (safe) vs. Reality (crash!)
SLIDE 68 Leveraging Verification Results to Learn Better
Observe & compute reward {brake, accel, turn}
Use a real-valued version of the model monitor as a reward signal
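One way to read this as code (a sketch; the combination rule and the model_margin name are assumptions): keep the ordinary task reward while the observed transition fits the verified model, and otherwise use the real-valued monitor margin so speculation is steered back toward modeled states.

```python
def jsc_reward(task_reward, model_margin):
    """Combine the environment reward with the quantitative model monitor.

    model_margin >= 0 means the observed transition is explained by the
    verified model; negative values measure how far off-model we are.
    """
    if model_margin >= 0:
        return task_reward   # on-model: learn the task as usual
    return model_margin      # off-model: larger margin = closer to the model
```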
SLIDE 69
Safe RL: How?
Details: ☐ Detect modeled vs unmodeled state space correctly at runtime. ☐ Convert monitors into reward signals
SLIDE 70 Detecting Unmodeled State Space
The ModelPlex algorithm, implemented using Bellerophon, generates verified runtime monitors.
[x:=t]f(x) ↔ f(t)
[a;b]P ↔ [a][b]P
[a∪b]P ↔ ([a]P & [b]P)
[x’=f&Q]P → (Q → P)
...
[Stack diagram: Axiom Base; KeYmaera X Core; Q.E.D.; Programming Languages; Standard Library; ModelPlex]
SLIDE 71 Detecting Unmodeled State Space
oldPos := read_sensor(GPS)
actuate(accel)
newPos := read_sensor(GPS)
if (∃t. model_after(t) == newPos):
    # No model deviation.
else:
    # Model deviation…?
SLIDE 72 Detecting Unmodeled State Space
oldPos := read_sensor(GPS)
actuate(accel)
newPos := read_sensor(GPS)
if (∃t. model_after(t) == newPos):
    # No model deviation.
else:
    # Model deviation…?
SLIDE 73 Detecting Unmodeled State Space
oldPos := read_sensor(GPS)
actuate(accel)
newPos := read_sensor(GPS)
if (QE(∃t. model_after(t) == newPos)):
    # No model deviation.
else:
    # Model deviation…?
SLIDE 74
Safe RL: How?
Details: Runtime monitoring separates modeled from unmodeled state space. ☐ Convert monitors into reward signals
SLIDE 75
Safe RL: How?
Details: Runtime monitoring separates modeled from unmodeled state space. ☐ Convert monitors into reward signals: (ℝⁿ→𝔹) → (ℝⁿ→ℝ)!?
SLIDE 76
An Example
init → [{ {?safeAccel;accel ∪ brake ∪ ?safeMaintain; maintainVel}; {pos’ = vel, vel’ = acc, t’=1} }*]safe
SLIDE 77 An Example Monitor
init → [{ {?safeAccel;accel ∪ brake ∪ ?safeMaintain; maintainVel}; {pos’ = vel, vel’ = acc, t’=1} }*]safe
(t_post ≥ 0 ∧ a_post = acc ∧ v_post = acc·t_post + v ∧ p_post = acc·t_post²/2 + v·t_post + p) ∨ (t_post ≥ 0 ∧ a_post = 0 ∧ v_post = v ∧ p_post = v·t_post + p) ∨ etc.
SLIDE 78 An Example Monitor
init → [{ {?safeAccel;accel ∪ brake ∪ ?safeMaintain; maintainVel}; {pos’ = vel, vel’ = acc, t’=1} }*]safe
(t_post ≥ 0 ∧ a_post = acc ∧ v_post = acc·t_post + v ∧ p_post = acc·t_post²/2 + v·t_post + p) ∨ (t_post ≥ 0 ∧ a_post = 0 ∧ v_post = v ∧ p_post = v·t_post + p) ∨ etc.
SLIDE 79 An Example Monitor
init → [{ {?safeAccel;accel ∪ brake ∪ ?safeMaintain; maintainVel}; {pos’ = vel, vel’ = acc, t’=1} }*]safe
(t_post ≥ 0 ∧ a_post = acc ∧ v_post = acc·t_post + v ∧ p_post = acc·t_post²/2 + v·t_post + p) ∨ (t_post ≥ 0 ∧ a_post = 0 ∧ v_post = v ∧ p_post = v·t_post + p) ∨ etc.
SLIDE 80 An Example Monitor
init → [{ {?safeAccel;accel ∪ brake ∪ ?safeMaintain; maintainVel}; {pos’ = vel, vel’ = acc, t’=1} }*]safe
(t_post ≥ 0 ∧ a_post = acc ∧ v_post = acc·t_post + v ∧ p_post = acc·t_post²/2 + v·t_post + p) ∨ (t_post ≥ 0 ∧ a_post = 0 ∧ v_post = v ∧ p_post = v·t_post + p) ∨ etc.
- QE for real-closed fields
- ODE solutions backed by proofs
Quantitative monitor as reward signal
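As a sketch of the 𝔹-to-ℝ conversion on the monitor above (a rendering of the idea, not ModelPlex's actual output): conjunction becomes min, disjunction becomes max, s ≥ t becomes the margin s - t, and an equation contributes -|lhs - rhs|, so the result is nonnegative exactly when the boolean monitor holds.

```python
def monitor_margin(t_post, a_post, v_post, p_post, acc, v, p):
    """Real-valued version of the two monitor branches shown above."""
    def eq(lhs, rhs):
        return -abs(lhs - rhs)  # 0 iff equal, negative otherwise

    accel_branch = min(
        t_post,                                           # t_post >= 0
        eq(a_post, acc),                                  # a_post = acc
        eq(v_post, acc * t_post + v),                     # v_post = acc*t_post + v
        eq(p_post, acc * t_post**2 / 2 + v * t_post + p)  # position branch
    )
    maintain_branch = min(
        t_post,
        eq(a_post, 0.0),
        eq(v_post, v),
        eq(p_post, v * t_post + p)
    )
    return max(accel_branch, maintain_branch)
```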
SLIDE 81
Safe RL: How?
Details: Runtime monitoring separates modeled from unmodeled state space. Convert monitors into gradients: (ℝⁿ→𝔹) → (ℝⁿ→ℝ)
SLIDE 82
Safe RL: How?
Details: Runtime monitoring separates modeled from unmodeled state space. Convert models into gradients: ModelPlex (ℝⁿ→𝔹) → (ℝⁿ→ℝ)
SLIDE 83 Learning to Safely Resolve Non-determinism
⇨
Observe & compute reward
(safe?) Policy
φ
Use a theorem prover to extract: (init→[{{accel∪brake};ODEs}*](safe))
φ
SLIDE 84 Learning to Safely Handle Multiple Models
⇨
Observe & compute reward
(safe?) Policy
(init→[{ctrlᵢ;ODEᵢ}*](safe)) φᵢ
SLIDE 85 Learning to Safely Handle Multiple Models
⇨
Observe & compute reward
(safe?) Policy
φ₁, φ₂, …, φₙ
(init→[{ctrlᵢ;ODEᵢ}*](safe)) φᵢ
SLIDE 86 Learning to Safely Handle Multiple Models
⇨
Observe & compute reward
(safe?) Policy
φ₁, φ₂, …, φₙ
(init→[{ctrlᵢ;ODEᵢ}*](safe)) φᵢ
Monitor Conjunction
SLIDE 87 Learning to Safely Handle Multiple Models
⇨
Observe & compute reward
(safe?) Policy
φ₁, φ₂, …, φₙ
(init→[{ctrlᵢ;ODEᵢ}*](safe)) φᵢ
Differentiating Experiment
SLIDE 88 Learning to Safely Handle Multiple Models
⇨
Observe & compute reward
(safe?) Policy
(init→[{ctrlᵢ;ODEᵢ}*](safe)) φᵢ
φ₂, …, φₙ
Differentiating Experiment
SLIDE 89 Learning to Safely Handle Multiple Models
⇨
Observe & compute reward
(safe?) Policy
(init→[{ctrlᵢ;ODEᵢ}*](safe)) φᵢ
φₙ
Differentiating Experiment
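A sketch of the multiple-model loop these slides describe (a reading of the slides; the dictionary layout and monitor signatures are illustrative): an action counts as safe only if every surviving model's monitor accepts it, and observed transitions act as differentiating experiments that eliminate models whose monitors they falsify.

```python
def action_is_safe(models, state, action):
    """Monitor conjunction: safe only if all candidate models agree."""
    return all(m["ctrl_monitor"](state, action) for m in models.values())

def eliminate_models(models, state, action, next_state):
    """Keep only models whose model monitor explains the observed transition."""
    return {mid: m for mid, m in models.items()
            if m["model_monitor"](state, action, next_state)}
```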
SLIDE 90 Conclusion
KeYmaera X + Justified Speculative Control provide strong safety guarantees for learning-enabled CPS.
- 1. Was the proof correct?
- 2. Was the model accurate enough? (model ≠ reality)
SLIDE 91 Conclusion
DI Axiom: [{x'=f&Q}]P ↔ ([?Q]P ← (Q → [{x'=f&Q}](P)'))
Example: [v'=r·p·v²-g, t'=1] v ≥ v₀ - g·t ↔ … ↔ [v':=r·p·v²-g][t':=1] v' ≥ -g·t' ↔ r·p·v²-g ≥ -g ↔ H → r·p ≥ 0
Side derivation: (v ≥ v₀ - g·t)' ↔ … ↔ … ↔ …
dI Tactic: H = r·p ≥ 0 & r·a ≥ 0 & g > 0 & …
[Diagram: Axioms at the base, the KeYmaera X (KyX) core, and QED, supported by ODE & controls tooling and clever Bellerophon programs]
KeYmaera X + Justified Speculative Control provide strong safety guarantees for learning-enabled CPS.
- 1. Was the proof correct? KeYmaera X
- 2. Was the model accurate enough?
SLIDE 92 Conclusion
KeYmaera X + Justified Speculative Control provide strong safety guarantees for learning-enabled CPS.
- 1. Was the proof correct? KeYmaera X
- 2. Was the model accurate enough? Justified Speculation
SLIDE 93 Conclusion
KeYmaera X + Justified Speculative Control provide strong safety guarantees for learning-enabled CPS.
- 1. Was the proof correct? KeYmaera X
- 2. Was the model accurate enough? Justified Speculation
- 3. With multiple possible models? µ-learning
- 4. When off-model? Verification-preserving model update
SLIDE 94
Acknowledgments
SLIDE 95