Verifiable ASICs: trustworthy hardware with untrusted components - PowerPoint PPT Presentation

Verifiable ASICs: trustworthy hardware with untrusted components Riad S. Wahby ◦ ⋆ , Max Howald † ⋆ , Siddharth Garg ⋆ , abhi shelat ‡ , and Michael Walfish ⋆ ◦ Stanford University ⋆ New York University † The Cooper Union ‡ The University of Virginia May 25 th , 2016

Untrusted manufacturers can craft hardware Trojans

Untrusted manufacturers can craft hardware Trojans Trusted fabrication is not a panacea: ✗ Only 5 countries have cutting-edge fabs on-shore ✗ Building a new fab takes $$$$$$, years of R&D ✗ An old fab could mean 10 8 × performance hit accounting for speed, chip area, and energy Can we get trust more cheaply?

Can we build Verifiable ASICs? Principal F → designs for P , V

Can we build Verifiable ASICs? Principal F → designs Trusted Untrusted for P , V fab (slow) fab (fast) builds V builds P

Can we build Verifiable ASICs? Principal F → designs Trusted Untrusted for P , V fab (slow) fab (fast) builds V builds P Integrator V P

Can we build Verifiable ASICs? Principal F → designs Trusted Untrusted for P , V fab (slow) fab (fast) builds V builds P Integrator input V P output

Can we build Verifiable ASICs? Principal F → designs Trusted Untrusted for P , V fab (slow) fab (fast) builds V builds P Integrator input x V y P output proof that y = F( x )

Can we build Verifiable ASICs? input x V y P vs. F output proof that y = F( x ) • Makes sense if V + P are cheaper than trusted F

Can we build Verifiable ASICs? input x V y P vs. F output proof that y = F( x ) • Makes sense if V + P are cheaper than trusted F • Reasons for hope: • running time of V < running time of F (asymptotically) • speed of cutting-edge fab might offset P ’s overheads

Can we build Verifiable ASICs? input x V y P vs. F output proof that y = F( x ) • Makes sense if V + P are cheaper than trusted F • Reasons for hope: • running time of V < running time of F (asymptotically) • speed of cutting-edge fab might offset P ’s overheads • Challenges remain: • Hardware issues: energy, chip area • Need physically realizable circuit design • V needs to save work at plausible computation sizes

Zebra: a hardware design that saves costs

A qualified success Zebra: a hardware design that saves costs. . . . . . sometimes.

Probabilistic proof systems, briefly input x V y P output proof that y = F( x ) F must be expressed as an arithmetic circuit (AC) generalized boolean circuit over F p ∨ → + ∧ → ×

Probabilistic proof systems, briefly input x V y P output proof that y = F( x ) F must be expressed as an arithmetic circuit (AC) AC satisfiable ⇐ ⇒ F was executed correctly P convinces V that the AC is satisfiable

Probabilistic proof systems, briefly input x V y P output proof that y = F( x ) Arguments [GGPR13, IPs SBVBPW13, PGHR13, BCTV14] [GKR08, CMT12, VSBW13] e.g., Muggles, CMT, Allspice e.g., Zaatar, Pinocchio, libsnark

Probabilistic proof systems, briefly input x V y P output proof that y = F( x ) Arguments [GGPR13, IPs SBVBPW13, PGHR13, BCTV14] [GKR08, CMT12, VSBW13] e.g., Muggles, CMT, Allspice e.g., Zaatar, Pinocchio, libsnark – “Quasi–straight line” F + F with RAM, complex control flow – Lots of V - P communication + Little V - P communication

Probabilistic proof systems, briefly input x V y P output proof that y = F( x ) Arguments [GGPR13, IPs SBVBPW13, PGHR13, BCTV14] [GKR08, CMT12, VSBW13] e.g., Muggles, CMT, Allspice e.g., Zaatar, Pinocchio, libsnark – “Quasi–straight line” F + F with RAM, complex control flow – Lots of V - P communication + Little V - P communication Unsuited to hardware ✗ implementation

Probabilistic proof systems, briefly input x V y P output proof that y = F( x ) Arguments [GGPR13, IPs SBVBPW13, PGHR13, BCTV14] [GKR08, CMT12, VSBW13] e.g., Muggles, CMT, Allspice e.g., Zaatar, Pinocchio, libsnark – “Quasi–straight line” F + F with RAM, complex control flow – Lots of V - P communication + Little V - P communication Suited to hardware Unsuited to hardware ✓ ✗ implementation implementation

Zebra builds on IPs of GKR [GKR08, CMT12, VSBW13] F must be expressed as a layered arithmetic circuit. Note: this is an abstraction of F, not a physical circuit!

Zebra builds on IPs of GKR [GKR08, CMT12, VSBW13] 1. V sends inputs

Zebra builds on IPs of GKR [GKR08, CMT12, VSBW13] 1. V sends inputs 2. P evaluates circuit

Zebra builds on IPs of GKR [GKR08, CMT12, VSBW13] 1. V sends inputs 2. P evaluates circuit, returns output y y

Zebra builds on IPs of GKR [GKR08, CMT12, VSBW13] 1. V sends inputs 2. P evaluates circuit, returns output y 3. V cross-examines P about the last layer

Zebra builds on IPs of GKR [GKR08, CMT12, VSBW13] 1. V sends inputs 2. P evaluates circuit, returns output y 3. V cross-examines P about the last layer, ends up with claim about second-last layer

Zebra builds on IPs of GKR [GKR08, CMT12, VSBW13] 1. V sends inputs 2. P evaluates circuit, returns output y 3. V cross-examines P about the last layer, ends up with claim about second-last layer 4. V iterates

Zebra builds on IPs of GKR [GKR08, CMT12, VSBW13] 1. V sends inputs 2. P evaluates circuit, returns output y 3. V cross-examines P about the last layer, ends up with claim about second-last layer 4. V iterates, ends up with claim about inputs

Zebra builds on IPs of GKR [GKR08, CMT12, VSBW13] 1. V sends inputs 2. P evaluates circuit, returns output y 3. V cross-examines P about the last layer, ends up with claim about second-last layer 4. V iterates, ends up with claim about inputs 5. V checks consistency with the inputs V ’s work ≈ O (depth · log width), so it saves work when width ≫ depth

Can we parallelize this interaction? Can V and P interact about all of F’s layers at once? No. V must ask questions in correct order or P can cheat!

Can we parallelize this interaction? Can V and P interact about all of F’s layers at once? No. V must ask questions in correct order or P can cheat! But: Zebra uses pipelining to parallelize several Fs.

Extracting parallelism through pipelining V questions P about F( x 1 )’s output layer. F( x 1 )

Extracting parallelism through pipelining V questions P about F( x 1 )’s output layer. Simultaneously, P returns F( x 2 ). F( x 1 ) F( x 2 )

Extracting parallelism through pipelining V questions P about F( x 1 )’s next layer F( x 1 )

Extracting parallelism through pipelining V questions P about F( x 1 )’s next layer, and F( x 2 )’s output layer. F( x 1 ) F( x 2 )

Extracting parallelism through pipelining V questions P about F( x 1 )’s next layer, and F( x 2 )’s output layer. Meanwhile, P returns F( x 3 ). F( x 1 ) F( x 2 ) F( x 3 )

Extracting parallelism through pipelining This process continues until the pipeline is full. F( x 1 ) F( x 2 ) F( x 3 ) F( x 4 )

Extracting parallelism through pipelining This process continues until the pipeline is full. F( x 1 ) F( x 2 ) F( x 3 ) F( x 4 ) F( x 5 )

Extracting parallelism through pipelining F( x 1 ) This process continues F( x 2 ) until the pipeline is full. F( x 3 ) V and P can complete one proof in each time F( x 4 ) step. F( x 5 ) F( x 6 ) F( x 7 ) F( x 8 )

Zebra’s design approach ✓ Extract parallelism e.g., pipelined proving

Zebra’s design approach ✓ Extract parallelism e.g., pipelined proving ✓ Exploit locality: distribute data and control e.g., no RAM: data is kept close to places it is needed e.g., latency-insensitive design: distributed state machine avoids bottlenecks associated with central controller

Zebra’s design approach ✓ Extract parallelism e.g., pipelined proving ✓ Exploit locality: distribute data and control e.g., no RAM: data is kept close to places it is needed e.g., latency-insensitive design: distributed state machine avoids bottlenecks associated with central controller ✓ Reduce, reuse, recycle e.g., computation: save energy by adding memoization to P e.g., hardware: save chip area by reusing the same circuits

Architectural challenges Interaction between V and P requires a lot of bandwidth ✗ V and P on circuit board? Too much energy, circuit area Protocol requires input-independent precomputation [Allspice13]

Verifiable ASICs: trustworthy hardware with untrusted components - PowerPoint PPT Presentation

Verifiable ASICs: trustworthy hardware with untrusted components Riad S. Wahby , Max Howald , Siddharth Garg , abhi shelat , and Michael Walfish Stanford University New York University The Cooper Union

When NOT to Use ASICs When NOT to Use ASICs Rick Van Berg HEPIC2013 When NOT to Use ASICs When

Verifiable ASICs: trustworthy hardware with untrusted components Riad S. Wahby , Max

Verifiable ASICs: trustworthy hardware with untrusted components Riad S. Wahby , Max

Trustworthy Computing * Reverse engineers agree on that! Trustworthy Computing Trustworthy

Confinement (Running Untrusted Programs) Chester Rebeiro Indian Institute of Technology Madras

ECON ASICs Gregory Deptuch, Zoltan Gecse, Jim Hirschauer, Sandeep Miryala, Paul Rubinov ASICs

VERIFIABLE DELAY FUNCTIONS Benjamin Wesolowski VERIFIABLE DELAY FUNCTIONS How to slow things

Hardware Observability Framework Hardware Observability Framework Hardware Observability

TCIPG TECHNICAL CLUSTERS AND THREADS Trustworthy Trustworthy Technologies for Wide Technologies

Get to ASICs Faster Get to ASICs Faster A Novel Mixed Signal Design Methodology Dr Greg

Placement Challenges for Structured Placement Challenges for Structured g g ASICs ASICs

SoC Design SoC Design g Lecture 4: Programmable ASICs L Lecture 4: Programmable ASICs L 4 P

ECON ASICs Jim Hirschauer, Ralph Wickwire ASICs PMG 11 Nov 2019 DOE CD-1 IPR and CERN P2UG

Verifiable ASICs Aarhus Workshop on Secure Multiparty Computation 1 June 2016 Michael Walfish

Generating Verifiable Java Code from Verified PVS Specifications NFM2012 Generating Verifiable

Verifiable Random Functions and Verifiable Delay Functions Caleb Smith University of

11-731 Machine Translation Speech 2 Speech Translation Speech Translation Three part systems

CAPITAMALL TRUST Singapores First & Largest REIT Annual General Meeting 16 April 2015

through Glo lobal Valu lue Chains: Methodological Innovations in measuring technological

next steps Srinath Setty Microsoft Research (Thanks to Michael Walfish for some of the slides.)

From Identification using Rejection Sampling to Signatures via the Fiat-Shamir Transform:

INCMAP: A JOURNEY TOWARDS ONTOLOGY -BASED DATA INTEGRATION CHRISTOPH PINKEL (MAIN AUTHOR),

Hadoop Jrg Mllenkamp Principal Field Technologist Sun Microsystems Agenda Introduction

TCP-Friendliness of SCTP and Concurrent Multipath Transfer (CMT) ILKNUR AYDIN ICCRG meeting

Verifiable ASICs: trustworthy hardware with untrusted components - PowerPoint PPT Presentation

Verifiable ASICs: trustworthy hardware with untrusted components Riad S. Wahby , Max Howald , Siddharth Garg , abhi shelat , and Michael Walfish Stanford University New York University The Cooper Union

When NOT to Use ASICs When NOT to Use ASICs Rick Van Berg HEPIC2013 When NOT to Use ASICs When

Verifiable ASICs: trustworthy hardware with untrusted components Riad S. Wahby , Max

Verifiable ASICs: trustworthy hardware with untrusted components Riad S. Wahby , Max

Trustworthy Computing * Reverse engineers agree on that! Trustworthy Computing Trustworthy

Confinement (Running Untrusted Programs) Chester Rebeiro Indian Institute of Technology Madras

ECON ASICs Gregory Deptuch, Zoltan Gecse, Jim Hirschauer, Sandeep Miryala, Paul Rubinov ASICs

VERIFIABLE DELAY FUNCTIONS Benjamin Wesolowski VERIFIABLE DELAY FUNCTIONS How to slow things

Hardware Observability Framework Hardware Observability Framework Hardware Observability

TCIPG TECHNICAL CLUSTERS AND THREADS Trustworthy Trustworthy Technologies for Wide Technologies

Get to ASICs Faster Get to ASICs Faster A Novel Mixed Signal Design Methodology Dr Greg

Placement Challenges for Structured Placement Challenges for Structured g g ASICs ASICs

SoC Design SoC Design g Lecture 4: Programmable ASICs L Lecture 4: Programmable ASICs L 4 P

ECON ASICs Jim Hirschauer, Ralph Wickwire ASICs PMG 11 Nov 2019 DOE CD-1 IPR and CERN P2UG

Verifiable ASICs Aarhus Workshop on Secure Multiparty Computation 1 June 2016 Michael Walfish

Generating Verifiable Java Code from Verified PVS Specifications NFM2012 Generating Verifiable

Verifiable Random Functions and Verifiable Delay Functions Caleb Smith University of

11-731 Machine Translation Speech 2 Speech Translation Speech Translation Three part systems

CAPITAMALL TRUST Singapores First &amp; Largest REIT Annual General Meeting 16 April 2015

through Glo lobal Valu lue Chains: Methodological Innovations in measuring technological

next steps Srinath Setty Microsoft Research (Thanks to Michael Walfish for some of the slides.)

From Identification using Rejection Sampling to Signatures via the Fiat-Shamir Transform:

INCMAP: A JOURNEY TOWARDS ONTOLOGY -BASED DATA INTEGRATION CHRISTOPH PINKEL (MAIN AUTHOR),

Hadoop Jrg Mllenkamp Principal Field Technologist Sun Microsystems Agenda Introduction

TCP-Friendliness of SCTP and Concurrent Multipath Transfer (CMT) ILKNUR AYDIN ICCRG meeting

CAPITAMALL TRUST Singapores First & Largest REIT Annual General Meeting 16 April 2015