Confidential Accelerators
Stavros Volos Microsoft Research
Confidential Accelerators Stavros Volos Microsoft Research - - PowerPoint PPT Presentation
Confidential Accelerators Stavros Volos Microsoft Research Accelerators Play Pivotal Role in Cloud GPU, FPGA, AI Accelerators (e.g., Brainwave, TPU) Increasing compute performance and bandwidth E.g., Nvidia V100 offers 14 TFLOPS & ~1
Stavros Volos Microsoft Research
GPU, FPGA, AI Accelerators (e.g., Brainwave, TPU) Increasing compute performance and bandwidth
But, not designed with confidential computing primitives
Operating System App Hypervisor Hardware App Data Code
Runtim e
Data Code Data Code
attackers
‡ Stavros Volos, Kapil Vaswani, and Rodrigo Bruno. Graviton: Trusted Execution Environment on GPUs,
OSDI’18
GPU
CPU Host Memory PCIe Bridge High bandwidth memory PCIe host interface Memory controller Command processor DMA engines L2 cache
PCIe Bus System Bus
Trusted,
e memory
Execution is context-based
Kernels operate on data in device memory
GPU cores and DMA engines controlled via command channels
Context CPU GPU
Device memory
DMA buffer Command buffer
Context CPU GPU
Device memory
Channels Command buffer
abstraction for context isolation
Page directory Page tables DMA buffer
Key idea: TEE takes the form of secure context
Modest hardware complexity: extensions limited to command processor
Software extensions limited to GPU runtime and GPU driver
Complexity hidden behind CUDA programming model
GPU
CPU Host Memory PCIe Bridge High bandwidth memory PCIe host interface Memory controller Command processor DMA engines L2 cache
PCIe Bus System Bus
PUF, ECDSA key gen, signing Attestation, isolation, secure command submission Blocks MMIO accesses Security module
Current cloud trends
Trusted execution environment for accelerators