Deterministic Memory Abstraction and Supporting Multicore System Architecture
Farzad Farshchi$, Prathap Kumar Valsan^, Renato Mancuso*, Heechul Yun$
$ University of Kansas, ^ Intel, * Boston University
Deterministic Memory Abstraction and Supporting Multicore System - - PowerPoint PPT Presentation
Deterministic Memory Abstraction and Supporting Multicore System Architecture Farzad Farshchi $ , Prathap Kumar Valsan ^ , Renato Mancuso * , Heechul Yun $ $ University of Kansas, ^ Intel, * Boston University Multicore Processors in CPS
Farzad Farshchi$, Prathap Kumar Valsan^, Renato Mancuso*, Heechul Yun$
$ University of Kansas, ^ Intel, * Boston University
2 Audi zFAS platform
Google/Waymo self-driving car DJI drone Audi A8
Die-shot of a multicore processor
3
Core1 Core2 Core3 Core4 Memory Controller (MC) Shared Cache DRAM Task 1 Task 2 Task 3 Task 4 I D I D I D I D
4
DRAM LLC Core1 Core2 Core3 Core4
subject co-runner(s)
IsolBench EEMBC, SD-VBS on Tegra TK1
5
Silberschatz et al., “Operating System Concepts.”
6
7
Inter-core interference Inherent timing legend Best-effort memory Deterministic memory Worst-case memory delay
8
Space allocation Request scheduling WCET bounds Deterministic memory Dedicated resources Predictability focused Tight Best-effort memory Shared resources Performance focused Pessimistic
Space allocation: e.g., cache space, DRAM banks Request scheduling: e.g., DRAM controller, shared buses
9
Application view (logical) System-level view (physical)
Deterministic memory Best-effort memory
Deterministic Memory-Aware Memory Hierarchy
Core1 Core2 Core3 Core4
W 5 W 1 W 2 W 3 W 4
I D I D I D I D
B 1 B 2 B 3 B 4 B 5 B 6 B 7 B 8 DRAM banks Cache ways
10
OS MMU/TLB
Page table entry DM bit=1|0
LLC MC
Paddr, DM Paddr, DM Paddr, DM Vaddr DRAM cmd/addr.
hardware software
11
Way 0 Way 1 Way 2 Way 3
Core 0 partition Core 1 partition
best-effort line (any core, DM=0) deterministic line (Core0, DM=1) deterministic line (Core1, DM=1)
Way 4
shared partition
Set 0 Set 1 Set 2 Set 3
DM Tag Line data
12
Determinism focused scheduling (Round-Robin) Throughput focused scheduling (FR-FCFS) yes no Deterministic memory request exist?
(a) Memory controller (MC) architecture (b) Scheduling algorithm
(*) P. Valsan, et al., “MEDUSA: A Predictable and High-Performance DRAM Controller for Multicore based Embedded Systems.” CPSNA, 2015
ARMv7 page table entry (PTE)
13 (*) H. Yun et. al, “PALLOC: DRAM Bank-Aware Memory Allocator for Performance Isolation on Multicore Platforms.” RTAS’14
14
15
16
svm L1 miss distribution
17
18
DRAM LLC Core1 Core2 Core3 Core4
RT co-runner(s)
19
20
21
22
23
Disclaimer: Our research has been supported by the National Science Foundation (NSF) under the grant number CNS 1718880
24