Cache attacks: From side channels to fault attacks - Clémentine Maurice - PowerPoint PPT Presentation
Cache attacks: From side channels to fault attacks. Clémentine Maurice, CNRS, Rennes, France. November 16, 2017, Cryptacus Workshop, Nijmegen, Netherlands
Cache attacks
- side-channel attacks without physical access to the device
- the attacker only needs unprivileged execution on the victim’s machine
- realistic scenario with cloud environments, smartphone apps, JavaScript running on webpages…
- first practical attacks presented in 2003
- the micro-architecture has changed a lot since then, and so have the attacks
- Y. Tsunoo, T. Saito, and T. Suzaki. “Cryptanalysis of DES implemented on computers with cache”. In: CHES’03. 2003, pp. 62–76.
2
Outline
- cache side-channel attack techniques
- a few words on applications
- challenges in modern architectures
- lessons learned and how to apply this to fault attacks on DRAM
3
Intel CPUs
2008 Nehalem, 2012 Sandy Bridge, 2013 Ivy Bridge, 2014 Haswell, 2015 Broadwell, 2016 Skylake
- new microarchitectures yearly
- performance improvement ≈ 5%
- very small optimizations: caches, branch prediction…
- no documentation on this intellectual property
- side channels come from these optimizations
4
Caches on Intel CPUs
core 0 (L1, L2) | core 1 (L1, L2) | core 2 (L1, L2) | core 3 (L1, L2) | ring bus | LLC slice 0 | LLC slice 1 | LLC slice 2 | LLC slice 3
- L1 and L2 are private
- last-level cache (LLC)
- divided into slices
- shared across cores
- inclusive
5
Set-associative caches
Address: Tag (bits 31-17) | Index (bits 16-6) | Offset (bits 5-0)
- data is loaded in a specific cache set, depending on its address
- several ways per set (e.g., way 0 … way 3)
- a cache line is loaded in a specific way, depending on the replacement policy
6
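The address split above can be sketched in a few lines of Python. This is a minimal sketch: the 64-byte line size (6 offset bits) and 2048 sets (11 index bits) are illustrative values matching the layout on the slide.

```python
# Decompose a 32-bit address into (tag, set index, line offset),
# matching the slide's layout: offset = bits 5-0, index = bits 16-6, tag = bits 31-17.
LINE_BITS = 6   # 64-byte cache lines -> 6 offset bits
SET_BITS = 11   # 2048 cache sets    -> 11 index bits

def decompose(addr: int):
    offset = addr & ((1 << LINE_BITS) - 1)            # which byte in the line
    index = (addr >> LINE_BITS) & ((1 << SET_BITS) - 1)  # which cache set
    tag = addr >> (LINE_BITS + SET_BITS)              # identifies the line in the set
    return tag, index, offset

print([hex(x) for x in decompose(0x12345678)])  # ['0x91a', '0x159', '0x38']
```

Two addresses with the same index bits land in the same set, which is exactly what an attacker building eviction sets exploits later in this talk.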
Timing differences
Histogram of memory access times (in CPU cycles) over many accesses: cache hits cluster at low latencies, cache misses at clearly higher latencies.
7
Cache attacks
- cache attacks → exploit timing differences of memory accesses
- the attacker monitors which cache lines are accessed, not their content
- covert channel: two processes communicating with each other
- although not allowed to do so, e.g., across VMs
- side-channel attack: one malicious process spies on benign processes
- e.g., steals crypto keys, spies on keystrokes
8
Cache attack techniques
- two (main) techniques
- 1. Flush+Reload (Gullasch et al., Osvik et al., Yarom et al.)
- 2. Prime+Probe (Percival, Osvik et al., Liu et al.)
- exploitable on x86 and ARM
- D. Gullasch, E. Bangerter, and S. Krenn. “Cache Games – Bringing Access-Based Cache Attacks on AES to Practice”. In: S&P’11. 2011.
- Y. Yarom and K. Falkner. “Flush+Reload: a High Resolution, Low Noise, L3 Cache Side-Channel Attack”. In: USENIX Security Symposium. 2014.
- D. A. Osvik, A. Shamir, and E. Tromer. “Cache Attacks and Countermeasures: the Case of AES”. In: CT-RSA 2006. 2006.
- C. Percival. “Cache missing for fun and profit”. In: Proceedings of BSDCan. 2005.
- F. Liu, Y. Yarom, Q. Ge, G. Heiser, and R. B. Lee. “Last-Level Cache Side-Channel Attacks are Practical”. In: S&P’15. 2015.
9
Cache attacks: Flush+Reload
Victim address space | Cache | Attacker address space
Step 1: Attacker maps a shared library (shared memory, in cache)
Step 2: Attacker flushes the shared cache line
Step 3: Victim loads the data
Step 4: Attacker reloads the data: a fast access means the victim loaded the line
10
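The four steps above can be sketched as a toy simulation. This is a minimal sketch on a made-up cache model (a set of cached line addresses): a real attack flushes with clflush and classifies reload latencies with a timer, but the protocol logic is the same. The names (`SHARED_LINE`, `victim`, …) are illustrative.

```python
# Toy Flush+Reload: the attacker learns the victim's accesses to a shared line.
cache = set()            # model of the cache: the lines currently cached
SHARED_LINE = 0x7F00     # a line of a shared library, mapped by both parties

def victim(accesses_line: bool):
    if accesses_line:
        cache.add(SHARED_LINE)   # Step 3: the victim's load caches the line

def flush(line):
    cache.discard(line)          # Step 2: attacker flushes the shared line

def reload(line):
    hit = line in cache          # Step 4: fast (hit) iff the victim touched it
    cache.add(line)
    return hit

observations = []
for secret_bit in [1, 0, 1]:
    flush(SHARED_LINE)
    victim(accesses_line=bool(secret_bit))
    observations.append(int(reload(SHARED_LINE)))

print(observations)  # [1, 0, 1]: the access pattern is recovered exactly
```

Note how the attacker recovers the pattern with no false positives, matching the next slide.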
What did the attacker learn?
Address: Tag (bits 31-17) | Index (bits 16-6) | Offset (bits 5-0)
- the victim accessed a particular cache line
- i.e., every bit of the address except the lower 6
- with almost no false positives
11
Flush+Reload: Applications
- cross-VM side-channel attacks on crypto algorithms
- RSA: 96.7% of secret key bits recovered from a single signature
- AES: full key recovery in 30,000 decryptions (a few seconds)
- Cache Template Attacks: automatically find information leakage
→ side channel on keystrokes and on an AES T-table implementation
- Y. Yarom and K. Falkner. “Flush+Reload: a High Resolution, Low Noise, L3 Cache Side-Channel Attack”. In: USENIX Security Symposium. 2014.
- B. Gülmezoglu, M. S. Inci, T. Eisenbarth, and B. Sunar. “A Faster and More Realistic Flush+Reload Attack on AES”. In: COSADE’15. 2015.
- D. Gruss, R. Spreitzer, and S. Mangard. “Cache Template Attacks: Automating Attacks on Inclusive Last-Level Caches”. In: USENIX Security Symposium. 2015.
https://github.com/IAIK/cache_template_attacks
12
What if there is no shared memory?
Inclusive property
core 0 (L1, L2) | core 1 (L1, L2) | LLC
- inclusive LLC: superset of L1 and L2
- data evicted from the LLC is also evicted from L1 and L2
- a core can evict lines in the private L1 of another core
14
Cache attacks: Prime+Probe
Victim address space | Cache | Attacker address space
Step 1: Attacker primes, i.e., fills, the cache (no shared memory)
Step 2: Victim evicts cache lines while running
Step 3: Attacker probes data to determine if the set has been accessed: a slow access means the victim evicted one of the attacker’s lines
15
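The three steps can also be sketched as a toy simulation, here on a single set of a made-up 4-way cache with LRU replacement (real LLC replacement is not LRU, as discussed later; this is only to show the protocol). All names are illustrative.

```python
# Toy Prime+Probe on one set of a 4-way set-associative cache with LRU.
WAYS = 4
cache_set = []                      # most-recently-used line is last

def access(line):
    """Load a line into the set; return True on a hit (fast access)."""
    hit = line in cache_set
    if hit:
        cache_set.remove(line)
    elif len(cache_set) == WAYS:
        cache_set.pop(0)            # evict the least-recently-used line
    cache_set.append(line)
    return hit

attacker_lines = [0xA0, 0xA1, 0xA2, 0xA3]   # congruent addresses (same set)

def prime():
    for line in attacker_lines:     # Step 1: fill the whole set
        access(line)

def probe():
    """Step 3: some probe is slow (a miss) iff the victim used this set."""
    return any(not access(line) for line in attacker_lines)

prime()
assert probe() is False             # idle victim: every probe is a fast hit

prime()
access(0xB0)                        # Step 2: a victim load evicts one attacker line
assert probe() is True              # a slow probe reveals the victim's access
```

No shared memory is involved: the attacker only times accesses to its own addresses, which is why this works across VMs, but with false positives (any activity in the set triggers it).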
What did the attacker learn?
Address: Tag (bits 31-17) | Index (bits 16-6) | Offset (bits 5-0)
- a program accessed cache lines mapping to the same cache set
- i.e., the index bits, ≈ 11 bits in modern last-level caches
- with false positives
16
Prime+Probe: Applications
- cross-VM side-channel attacks on crypto algorithms:
- ElGamal (sliding window): full key recovery in 12 min.
- tracking user behavior in the browser, in JavaScript
- covert channels between virtual machines in the cloud
- F. Liu, Y. Yarom, Q. Ge, G. Heiser, and R. B. Lee. “Last-Level Cache Side-Channel Attacks are Practical”. In: S&P’15. 2015.
- Y. Oren, V. P. Kemerlis, S. Sethumadhavan, and A. D. Keromytis. “The Spy in the Sandbox: Practical Cache Attacks in JavaScript and their Implications”. In: CCS’15. 2015.
- C. Maurice, M. Weber, M. Schwarz, L. Giner, D. Gruss, C. A. Boano, S. Mangard, and K. Römer. “Hello from the Other Side: SSH over Robust Cache Covert Channels in the Cloud”. In: NDSS’17. 2017.
17
Is that it?
Last-level cache addressing
Physical address: tag (bits 35-17) | set index (11 bits, bits 16-6) | line offset (bits 5-0). A hash function H of the physical address bits outputs 2 bits that select one of the 4 slices (slice 0 … slice 3).
19
Prime+Probe technical issues
- no need for, e.g., memory deduplication → more practical
- but requires:
- 1. eviction sets, i.e., addresses in the same cache set, in the same slice
- 2. actually evicting addresses, i.e., accessing addresses with some strategy
- issues:
- 1. the last-level cache addressing function is undocumented
- 2. the replacement policy is (mostly) undocumented
20
Reverse-engineering last-level cache addressing
We reverse-engineered this function! Intuition:
- 1. find some way to map one address to one slice
- 2. repeat for every address with a 64 B stride
- 3. infer a function out of it
21
Mapping addresses to slices with performance counters
- the uncore event UNC_CBO_CACHE_LOOKUP counts accesses to a slice; one counter (CBo 0 … CBo 3) per slice
- access an address repeatedly and read the counters to find its slice, e.g.:
- 0x3a0071010 → CBo 0 → slice 0
- 0x3a0071090 → CBo 2 → slice 2
- 0x3a00710d0 → CBo 3 → slice 3
22
Last-level cache linear functions
3 functions, depending on the number of cores. Each output bit of the slice hash is the XOR (⊕) of a fixed subset of the physical address bits (from bit 6 up to bit 37): 2 cores use one output bit o0, 4 cores use two bits (o0, o1), and 8 cores use three bits (o0, o1, o2).
- valid for Sandy Bridge, Ivy Bridge, Haswell, Broadwell, whether Core or Xeon
- different for 6, 10, 12… cores → non-linear
- different for Skylake
23
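Such a linear hash is easy to express in code: each output bit is the parity of the address ANDed with a bit mask. The masks below are illustrative placeholders, NOT Intel’s actual reverse-engineered functions (those differ per microarchitecture and core count); only the structure is meant to be faithful.

```python
# Sketch of a linear (XOR) slice hash for a hypothetical 4-core CPU:
# two output bits, each the parity of a subset of physical address bits.
O0_MASK = 0x1B5F575440   # ILLUSTRATIVE subset of address bits for o0
O1_MASK = 0x2EB5FAA880   # ILLUSTRATIVE subset of address bits for o1

def parity(x: int) -> int:
    return bin(x).count("1") & 1

def slice_of(paddr: int) -> int:
    o0 = parity(paddr & O0_MASK)
    o1 = parity(paddr & O1_MASK)
    return (o1 << 1) | o0          # slice index in 0..3

print(slice_of(0x3A0071010))
```

A useful property of any such linear function: `slice_of(a ^ b) == slice_of(a) ^ slice_of(b)`, which is exactly what makes the function recoverable from a modest number of measured (address, slice) pairs.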
Lessons learned from cache side-channel attacks
- undocumented hardware can be a problem, but not for long :)
- removing clflush does not address the root causes of vulnerabilities
- fixing crypto is (relatively) easy, but mitigating all cache attacks is hard
24
How do we make fault attacks out of that?
DRAM fault attacks
- we’re now exploring fault attacks on DRAM
- attack entirely in software, again no physical access
→ how can we flip bits without accessing them?
- we’ll conduct attacks on the cache to create the right conditions
- (but we’re not flipping bits in the cache)
26
Background: DRAM organization
Memory is organized in channels (channel 0, channel 1); each DIMM has ranks (front of DIMM: rank 0, back of DIMM: rank 1), and each rank consists of chips.
27
Background: DRAM organization
Each chip contains banks (e.g., bank 0), and each bank is an array of rows (row 0, row 1, row 2, … row 32767) plus a row buffer.
- bits are stored in cells, organized in rows
- access: activate a row, copy its content to the row buffer
28
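How a physical address selects a (channel, rank, bank, row) can be sketched as below. The bit positions are assumptions for illustration only: the real mapping is undocumented, controller-specific, and was itself reverse-engineered (it also XORs bits, much like the slice hash).

```python
# Hedged sketch of a DRAM address mapping; every bit position here is an
# ILLUSTRATIVE assumption, not a real memory controller's mapping.
def dram_map(paddr: int):
    channel = (paddr >> 6) & 0x1    # 1 channel-select bit (assumed)
    bank = (paddr >> 13) & 0x7      # 3 bank bits (assumed)
    rank = (paddr >> 16) & 0x1      # 1 rank bit (assumed)
    row = paddr >> 17               # high bits select the row (assumed)
    return channel, rank, bank, row

print(dram_map(0x3A0071010))
```

What matters for Rowhammer is that an attacker can find two addresses in the *same bank* but *different rows*; with a mapping like this, that means equal channel/rank/bank bits and different row bits.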
Software-Based Fault Attack: Rowhammer
Rowhammer (Kim et al., 2014): “It’s like breaking into an apartment by repeatedly slamming a neighbor’s door until the vibrations open the door you were after” – Motherboard Vice
In a DRAM bank, each access activates a row and copies it to the row buffer. Repeatedly activating (hammering) rows 1 and 3 disturbs the cells of the row in between → bit flips in row 2, without ever accessing it!
29
Impact of the CPU cache
CPU core → CPU cache → DRAM
- only non-cached accesses reach DRAM
- original attacks use the clflush instruction
→ flush the line from the cache → the next access will be served from DRAM
30
Rowhammer (with clflush)
Two aggressor addresses map to two rows of the same DRAM bank (and to cache set 1 and cache set 2). In a loop: clflush both lines, reload both (each reload misses the cache and activates its row in the bank), and repeat… wait for it… bit flip in the row between them!
31
Flush, reload, flush, reload…
- the core of Rowhammer is essentially a Flush+Reload loop
- it is as much an attack on the DRAM as on the cache
32
Rowhammer without clflush?
- idea: avoid clflush to be independent of specific instructions
→ no clflush in JavaScript
- our approach: use regular memory accesses for eviction
→ techniques from cache attacks! → Rowhammer, Prime+Probe style!
33
Rowhammer without clflush
Same setup: two aggressor rows in one DRAM bank, mapping to cache set 1 and cache set 2. In a loop: reload the two aggressor addresses, then evict them by loading other congruent addresses that fill both cache sets, and repeat… wait for it… bit flip!
34
Requirements for Rowhammer
- 1. uncached memory accesses: need to reach DRAM
- 2. fast memory accesses: race against the next row refresh
→ optimize the eviction rate and the timing
35
Meanwhile, in reality…
- existing eviction strategies assume that the replacement policy is LRU
- it’s not (and it’s undocumented)
- either we do not evict with a high enough probability, or we are too slow
→ no bit flip
Poking around, we learned two things:
- 1. adding more unique addresses can increase the eviction rate
- 2. multiple accesses to one address can increase the eviction rate
36
Cache eviction strategies: The beginning
Access pattern over addresses a1, a2, …, a9 over time → fast and effective on Haswell: eviction rate > 99.97%
37
Cache eviction strategy: New representation
- represent accesses as a sequence of numbers: 1, 2, 1, 2, 2, 3, 2, 3, 3, 4, 3, 4, …
- can be a long sequence
- all congruent addresses are indistinguishable w.r.t. the eviction strategy
→ adding more unique addresses can increase the eviction rate
→ multiple accesses to one address can increase the eviction rate
38
Cache eviction strategy: Notation (1)
Write eviction strategies as P-C-D-L-S:

    for (s = 0; s <= S - D; s += L)
        for (c = 1; c <= C; c += 1)
            for (d = 1; d <= D; d += 1)
                *a[s+d];

- S: total number of different addresses (= set size)
- D: different addresses per inner access loop
- L: step size of the outer loop
- C: number of repetitions of the inner access loop
Cache eviction strategy: Notation (2)

    for (s = 0; s <= S - D; s += L)
        for (c = 1; c <= C; c += 1)
            for (d = 1; d <= D; d += 1)
                *a[s+d];

- P-2-2-1-4 (S = 4, D = 2, C = 2, L = 1) → 1, 2, 1, 2, 2, 3, 2, 3, 3, 4, 3, 4
- P-1-1-1-4 → 1, 2, 3, 4 → LRU eviction with set size 4
Better eviction strategies
We evaluated more than 10,000 strategies...

strategy      # accesses   eviction rate   loop time
P-1-1-1-17    17           74.46% ✗        307 ns ✓
P-1-1-1-20    20           99.82% ✓        934 ns ✗
P-2-1-1-17    34           99.86% ✓        191 ns ✓
P-2-2-1-17    64           99.98% ✓        180 ns ✓

Executed in a loop, on a Haswell with a 16-way last-level cache
Evaluation on Haswell
[Figure: number of bit flips within 15 minutes vs. refresh interval in µs (BIOS configuration), for clflush, Evict (Native), and Evict (JavaScript)]
First remote software-induced fault attack from a browser, in JavaScript!
Lessons learned from Rowhammer.js
- DRAM vulnerable to the native attack ≠ vulnerable to the JavaScript version
- most DRAM modules vulnerable to the native attack are also vulnerable without clflush
→ removing clflush is not a countermeasure

What we should have learned from Rowhammer but apparently didn’t:
- nobody managed to exploit that yet ≠ we can’t exploit that
→ exploit by Bosman et al. after we released the code for Rowhammer.js
- E. Bosman, K. Razavi, H. Bos, and C. Giuffrida. “Dedup Est Machina: Memory Deduplication as an Advanced Exploitation Vector”. In: S&P’16. 2016.
Conclusions
- lessons from side-channel attacks are useful for investigating fault attacks
- micro-architectural attacks are hard to mitigate
- probably more software-based fault attacks to come