
SLIDE 1

Proximity-Aware Directory-based Coherence for Multi-core Processor Architectures

Jeff Brown Rakesh Kumar Dean Tullsen

UC San Diego ● University of Illinois at Urbana-Champaign

SPAA19 ● June 9, 2007

SLIDE 2

Introduction

The chip multiprocessor (CMP) era is upon us!

Caching complicates writes

Cache coherence ensures caching is done safely

Multi-core designs offer new tradeoffs

[Figure: processor (P) and memory (M) layouts: P M P M; P P M M]

SLIDE 5

Background: Directory-based Cache Coherence

Directory-based; explicit per-block accounting

– Doesn't rely on broadcasts

Directory operation: client/server

– Processors request data, permissions

– Directory controllers manage memory access

Updates, conflicts

[Figure: processors (P), memory (M), and directory (Dir)]
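
The client/server operation described above can be sketched as a toy model. This is a minimal illustration, not the protocol from the talk: the `Directory` class, its method names, and the three-state scheme (uncached, shared, modified) are all assumptions chosen for brevity.

```python
from dataclasses import dataclass, field
from enum import Enum


class State(Enum):
    UNCACHED = "uncached"
    SHARED = "shared"
    MODIFIED = "modified"


@dataclass
class Entry:
    """Per-block accounting: coherence state plus the set of caching processors."""
    state: State = State.UNCACHED
    sharers: set = field(default_factory=set)


class Directory:
    """Hypothetical directory controller; all names here are illustrative."""

    def __init__(self):
        self.entries = {}  # block address -> Entry

    def read(self, proc, block):
        """Processor requests data with read permission.

        Returns the processors that must supply or write back the block first."""
        entry = self.entries.setdefault(block, Entry())
        downgraded = []
        if entry.state is State.MODIFIED:
            # Conflict: the current owner holds the only up-to-date copy.
            downgraded = sorted(entry.sharers - {proc})
        entry.sharers.add(proc)
        entry.state = State.SHARED
        return downgraded

    def write(self, proc, block):
        """Processor requests write permission.

        Returns the processors whose copies must be invalidated. Only the
        recorded sharers are contacted, so no broadcast is needed."""
        entry = self.entries.setdefault(block, Entry())
        invalidated = sorted(entry.sharers - {proc})
        entry.sharers = {proc}
        entry.state = State.MODIFIED
        return invalidated
```

Because the directory records exactly who holds each block, a write by processor 2 to a block cached by processors 0 and 1 produces invalidations for those two processors only.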

SLIDE 11

Background: Historical MP Cache Coherence

Distributed directory, memory

[Figure: four processor/memory (P M) nodes. Build sequence: Cache Miss, "Home Node", Data Request, Reply]
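
The miss sequence in the figure (cache miss, home node lookup, data request, reply) can be sketched as follows. The block-interleaved address mapping, the 64-byte block size, and the function names are assumptions for illustration; the slides do not say how addresses are assigned to home nodes.

```python
BLOCK_BYTES = 64  # assumed cache-block size
NUM_NODES = 4     # four P/M pairs, as drawn in the figure


def home_node(addr, num_nodes=NUM_NODES, block_bytes=BLOCK_BYTES):
    """Map a physical address to the node whose memory and directory own it.

    Block interleaving is an assumption; a real machine might interleave
    at page granularity or use another mapping."""
    return (addr // block_bytes) % num_nodes


def miss_messages(requester, addr):
    """List the (source, destination, kind) messages for a simple miss
    that the home node can satisfy from its own memory."""
    home = home_node(addr)
    if home == requester:
        return []  # local memory: no interconnect traffic
    return [
        (requester, home, "Data Request"),
        (home, requester, "Reply"),
    ]
```

In this model every remote miss costs one round trip to the home node, regardless of which other nodes may already cache the block.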

SLIDE 17

Motivation: Multi-core Cache Coherence

[Figure: processors (P) and memories (M) in a multi-core layout (M M P M P P P M). Build sequence: Cache Miss, "Home Node", Data Request, Reply, Additional Sharer]

Multi-core designs present radically different relative latency & bandwidth

SLIDE 26

Outline

Introduction & Background
System Architecture
Proximity-Aware Coherence
Results
Conclusion

SLIDE 27

Directory-based Cache Coherence

Directory structures

– Directory Memory

– Directory Entries

– Directory Controller

[Figure: Main Memory, Directory Memory, Controller]
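
One common way to lay out these structures is a directory memory holding one entry per block of main memory, with each entry packing a sharer bit vector and a dirty flag. The sketch below assumes that full-map layout; the slides do not specify the entry format, and the class and method names are hypothetical.

```python
class DirectoryMemory:
    """Directory memory: one entry per block of main memory (assumed layout).

    Each entry is packed into an int: the low `num_procs` bits are a
    sharer bit vector and the next bit up is a dirty flag."""

    def __init__(self, num_blocks, num_procs):
        self.num_procs = num_procs
        self.entries = [0] * num_blocks

    def add_sharer(self, block, proc):
        self.entries[block] |= 1 << proc

    def set_dirty(self, block, dirty):
        mask = 1 << self.num_procs
        if dirty:
            self.entries[block] |= mask
        else:
            self.entries[block] &= ~mask

    def is_dirty(self, block):
        return bool((self.entries[block] >> self.num_procs) & 1)

    def sharers(self, block):
        entry = self.entries[block]
        return [p for p in range(self.num_procs) if (entry >> p) & 1]

    def bits_per_entry(self):
        # One presence bit per processor plus the dirty bit.
        return self.num_procs + 1
```

Under these assumptions, a 16-processor system needs 17 directory bits per block, which is the kind of per-block cost the directory controller manages on top of main memory.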

SLIDE 33

A Traditional Multiprocessor

[Figure: Core, L2 $, Dir, Mem … joined by an Interconnect (chassis, board, etc.)]

SLIDE 35

A Traditional Multiprocessor

Dir $

Mem. channel

Tile Tile 1 Tile 15 ...

SLIDE 42

Outline

Introduction & Background
System Architecture
Proximity-Aware Coherence
Results
Conclusion

SLIDE 43

Proximity-Aware Coherence

Idea: home node asks sharer nearest requester to forward its cached copy

– Stay on-chip when possible
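
The selection step in this idea can be sketched as a nearest-sharer search at the home node. The 4x4 row-major mesh, the Manhattan-distance metric, and the function names are assumptions for illustration; the slides state only that the sharer nearest the requester is asked to forward its copy.

```python
MESH_WIDTH = 4  # assumed 4x4 mesh of 16 tiles


def hops(a, b, width=MESH_WIDTH):
    """Manhattan distance between tiles a and b on a row-major 2D mesh."""
    ax, ay = a % width, a // width
    bx, by = b % width, b // width
    return abs(ax - bx) + abs(ay - by)


def choose_supplier(requester, sharers):
    """Return the sharer tile closest to the requester, or None when no
    other tile holds a copy and the data must come from memory."""
    candidates = [s for s in sharers if s != requester]
    if not candidates:
        return None
    # Ties go to the lowest tile number so the choice is deterministic.
    return min(candidates, key=lambda s: (hops(requester, s), s))
```

With sharers on tiles 0, 6, and 15 and a requester on tile 5, this picks tile 6, which is one hop away. When the sharer set is empty, the `None` result corresponds to the case where staying on-chip is not possible.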

SLIDE 45