

SLIDE 1

DRAIN: Distributed Recovery Architecture for Inaccessible Nodes in Multi-core Chips

Andrew DeOrio†, Konstantinos Aisopos‡§, Valeria Bertacco†, Li-Shiuan Peh§

DAC 2011

†University of Michigan ‡Princeton University §Massachusetts Institute of Technology

SLIDE 2

Reliable Networks on Chip

[Figure: a network-on-chip tile — router (R), processor (µP), cache ($)]

  • Detect if a fault has occurred
  • Diagnose where the fault has occurred
  • Reconfigure the network to account for the fault
  • Recover and resume normal operation

[Figure: reliability cycle — detection → diagnosis → reconfiguration → recovery; fault-tolerant routing covers reconfiguration, Drain covers recovery]

When nodes become disconnected, data is lost!

SLIDE 3

Previous Recovery Solutions

  • Checkpoint approaches: ReVive [Prvulovic et al. '02], SafetyNet [Sorin et al. '02]

– Data can be stuck in a checkpoint buffer when its node is disconnected
– Checkpointing to main memory (MEM): high performance overhead

  • Drain takes a reactive approach, incurring performance overhead only when errors occur

SLIDE 4

Data Recovery with Drain

  • Recover data lost during reconfiguration

– Emergency links provide an alternate path
– Transfer cache contents and architectural state

[Figure: mesh of nodes (µP, $, router) plus a memory controller (Mem); primary links are 32-128 wires, DRAIN emergency links are 2 wires and power gated]
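Because an emergency link is only 2 wires wide, against 32-128 for a primary link, the saved data has to be streamed out serially. A minimal software model of that serialization (the wire roles here — one data bit plus one valid signal — are an assumption for illustration, not the paper's exact protocol):

```python
def serialize(word, width):
    """Parallel-to-serial: emit (valid, data_bit) pairs, LSB first."""
    for i in range(width):
        yield (1, (word >> i) & 1)    # valid=1 while a bit is on the wire
    yield (0, 0)                      # valid=0 marks the end of the transfer

def deserialize(stream):
    """Serial-to-parallel: reassemble bits while valid stays asserted."""
    word = 0
    for i, (valid, bit) in enumerate(stream):
        if not valid:
            break
        word |= bit << i
    return word

# A 32-bit word takes 32 cycles on the narrow link instead of 1.
line = 0xDEADBEEF
assert deserialize(serialize(line, 32)) == line
```

The trade-off this models is the one on the slide: the emergency link is far slower per word, but cheap enough (2 wires, power gated when idle) to attach to every node.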

SLIDE 5

Drain Example

[Figure: 2×2 mesh of µP/$ nodes, one attached to memory (M); legend: router, emergency link, primary link]

SLIDE 6

Drain Example

[Figure: 2×2 mesh — one primary link fails (X): link failure]

Fault model: faults accumulate one at a time.
SLIDE 7

Drain Example

[Figure: 2×2 mesh after the first link failure]

Reconfigure the interconnect.

SLIDE 8

Drain Example

[Figure: a second primary link fails (X): link failure]

Fault model: initiate Drain recovery when a single additional fault causes a node to become isolated.
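The trigger condition above can be modeled as a simple graph check. A hedged sketch — the 2×2 topology and link names are made up, and "one additional fault away from isolation" is simplified here to "the router has at most one working link left", which matches a mesh but is not claimed to be the paper's exact circuit:

```python
# Illustrative model of the DRAIN trigger: flag nodes that a single
# additional link failure could disconnect.

def working_degree(node, links, failed):
    """Number of this node's links that have not failed."""
    return sum(1 for link in links[node] if link not in failed)

def at_risk(links, failed):
    """Nodes that the next single link fault could isolate."""
    return [n for n in links if working_degree(n, links, failed) <= 1]

# 2x2 mesh: nodes 0..3, each link named by its two endpoints.
links = {0: ["01", "02"], 1: ["01", "13"],
         2: ["02", "23"], 3: ["13", "23"]}

assert at_risk(links, failed=set()) == []        # healthy mesh: no node at risk
assert at_risk(links, failed={"01"}) == [0, 1]   # corner nodes down to one link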

SLIDE 9

Drain Example

[Figure: 2×2 mesh with two failed links]

Node isolated!

SLIDE 10

Drain Example

[Figure: data flowing out of the still-connected nodes]

Drain connected nodes via primary links.

SLIDE 11

Drain Example

[Figure: data flowing out of the isolated node over its emergency link]

Drain the disconnected node via the emergency link.

SLIDE 12

Emergency Link Algorithm

1. Find the next target cache that is still connected to main memory.
2. If none is found, find the next target cache toward the subnet border instead.
3. Copy dirty cache lines to the target cache.
4. Copy registers and architectural state to the target cache.
5. If the draining node is not yet empty, repeat from step 1; otherwise, done.
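The emergency-link algorithm above can be sketched in software. A hedged Python model — class and function names are illustrative, not from the paper — that prefers a target cache still connected to main memory, falls back toward the subnet border, and loops until the isolated node's dirty lines and architectural state have all been copied out:

```python
from dataclasses import dataclass, field

@dataclass
class Cache:
    name: str
    capacity: int                     # free slots for drained items
    stored: list = field(default_factory=list)

    def has_space(self):
        return len(self.stored) < self.capacity

@dataclass
class IsolatedNode:
    dirty_lines: list                 # dirty cache lines to rescue
    arch_state: str = "registers"     # registers + architectural state

def drain(node, neighbor_caches, memory_connected):
    """Drain an isolated node over the emergency link; True once every
    dirty line and the architectural state has found a new home."""
    pending = node.dirty_lines + [node.arch_state]   # lines first, state last
    while pending:
        # Prefer a target cache still connected to main memory ...
        target = next((c for c in neighbor_caches
                       if c.has_space() and c.name in memory_connected), None)
        # ... otherwise fall back to a cache toward the subnet border.
        if target is None:
            target = next((c for c in neighbor_caches if c.has_space()), None)
        if target is None:
            return False              # no space anywhere: cannot finish
        # Copy into the target until it fills, then pick the next one.
        while pending and target.has_space():
            target.stored.append(pending.pop(0))
    return True                       # source is empty: done
```

With two neighbor caches of capacity 2 and 3 and three dirty lines, the memory-connected cache fills first and the remainder, register state last, spills toward the border cache, mirroring the flowchart's two "find next target" branches.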

SLIDE 13

Drain Example

[Figure: data flowing via primary links once more]

Drain the connected node again.

SLIDE 14

Drain Example

[Figure: the three remaining µP/$ nodes]

Resume normal operation; the OS can re-assign the workload.

SLIDE 15

Drain Hardware

[Figure: Drain hardware added to a node (µP, router, local cache). Existing cache logic: set decoder, sets 0…M, ways 0…N, tag/data arrays with tag comparators. Additional cache logic: DRAIN-enabled control logic, a serial-to-parallel converter on the emergency link input, a parallel-to-serial converter on the emergency link output, and DRAIN data/tag paths alongside the primary link input/output.]

<5,000 gates per node
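The added control logic reuses the existing arrays, walking them set by set and way by way to pull out each dirty line's tag and data for transfer. A hedged model of that scan (array sizes, the dirty-bit encoding, and the tuple layout are illustrative assumptions, not the actual RTL):

```python
NUM_SETS, NUM_WAYS = 4, 2

# cache[set][way] = (tag, data, dirty); contents here are made up.
cache = [[(s * NUM_WAYS + w, f"data{s}{w}", (s + w) % 2 == 0)
          for w in range(NUM_WAYS)] for s in range(NUM_SETS)]

def dirty_lines(cache):
    """Yield (set, way, tag, data) for every dirty line, in array order --
    the order a hardware scan of the sets and ways would visit them."""
    for s, ways in enumerate(cache):
        for w, (tag, data, dirty) in enumerate(ways):
            if dirty:
                yield s, w, tag, data

lines = list(dirty_lines(cache))
```

Because the scan only reads arrays that are already there and adds a small state machine plus the two link converters, the incremental cost stays under 5,000 gates per node.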

SLIDE 16

Drain Performance as Links Fail

[Chart: drain time (cycles per incident, 0-5M) vs. number of injected faults (10-100), plotting the average time to flush data via emergency links and via primary links; emergency-link time increases as faults accumulate.]

SLIDE 17

Memory Latency Before and After

[Chart: average memory latency (cycles, 50-250) before recovery vs. after recovery.]

SLIDE 18

Conclusions

  • DRAIN is a lightweight recovery mechanism for CMPs

– 5,000 gates per node

  • Recoups cache data and architectural state from disconnected nodes
  • Performance overhead only during a recovery incident

– ~3 ms at 1 GHz