A Strategy for Network Resilience David Hutchison Lancaster - PowerPoint PPT Presentation

A Strategy for Network Resilience David Hutchison Lancaster University d.hutchison@lancaster.ac.uk University of Liverpool, 26 June 2014 InfoLab21 InfoLab21

Resilience Generally, this means the capability of people to ‘bounce back’ after experiencing problems [Oxford English Dictionary definition: “Power of resuming the original form after compression &c.”] Specifically, a resilient system is one that can continue to offer a satisfactory level of service even in the face (or in the aftermath) of the challenges it experiences Resilience goes beyond security; it encompasses security but aims to recover from security breaches and also any other challenges that compromise the system InfoLab21 InfoLab21 2

Resilience as a network need • Society is increasingly reliant on the Internet and on “Future Internet Research: networked systems in general (‘Information Society’) The EU framework” by Joao da Silva • Communication networks now underpin many of “ … as the Internet is increasingly becoming a society’s critical infrastructures “critical infrastructure, security and robustness of • We need resilience , a (QoS) property of networks the Internet are naturally becoming issues of major and systems such that they can withstand any concern.” (ACM CCR, 2007) challenge, whether from natural disasters, mis- acmqueue “Resolved: the configurations, hardware or software failures, Internet Is No Place for congestion/overloads (including flash crowds), Critical Infrastructure” by Dan Geer | April 2, 2013 or attacks • Network system attacks are increasing in variety Chinese domains downed by 'largest ever' cyber-attack. and number: virus, worms, botnets, DoS, … DDoS attacks targeted the country's national registry. The Independent, Aug 27, 2013 It is no coincidence that every single major cloud storage provider went down last week. That's Google's cloud storage, Microsoft's cloud storage, Intel's cloud services and Amazon's (the biggest and used by a huge number of other providers from Dixons and Dropbox to Spotify). Remember these services are supposed to have a 99.999% availability yet they've all failed with one day of each other. Not a single word of explanation from any of the companies involved … InfoLab21 InfoLab21 3

Some notable past challenges • 2001 Baltimore tunnel fire • 2001 9/11 terrorist attacks General lessons: • 2003 Cogent peering disputes • 2003 Northeast US blackout - Plan for vulnerabilities (threats may be predictable) • 2005 7/7 terrorist attacks • 2005 Hurricane Katrina - Redundancy without diversity is not resilient • 2006 Hengchun earthquake • 2008 Pakistan YouTube hijack • 2008 Mideast submarine cable cuts European Network • 2009 H1N1 influenza pandemic and Information Security Agency • 2010 Stuxnet worm attack www.enisa.europa.eu A crucial issue identified by ENISA is the lack of a standardised framework, even for the most basic resilience measurements. There are not many frameworks, none of them globally accepted. InfoLab21 InfoLab21 4

ResiliNets project (Kansas, Lancaster): to establish a strategy for network resilience First, investigated the relationship between resilience and other previously-researched areas: • Disciplines related to tolerance of faults and challenges – Fault Tolerance – Survivability – Disruption Tolerance – Traffic Tolerance • Trustworthiness disciplines with quantifiable properties – Dependability – Security – Performability InfoLab21 InfoLab21 5

ResiliNets “formula” and strategy “D 2 R 2 +DR” à Resilience Real-time Control Loop D efend D etect R emediate R ecover System Enhancement D iagnose R efine InfoLab21 InfoLab21 6

Resilience cube model InfoLab21 InfoLab21 7

The ResumeNet project (2008-2011): to evaluate the D 2 R 2 +DR resilience strategy ETH Zürich (ETHZ) – coordinator Switzerland Lancaster University (ULanc)* United Kingdom Technical University Münich (TUM) Germany France Telecom (FT) France NEC Europe Ltd (NEC) United Kingdom Universität Passau (UP) Germany Technical University Delft (TUDelft) Netherlands Uppsala Universitet (UU) Sweden Université de Liège (ULg) Belgium * Also: the Universities of Kansas (USA) and Sydney (Australia) InfoLab21 InfoLab21 8

Approach: three conceptual levels • Framework – Architecture The ResumeNet framework was experimentally evaluated in Future Internet scenarios: wireless – Information flow mesh networks; cloud-based networks; a multimedia service provisioning context; and an – Metrics Internet of Things environment – Challenge classification • Mechanisms and algorithms – Network resilience (redundancy, diversity in routing, transport, incentives for collaboration, challenge detection) – Service resilience (overlays/P2P, virtualization, challenge detection, machine learning) • Validation by experimentation in testbeds and with simulation – {network, service, challenge, resilience mechanism} – Realistic models, traffic and system behavior traces InfoLab21 InfoLab21 9

De-constructing D 2 R 2 +DR (1) • Defend: static, and dynamic Marcus Schoeller et al, “Assessing Risk for Network Resilience” (RNDM 2011) • Initially: – System analysis – Risk assessment – Prioritise the assets – Build defensive walls – E.g. redundant links, nodes • Runtime: – Make adjustments as appropriate – E.g. adjust firewall rules, resources InfoLab21 InfoLab21 10

De-constructing D 2 R 2 +DR (2) • Detect A Knowledge Plane for the Internet • Implies a monitoring system David D. Clark et al, SIGCOMM’03 “To learn about and alter its environment, the – Instrument the network! knowledge plane must access, and manage, what the cognitive community calls sensors and – cf. the Knowledge Plane? actuators. Sensors are entities that produce observations. Actuators are entities that change – Aim to observe normal behaviour behavior (e.g., change routing tables or bring links up or down). So, for instance, a knowledge – Then look for anomalies / intrusions application that sought to operate a network according to certain policies might use sensors to • Employ suitable ADTs / IDSs collect observations on the network, use assertions to determine if the network’s behavior complies with policy, and, if necessary, use – Classify the detected anomalies actuators to change the network’s behavior.” – Attempt a root cause analysis? Fig. 5: Entropy changes with the Slammer Worm From: “PReSET: A Toolset for the Evaluation of Network Resilience Strategies”, by Alberto Schaeffer-Filho et al (IM 2013) InfoLab21 InfoLab21 11

De-constructing D 2 R 2 +DR (3) Alberto Schaeffer-Filho et al, “Policy-based • Remediate DDoS remediation” [see also DRCN 2011] – Rely on symptoms, or root cause – Typically use traffic engineering – Get as much context as possible ① Attack starts Rate limit only the attack flow ④ Rate limit the entire link ② Attack flows successfully classified ⑤ Rate limit all traffic towards the victim • Recover ③ – Get back to normal behaviour if Azman Ali et al, “Evolving Classifier utilizing eClass0 and eCluster (ALS algorithms)” possible – Use policies for high-level guidance • Diagnose & Refine – Learning phase – Human in the loop InfoLab21 InfoLab21 12

Resilience as a network metric • We need to know how to specify resilience and how to measure it – Resilience classes: i.e. the science and the engineering Operational State Resilience class = • For computer networks, we should (challenge tolerance, Normal Partially Severely Operation Degraded Degraded trustworthiness) specify and measure resilience at Gold (Au) Unacceptable – normal operation the topology and the service levels ensures Cu Sn acceptable service • Topology resilience: typically, Silver (Ag) Service Parameters – only partial structural diversity degradation Impaired Ag ensures at most impaired • Service resilience: for example, a service Bronze (CuSn) combination of availability and Acceptable – no assurance of Au reliability service • Overall R [0,1]: a combination of individual metrics, maybe simplified as a set of ‘resilience classes’ InfoLab21 InfoLab21 13

ResumeNet architectural model: D 2 R 2 Note: Centralized view of a complex distributed system InfoLab21 InfoLab21 14

System enhancement: +DR • Outer feedback loop • long-term, slow reaction • Driven by politics or market forces InfoLab21 InfoLab21 15 • humans in the loop : re-design, policy change

System implementation view Idealized system operation + Refine - + (Human) Off-line Loop: DR Defence Design & Mechanisms Policies Real-time Loop: D 2 R 2 Challenges Service provided Network & Resilience to Services Target users Resilience Resilience Estimator Manager Resilience Mechanisms Resilience Challenge Knowledge Analysis Diagnose InfoLab21 InfoLab21 16

A Strategy for Network Resilience David Hutchison Lancaster - PowerPoint PPT Presentation

A Strategy for Network Resilience David Hutchison Lancaster University d.hutchison@lancaster.ac.uk University of Liverpool, 26 June 2014 InfoLab21 InfoLab21 Resilience Generally, this means the capability of people to bounce back

Childrens Resilience Initiative One Communitys Response to ACEs through Resilience 1

How do we assess resilience? Paul Ryan, Australian Resilience Centre Allyson Quinlan, Resilience

Do Now: Resilience 1. Create a Circle Map for resilience. 2. Look at the pictures. What

resilience Professor Kate Thomas c.p.thomas@bham.ac.uk What is resilience? Resilience is the

Community Resilience and Engagement Strategy What is the strategy? Public facing document

Developing Resilience Parent Council Meeting What is resilience . Resilience is the ability to

Mission: Resilience Keeping going & not giving up! We are going to learn all about the

East Central Flo lorida Regional Resilience Coll llaborative Resilience EDA funded

Resilience Is Key CHORUS AMERICA VIRTUAL CONFERENCE JUNE 2020 NADINE WETHINGTON FORTE

SAME Resilience Webinar Dedicated to National Security Since 1920 JETC Resilience Program Wed, 23

Kent Resilience Forum Activity 2017 Steve Scully KCC Senior Resilience Officer Kent Resilience

From BBB to Urban Resilience Dr. Joe Leitmann Team Leader , Urban Resilience GFDRR/World Bank

Outline Node vs. edge percolation Resilience of randomly vs. preferentially grown

The Concept of Resilience Developing Supports to Build Resilience Julia ten Hove What is

ReSIST ReSIST Resilience for Survivability in IST A European Network of Excellence Second Open

RECOVERY, RENEWAL, RESILIENCE U.S. VIRGIN ISLANDS MARCH 8, 2018 Recovery, Renewal, Resilience

Qualification of Alloys for Structural Applications in Fluoride High Temperature Reactor (FHR)

BOARD OF COMMISSIONERS 7/20/2015 304 E. Grand River, Board Chambers, Howell, MI 48843 7:30 PM

Earnings Presentation July 29, 2016 Agenda Strategic Review Edward Tilly Chief Executive

NRIC VI FG4 Broadband Report for December 5th 2003 Network Reliability and Interoperability

Network Slices for Vertical Industries The First Workshop on Control and C. Casetti, C. F.

tts r trs Ptrs t

Reasoning about the Security of Open Architecture Software Systems Walt Scacchi and Thomas

Multipath Interference Characterization in Wireless Communication Systems Michael Rice BYU

Sambuz

Useful Links

Newsletter

Mail Us

A Strategy for Network Resilience David Hutchison Lancaster - PowerPoint PPT Presentation

A Strategy for Network Resilience David Hutchison Lancaster University d.hutchison@lancaster.ac.uk University of Liverpool, 26 June 2014 InfoLab21 InfoLab21 Resilience Generally, this means the capability of people to bounce back

Childrens Resilience Initiative One Communitys Response to ACEs through Resilience 1

How do we assess resilience? Paul Ryan, Australian Resilience Centre Allyson Quinlan, Resilience

Do Now: Resilience 1. Create a Circle Map for resilience. 2. Look at the pictures. What

resilience Professor Kate Thomas c.p.thomas@bham.ac.uk What is resilience? Resilience is the

Community Resilience and Engagement Strategy What is the strategy? Public facing document

Developing Resilience Parent Council Meeting What is resilience . Resilience is the ability to

Mission: Resilience Keeping going &amp; not giving up! We are going to learn all about the

East Central Flo lorida Regional Resilience Coll llaborative Resilience EDA funded

Resilience Is Key CHORUS AMERICA VIRTUAL CONFERENCE JUNE 2020 NADINE WETHINGTON FORTE

SAME Resilience Webinar Dedicated to National Security Since 1920 JETC Resilience Program Wed, 23

Kent Resilience Forum Activity 2017 Steve Scully KCC Senior Resilience Officer Kent Resilience

From BBB to Urban Resilience Dr. Joe Leitmann Team Leader , Urban Resilience GFDRR/World Bank

Outline Node vs. edge percolation Resilience of randomly vs. preferentially grown

The Concept of Resilience Developing Supports to Build Resilience Julia ten Hove What is

ReSIST ReSIST Resilience for Survivability in IST A European Network of Excellence Second Open

RECOVERY, RENEWAL, RESILIENCE U.S. VIRGIN ISLANDS MARCH 8, 2018 Recovery, Renewal, Resilience

Qualification of Alloys for Structural Applications in Fluoride High Temperature Reactor (FHR)

BOARD OF COMMISSIONERS 7/20/2015 304 E. Grand River, Board Chambers, Howell, MI 48843 7:30 PM

Earnings Presentation July 29, 2016 Agenda Strategic Review Edward Tilly Chief Executive

NRIC VI FG4 Broadband Report for December 5th 2003 Network Reliability and Interoperability

Network Slices for Vertical Industries The First Workshop on Control and C. Casetti, C. F.

tts r trs Ptrs t

Reasoning about the Security of Open Architecture Software Systems Walt Scacchi and Thomas

Multipath Interference Characterization in Wireless Communication Systems Michael Rice BYU

Sambuz

Useful Links

Newsletter

Mail Us

Mission: Resilience Keeping going & not giving up! We are going to learn all about the