Ken Birman, Cornell University. CS5410 Fall 2008. Background for today - PowerPoint PPT Presentation



SLIDE 1

Ken Birman

Cornell University. CS5410 Fall 2008.

SLIDE 2

Background for today

Consider a system like Astrolabe. Node p announces:

I've computed the aggregates for the set of leaf nodes to which I belong

It turns out that under the rules, I'm one regional contact to use, and my friend node q is the second contact

Nobody in our region has seen any signs of intrusion attempts.

Should we trust any of this? Similar issues arise in many kinds of P2P and gossip-based systems

SLIDE 3

What could go wrong?

Nodes p and q could be compromised

Perhaps they are lying about values other leaf nodes reported to them…

… and they could also have miscomputed the aggregates

… and they could have deliberately ignored values that they were sent, but felt were "inconvenient" ("oops, I thought that r had failed…")

Indeed, they could assemble a "fake" snapshot of the region using a mixture of old and new values, and then compute a completely correct aggregate over this distorted and inaccurate raw data

SLIDE 4

Astrolabe can’t tell

… Even if we wanted to check, we have no easy way to fix Astrolabe to tolerate such attacks

We could assume a public key infrastructure and have nodes sign values, but doing so only secures raw data

Doesn't address the issue of who is up, who is down, or whether p was using correct, current data

And even if p says "the mean was 6.7" and signs this, how can we know if the computation was correct?

Points to a basic security weakness in P2P settings
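The gap between "signed" and "correct" can be shown in a few lines. A minimal sketch in Python: an HMAC with a per-node secret stands in for real public-key signatures (a PKI would use RSA or ECDSA), and the node names, keys, and values are all hypothetical.

```python
import hashlib
import hmac

# Toy "signatures": HMAC with a per-node secret simulates a PKI.
KEYS = {"p": b"key-p", "leaf1": b"key-1", "leaf2": b"key-2"}

def sign(node, claim):
    return hmac.new(KEYS[node], claim.encode(), hashlib.sha256).hexdigest()

def verify(node, claim, sig):
    return hmac.compare_digest(sign(node, claim), sig)

# Leaves sign their raw values: signatures do secure the raw data.
raw = {"leaf1": "load=4.0", "leaf2": "load=9.4"}
signed_raw = {n: (v, sign(n, v)) for n, v in raw.items()}
assert all(verify(n, v, s) for n, (v, s) in signed_raw.items())

# But p signs whatever aggregate it likes: a bogus mean verifies just
# as well as an honest one, because the signature binds the claim to
# p's identity, not to the inputs p used.
honest = "mean=6.7"
bogus = "mean=2.0"        # p silently dropped leaf2's value
print(verify("p", honest, sign("p", honest)))   # True
print(verify("p", bogus, sign("p", bogus)))     # True as well
```

Both checks succeed: verification tells us only that p said it, never that the computation behind it was done correctly.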

SLIDE 5

Today’s topic

We are given a system that uses a P2P or gossip protocol and does something important. Ask: Is there a way to strengthen it so that it will tolerate attackers (and tolerate faults, too)?

Ideally, we want our solution to also be a symmetric, P2P gossip solution

We certainly don’t want it to cost a fortune

For example, in Astrolabe, one could imagine sending raw

data instead of aggregates: yes, this would work… but it would be far too costly and in fact would “break the gossip model”

And it needs to scale well

SLIDE 6

… leading to

Concept of a Sybil attack. Broadly:

Attacker has finite resources. Uses a technical trick to amplify them into a huge (virtual) army of zombies

These join the P2P system and then subvert it

SLIDE 7

Who was Sybil?

Actual woman with a psychiatric problem

Termed "multiple personality disorder"

Unclear how real this is

Sybil Attack: using a small number of machines to mimic a much larger set

SLIDE 8

Relevance to us?

Early IPTPS paper suggested that P2P and gossip systems are particularly fragile in the face of Sybil attacks

Researchers found that if one machine mimics many (successfully), the attackers can isolate healthy ones

Particularly serious if a machine has a way to pick its own hashed ID (as occurs in systems where one node inserts itself multiple times into a DHT)

Having isolated healthy nodes, the attacker can create a "virtual" environment in which to manipulate the outcome of queries and other actions

SLIDE 9

Real world scenarios

Recording Industry Association of America (RIAA) rumored to have used Sybil attacks to disrupt illegal file sharing

So-called "Internet Honeypots" lure viruses, worms, and other malware (like insects to a pot of honey)

Organizations like the NSA might use Sybil approach

to evade onion‐routing and other information hiding methods

SLIDE 10

Elements of a Sybil attack

In a traditional attack, the intruder takes over some machines, perhaps by gaining root privileges

Once on board, the intruder can access files and other data managed by the P2P system, maybe even modify them

Hence the node runs the correct protocol but is controlled by the attacker

In a Sybil attack, the intruder has similar goals, but

seeks a numerical advantage.

SLIDE 11

Chord scenario

Once a search reaches a compromised node, the attacker can "hijack" it

[Diagram: Chord ring with nodes N5, N10, N20, N32, N60, N80, N99, N110. A Lookup(K19) request is routed around the ring toward the node responsible for key K19; a compromised node on the route can answer the lookup itself.]
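The hijack in the diagram can be sketched with a toy ring. This is a minimal Python sketch using successor-hopping only (real Chord jumps via finger tables, but the hijack works the same way); the node IDs follow the slide and the compromised set is hypothetical.

```python
# Toy Chord-style ring: sorted node IDs from the slide's diagram.
RING = [5, 10, 20, 32, 60, 80, 99, 110]

def successor(key):
    """The node responsible for `key`: first ID >= key, wrapping around."""
    return next((n for n in RING if n >= key), RING[0])

def lookup(key, start, compromised=frozenset()):
    """Hop from successor to successor, beginning at `start`, until the
    owner of `key` is reached; a compromised hop claims the key instead."""
    node = start
    while node != successor(key):
        node = RING[(RING.index(node) + 1) % len(RING)]
        if node in compromised:
            return node                 # lookup hijacked: bogus answer
    return node

print(lookup(19, start=80))                     # honest answer: 20
print(lookup(19, start=80, compromised={110}))  # hijacked en route: 110
```

A single compromised node anywhere on the lookup path is enough, which is why amplifying a few machines into many ring positions is so damaging.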

SLIDE 12

Challenge is numerical…

In most P2P settings, there are LOTS of healthy clients. Attack won't work unless the attacker has a huge number of machines at his disposal

Even a rich attacker is unlikely to have so much money

Solution?

Attacker amplifies his finite number of attack nodes by clever use of a kind of VMM

SLIDE 13

VMM technology

Virtual machine technology dates to IBM in the 1970s

Idea then was to host a clone of an outmoded machine or operating system on a more modern one

Very popular… reduced costs of migration

Died back but then resurfaced during the OS wars between Unix variants (Linux, FreeBSD, Mac OS…) and the Windows platforms

Goal was to make Linux the obvious choice. Want Windows? Just run it in a VMM partition

SLIDE 14

Example: IBM VM/370

[Diagram: IBM VM/370 architecture. CP (the control program) runs on the real System/370 hardware and hosts several virtual System/370 machines; each runs its own guest system (MVS, DOS/VS, CMS, or even a nested virtual CP) with its own user processes.]

Adapted from Deitel, pp. 606–607

SLIDE 15

VMM technology took off

Today VMWare is a huge company

Ironically, the actual VMM in widest use is Xen, from XenSource in Cambridge

Uses paravirtualization

Main application areas?

Some "Windows on Linux". But migration of VMM images has been very popular

Leads big corporations to think of thin clients that talk

to VMs hosted on cloud computing platforms

Term is “consolidation”

SLIDE 16

Paravirtualization vs. Full Virtualization

[Diagram: x86 privilege rings under the two approaches. Full virtualization: user applications in Ring 3, guest OSes in Ring 1, and the VMM in Ring 0 performing binary translation. Paravirtualization: user apps and the control plane in Ring 3, guest OSes and Dom0 in Ring 1, and Xen in Ring 0.]

SLIDE 17

VMMs and Sybil

If one machine can host multiple VM images… then we

have an ideal technology for Sybil attacks

Use one powerful machine, or a rack of them. Amplify them to look like thousands or hundreds of thousands of machines

Each of those machines offers to join, say, eMule

Similar for honeypots

Our system tries to look like thousands of tempting, not very protected Internet nodes

SLIDE 18

Research issues

If we plan to run huge numbers of instances of some OS on our VM, there will be a great deal of replication of pages

All are running identical code, configurations (or nearly

identical)

Hence want VMM to have a smart memory manager

that has just one copy of any given page

Research on this has yielded some reasonable solutions. Copy-on-write quite successful as a quick hack and by itself gives a dramatic level of scalability
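The page-sharing idea above can be sketched as content-addressed storage with copy-on-write. This is a minimal Python sketch of the concept only (real VMMs hash and compare physical pages in the background); the class and its interface are hypothetical.

```python
import hashlib

class DedupMemory:
    """Content-addressed page store: identical pages share one physical
    copy; a write to a shared page triggers copy-on-write."""

    def __init__(self):
        self.store = {}     # content hash -> (page bytes, refcount)
        self.maps = {}      # (vm, vpage) -> content hash

    def map_page(self, vm, vpage, data):
        h = hashlib.sha256(data).hexdigest()
        page, refs = self.store.get(h, (data, 0))
        self.store[h] = (page, refs + 1)        # share or create
        self.maps[(vm, vpage)] = h

    def write(self, vm, vpage, data):
        # Copy-on-write: release our reference to the shared copy,
        # then map the newly written contents as a fresh page.
        old = self.maps[(vm, vpage)]
        page, refs = self.store[old]
        if refs == 1:
            del self.store[old]
        else:
            self.store[old] = (page, refs - 1)
        self.map_page(vm, vpage, data)

mem = DedupMemory()
kernel = b"\x90" * 4096              # identical kernel page in every VM
for vm in range(1000):               # 1000 Sybil VM instances...
    mem.map_page(vm, 0, kernel)
print(len(mem.store))                # ...backed by 1 physical copy
mem.write(0, 0, b"\x00" * 4096)      # VM 0 writes: page gets copied
print(len(mem.store))                # now 2 physical copies
```

A thousand nearly identical guests collapse to a handful of unique pages, which is exactly the scalability the Sybil attacker needs.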

SLIDE 19

Other kinds of challenges

One issue relates to IP addresses

Traditionally, most organizations have just one or two primary IP domain addresses

For example, Cornell has two "homes" that function as NAT boxes. All our machines have the same IP prefix

This is an issue for the Sybil attacker

Systems like eMule have black lists. If they realize that one machine is compromised, it would be trivial to exclude others with the same prefix

But there may be a solution….

SLIDE 20

Attacker is the “good guy”

In our examples, the attacker is doing something legal. And has a lot of money. Hence helping him is a legitimate line of business for ISPs

So ISPs might offer the attacker a way to purchase lots and lots of seemingly random IP addresses

They just tunnel the traffic to the attack site

SLIDE 21

A very multi‐homed Sybil attacker

SLIDE 22

Implications?

Without “too much” expense, attacker is able to

Create a potentially huge number of attack points Situate them all over the network (with a little help from

AT&T or Verizon or some other widely diversified ISP)

Run whatever he would like on the nodes rather efficiently, gaining a 50x or even several-hundred-x scale-up factor!

And this really works…

See, for example, the Honeypot work at UCSD

  • U. Michigan (Brian Ford, Peter Chen) is another example
SLIDE 23

Defending against Sybil attacks

1. Often system maintains a black list
  • If nodes misbehave, add to black list
  • Need a robust way to share it around
  • Then can exclude the faulty nodes from the application
  • Issues? Attacker may try to hijack the black list itself
  • So black list is usually maintained by central service
2. Check joining nodes
  1. Make someone solve a puzzle (proof of human user)
  2. Perhaps require a voucher "from a friend"
  3. Finally, some systems continuously track "reputation"
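The blacklist defense, combined with the prefix observation from SLIDE 19, can be sketched quickly. This is a hypothetical Python sketch (the class, method names, and addresses are illustrative, not any real system's API): banning the /24 around one misbehaving node cheaply excludes its siblings, which is exactly what an attacker with ISP-tunneled, scattered addresses evades.

```python
import ipaddress

class BlackList:
    """Centrally maintained black list keyed by IP prefix (a sketch).
    One misbehavior report bans the whole /24, on the theory that a
    traditional organization's machines share a prefix."""

    def __init__(self):
        self.banned = set()                      # set of banned networks

    def report(self, addr):
        # Ban the /24 containing the misbehaving address.
        self.banned.add(ipaddress.ip_network(addr + "/24", strict=False))

    def admit(self, addr):
        ip = ipaddress.ip_address(addr)
        return not any(ip in net for net in self.banned)

bl = BlackList()
bl.report("192.0.2.77")            # one node misbehaves...
print(bl.admit("192.0.2.200"))     # False: its whole /24 is excluded
print(bl.admit("198.51.100.5"))    # True: a scattered address slips in
```

The last line shows the limit of the defense: a Sybil attacker whose addresses are spread across many prefixes pays almost nothing per banned node.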

SLIDE 24

Reputation

Basic idea:

Nodes track behavior of other nodes. Goal is to:

Detect misbehavior. Be in a position to prove that it happened

Two versions of reputation tracking

Some systems assume that the healthy nodes outnumber the misbehaving ones (by a large margin)

In these, a majority can agree to shun a minority

Other systems want proof of misbehavior

SLIDE 25

Proof?

Suppose that we model a system as a time‐space

diagram, with processes, events, messages

[Diagram: time-space diagram with processes p, q, r, and s drawn as horizontal lines; events e0 through e11 occur along the process lines, connected by messages.]

SLIDE 26

Options

Node A to all:

Node B said "X" and I can prove it. Node B said "X" in state S and I can prove it. Node B said "X" when it was in state S, after I reached state S' and before I reached state S''

First two are definitely achievable. Last one is trickier and comes down to the cost we will pay

Collusion attacks are also tricky

SLIDE 27

Collusion

Occurs when the attacker compromises multiple nodes. With collusion they can talk over their joint story and invent a plausible and mutually consistent one

They can also share their private keys, gang up on a defenseless honest node, etc.

SLIDE 28

An irrefutable log

Look at an event sequence: e0, e1, e2. Suppose that we keep a log of these events

If I’m shown a log, should I trust it?

Are the events legitimate? We can assume public‐key cryptography (“PKI”)

Have the process that performed each event sign for it

[e0]p

SLIDE 29

Use of a log?

It lets a node prove that it was able to reach state S. Once an honest third party has a copy of the log, the creator can't back out of the state it claimed to reach

But until a third party looks at the log, logs are local and a dishonest node could have more than one…

SLIDE 30

An irrefutable log

But can I trust the sequence of events?

Each record can include a hash of the prior record

[MD5(e0): e1]p

Doesn't prevent a malicious process from maintaining multiple versions of the local log ("cooked books")

But any given log has a robust record sequence now
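The chained-record idea can be sketched in a few lines. A minimal Python sketch (MD5 as on the slide; the per-record signature is elided, where a real log would sign each record with p's private key):

```python
import hashlib

def h(record):
    # MD5 per the slide; any collision-resistant hash works the same way.
    return hashlib.md5(repr(record).encode()).hexdigest()

def append(log, event, signer="p"):
    """New record carries the hash of the prior record, as in
    [MD5(e0): e1]p (signature elided in this sketch)."""
    prev = h(log[-1]) if log else None
    log.append((prev, event, signer))

def well_formed(log):
    """Any third party can check the chain: each record must contain
    the hash of its predecessor."""
    return all(log[i][0] == h(log[i - 1]) for i in range(1, len(log)))

log = []
for e in ["e0", "e1", "e2"]:
    append(log, e)
print(well_formed(log))            # True

log[1] = (log[1][0], "e1-forged", "p")   # rewrite history...
print(well_formed(log))            # False: the chain exposes tampering
```

Rewriting any record breaks every hash downstream of it, which is what gives a single log its "robust record sequence". It does nothing, as the slide notes, against a process that keeps two separate well-formed logs.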

SLIDE 31

An irrefutable log

What if p talks to q?

p tells q the hash of its last log entry (and signs for it) q appends to log and sends log record back to p

[Diagram: p logs e0, then e1 as [MD5p(e0): e1]p. p sends q the message m bound to its latest record, as [[e1]p : m]p. q receives this as its incoming event e2; q's new log record is [[e2]q [[e1]p : m]p ]q, which q sends back to p.]
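The exchange in the diagram can be sketched end to end. A minimal Python sketch with signatures simulated by tagging records with the node name (a real protocol signs every record and the receipt); the record layout is hypothetical.

```python
import hashlib

def h(record):
    # MD5, as on the slide.
    return hashlib.md5(repr(record).encode()).hexdigest()

def entry(node, event, prev, **extra):
    """One log record, chained to the node's previous record via `prev`."""
    return {"node": node, "event": event, "prev": prev, **extra}

# p logs e0 and e1, each record chained to its predecessor.
p_log = [entry("p", "e0", None)]
p_log.append(entry("p", "e1", h(p_log[-1])))

# p -> q: the message m carries the hash of p's latest record, binding
# the send to p's state at that instant.
m = {"payload": "hello", "sender_state": h(p_log[-1])}

# q may choose WHEN to receive m, but once it accepts, it must log the
# receive event -- and that record doubles as a receipt for p.
q_log = [entry("q", "e2", None, msg=m)]
receipt = h(q_log[-1])

# p must log the receipt before starting its next exchange. p can now
# prove it sent m in its current state and that q received it.
p_log.append(entry("p", "e3", h(p_log[-1]), receipt=receipt))
print(p_log[-1]["receipt"] == h(q_log[-1]))    # True
```

Because q's receipt embeds p's record, and p's next record embeds the receipt, the two logs are cross-linked: any later auditor can line them up.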

SLIDE 32

What does this let us prove?

Node p can prove now that

When it was in state S It sent message M to q And node q received M in state S’

Obviously, until p has that receipt in hand, though, it can't know (much less prove) that M was received

SLIDE 33

An irrefutable log

q has freedom to decide when to receive the message from p… but once it accepts the message it is compelled to add to its log and send proof back to p

p can decide when to receive the proof, but then must log it

Rule: must always log the outcome of the previous exchange before starting the next one

SLIDE 34

Logs can be audited

Any third party can

Confirm that p’s log is a well‐formed log for p Compare two logs and, if any disagreement is present,

can see who lied

Thus, given a system, we can (in general) create a consistent snapshot, examine the whole set of logs, and identify all the misbehaving nodes within the set

Idea used in NightWatch (Haridasan, van Renesse '07)
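One auditing step is easy to sketch: catching a node that "cooked the books" by keeping two versions of its log. A minimal Python sketch, assuming both logs are hash-chained as on the earlier slides:

```python
import hashlib

def h(record):
    return hashlib.md5(repr(record).encode()).hexdigest()

def chain(events):
    """Build a hash-chained log from a list of events."""
    log = []
    for e in events:
        log.append((h(log[-1]) if log else None, e))
    return log

def audit(log_a, log_b):
    """Compare two logs allegedly kept by the same node.  If both are
    well-formed internally but diverge, the node maintained two
    versions of history.  Returns the index of the first disagreement,
    or None if the logs are consistent."""
    for i, (ra, rb) in enumerate(zip(log_a, log_b)):
        if ra != rb:
            return i
    return None

honest = chain(["e0", "e1", "e2"])
forked = chain(["e0", "e1-alt", "e2"])   # version shown to someone else
print(audit(honest, honest))             # None: consistent
print(audit(honest, forked))             # 1: fork detected, provably
```

Since every record downstream of a fork hashes differently, one comparison pinpoints exactly where the two histories diverge, and whoever presented both logs is caught.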

SLIDE 35

Costs?

Runtime overhead is tolerable

Basically, must send extra signed hashes These objects are probably 128 bits long

Computing them is slow, however

Not extreme, but encrypting an MD5 hash isn't cheap

Auditing a set of logs could be very costly

Study them to see if they embody a contradiction

Could even check that computation was done correctly

SLIDE 36

Methods of reducing costs

One idea: don’t audit in real‐time

Run auditor as a background activity Periodically, it collects some logs, verifies them

individually, and verifies the cross‐linked records too

Might only check "now and then". For fairness: have everyone do some auditing work. If a problem is discovered, broadcast the bad news with a proof (use gossip: very robust). Everyone checks the proof, then shuns the evil-doer

SLIDE 37

Limits of auditability

Underlying assumption?

Event information captures everything needed to verify the log contents

But is this assumption valid?

What if an event says "process p detected a failure of process q"?

Could be an excuse used by p for ignoring a message!

And we also saw that our message exchange protocol still left p and q some wiggle room ("it showed up late…")

SLIDE 38

Apparent need?

Synchronous network. Accurate failure detection. In effect: auditing is as hard as solving consensus. But if so, FLP tells us that we can never guarantee that auditing will successfully reveal the truth

SLIDE 39

How systems deal with this?

Many don’t: Most P2P systems can be disabled by Sybil

attacks

Some use human‐in‐the‐loop solutions

Must prove a human is using the system. And perhaps central control decides who to allow in

Auditing is useful, but no panacea

SLIDE 40

Other similar scenarios

Think of Astrolabe

If "bad data" is relayed, can contaminate the whole system (Amazon had such an issue in August '08)

Seems like we could address this for leaf data with a signature scheme… but what about aggregates?

If node A tells B that "In region R, least loaded machine at time 10:21.376 was node C with load 5.1"

Was A using valid inputs? And was this correct at that specific time?

An evil-doer could delay data or detect failures to manipulate the values of aggregates!
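The temporal manipulation is easy to demonstrate. A Python sketch with illustrative numbers (the load values echo the slide's example; the node names and timestamps are hypothetical): the aggregate itself is computed correctly, yet the answer is wrong because one input is stale.

```python
def least_loaded(reports):
    """Aggregate: the (node, load) pair with the smallest reported load."""
    node = min(reports, key=lambda n: reports[n][0])
    return node, reports[node][0]

# Honest aggregation over fresh (load, timestamp) reports.
honest = {"A": (6.0, "10:21.376"),
          "B": (5.9, "10:21.376"),
          "C": (9.8, "10:21.376")}
print(least_loaded(honest))          # ('B', 5.9)

# An evil-doer delays C's fresh (busy) report and replays an old quiet
# one: a "completely correct" aggregate over distorted raw data.
stale = dict(honest, C=(5.1, "10:09.000"))
print(least_loaded(stale))           # ('C', 5.1)
```

No record was forged and no signature would fail; the attacker only chose which timestamps to feed the aggregation, which is why signing leaf data alone cannot secure aggregates.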

SLIDE 41

Auditable time?

Only way out of temporal issue is to move towards a

state machine execution

Every event…

… eventually visible to every healthy node
… in identical order
… even if nodes fail during protocol, or act maliciously

With this model, a faulty node is still forced to accept

events in the agreed upon order

SLIDE 42

Summary?

Sybil attacks: remarkably hard to stop

With small numbers of nodes: feasible With large numbers: becomes very hard

Range of options

Simple schemes like blacklists

Simple forms of reputation ("Jeff said that if I mentioned his name, I might be able to join…")

Fancy forms of state tracking and audit