Epidemic Protocols in Peer-to-Peer Computing Dr. r. G Giuse - PowerPoint PPT Presentation

NexTech 2011 - AP2PS 2011 The Third International Conference on Advances in P2P Systems, November 20-25, 2011, Lisbon, Portugal Keynote Presentation: Epidemic Protocols in Peer-to-Peer Computing Dr. r. G Giuse iuseppe pe D Di i Fat atta G.DiFatta@reading.ac.uk Monday, November 21, 2011

The University of Reading • Established in 1892 as an extension of the Christ Church College of the University of Oxford. • Received its Royal Charter in 1926. • Awarded the Queen's Anniversary Prize for Higher and Further Education in 1998, 2005 and 2009. • One of the ten most research intensive universities in the UK. • Campus voted as one of best green spaces in the UK in 2011. Dr. G. Di Fatta 2

Outline • Introduction • Gossip or Epidemic protocols – robustness and efficiency – push vs. pull schemes – convergence speed and accuracy • Applications in large-scale systems – information dissemination vs. global knowledge – the data aggregation problem • Future applications in/of P2P systems • Open issues, research directions and conclusions Dr. G. Di Fatta 3

Is Peer-to-Peer in Decline? • Google trends are often (and arguably) shown as – evidence for the decline of a subject or – to advocate the rise of another Cloud Computing Peer Pe er-to to-Pe Peer er Grid Computing Cloud Computing “Peer Peer t to Pe Peer” er” Grid Computing Dr. G. Di Fatta 4

Is Peer-to-Peer in Decline? • Facts [source: Sandvine’s Global Internet Phenomena Report: Fall 2011 ] – P2P file sharing traffic as % of overall IP traffic has declined – overall IP traffic and P2P file sharing traffic have increased Dr. G. Di Fatta 5

Is Peer-to-Peer in Decline? • Decline of P2P file sharing applications – Security and legal issues • Malware distributed in place of content • Many organisations block ports of P2P applications – P2P has been replaced by other means of file sharing • RapidShare, Megavideo, iTunes, iPlayer, Hulu, Netflix, etc. • P2P paradigm emancipation – applications beyond file sharing • VoIP, video chat, live video streaming, • data-intensive ad-hoc applications, e.g., the CERN Advanced Storage system (CASTOR) • volunteer computing, Clouds integration • social media, online social networking Dr. G. Di Fatta 6

Papers Statistics • Source: IEEE Xplore – Keyword search: Metadata Only – Publisher: IEEE – Content Types: Conferences, Journals – Subjects: Computing & Processing (Hardware/Software), Communication, Networking & Broadcasting 3500 500 peer-to-peer epidemic OR gossip 3000 cloud computing 400 grid computing epidemic OR gossip AND P2P 2500 epidemic OR gossip 300 2000 1500 200 1000 100 500 0 0 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 Dr. G. Di Fatta 7

Gossip • Etymology: “gossip” is from Old English godsibb (= godparent) • Gossip is rumor, possibly the oldest and most common mean of sharing facts and opinions.  peer to peer information spreading • From an evolutionary biology point of view, it aids social bonding in large groups. overlay networks  • From an evolutionary psychology point of view, it aids building cooperative reputations and maintaining widespread indirect reciprocity: altruistic behaviour is favoured by the probability of future mutual interactions ( randomly chosen pair-wise encounters). tit for tat  Dr. G. Di Fatta

Epidemic • Etymology: “epidemic” is from Greek words epi and demos (= upon or above people). • In epidemiology it is a disease outbreak. It occurs when new cases exceed a "normal" expectation of propagation (a contained propagation). – The disease spreads person-to-person: the affected individuals become independent reservoirs leading to further exposures. – In uncontrolled outbreaks there is an exponential growth of the infected cases. Figure from: “Controlling infectious disease outbreaks: Lessons Figure from: “Rapid communications A preliminary estimation of the from mathematical modelling”, T Déirdre Hollingsworth, Journal of reproduction ratio for new influenza A(H1N1) from the outbreak in Public Health Policy 30, 328-341, Sept. 2009 Mexico, March-April 2009", P Y Boëlle, P Bernillon, J C Desenclos, Eurosurveillance, Volume 14, Issue 19, 14 May 2009 Dr. G. Di Fatta

A Bio-Inspired Paradigm • Epidemic or Gossip protocols are a communication and computation par paradi adigm gm for large-scale networked systems – based on randomised communication, – provides • scalability, • probabilistic guarantees on convergence speed and accuracy, • robustness, resilience, • fault-tolerance, high stability under disruption, • computational and communication efficiency. Dr. G. Di Fatta

Seminal Work and History • Clearinghouse Directory Service, Demers et al., Xerox PARC, 1987 • The refdbms distributed bibliographic database system, Golding et al., 1993 • Bayou project, Demers et al., Xerox PARC, 1993-97 • Bimodal Multicast, Cornell, 1998 • Astrolabe, Cornell, 1999 • 2000-2005, a few papers studied and extended the use of Epidemic approaches in communication networks and distributed systems Dr. G. Di Fatta

Applicability • Information Dissemination – Epidemic protocols can be used to disseminate information in large- scale distributed environments. • broadcasting, multicasting, failure detection, synchronisation, sampling, replica maintenance, monitoring, management, etc. • Data Aggregation – Epidemic protocols can also be adopted to solve the data aggregation problem in a fully decentralized manner. • Complex applications can be built from these basic services for very dynamic and very large-scale distributed systems. – e.g., fully decentralised Data Mining applications for large-scale distributed systems. Dr. G. Di Fatta

Information Dissemination • Epidemic information dissemination with probabilistic guarantees: – Anti-entropy • every node periodically chooses another node at random and resolves any differences in state – Rumour mongering • infected nodes periodically choose a node at random and spread the rumour – Gossiping • each node forwards a message probabilistically Dr. G. Di Fatta 13

Information Dissemination • Protocols for information dissemination in large-scale systems should have the following properties: – Efficiency, Robustness, Speed, Scalability • Alternative approaches: – Tree-based: efficient, but fragile and difficult configuration – Flooding: robust, but inefficient – Gossip-based: both efficient and robust, but has relatively high latency Gossip efficiency robustness Tree speed Flood Dr. G. Di Fatta 14

Gossip-based Protocol • Based on randomised communication and – peer selection mechanism – definition of state and merge function • Repeat • Repeat – wait some ∆ T – receive remote state – chose a random peer – merge with local state – send local state Dr. G. Di Fatta 15

Gossip Propagation Time • Time to propagate information originated at one peer expected # protocol cycles # peers Time to complete “infection”: O(log N) Dr. G. Di Fatta 16

Variants • Push epidemic – each peer sends state to other member • Pull epidemic – each peer requests state from other member – starts slowly, ends quickly – expected #rounds the same • Push/Pull epidemic – Push and Pull in one exchange – reduces #rounds, but increases overhead Dr. G. Di Fatta 17

Data Aggregation • (a.k.a. the “node aggregation” problem) • Given a network of N nodes, each node i holding a local value x i , • the goal is to determine the value of a global aggregation function f() at every node: f(x 0 , x 1 , ..., x N-1 ) • Example of aggregation functions: – sum, average, max, min, random samples, quantiles and other aggregate databases queries. Dr. G. Di Fatta

Aggregation: e.g., Sum − N 1 ∑ = s x i = i 0 • Centralised approach: all receive operations, and all additions, must be serialized: O(N) • Divide-and-conquer strategy to perform the global sum with a binary tree: the number of communication steps is reduced from O(N) to O(log(N)). Dr. G. Di Fatta 19

All-to-all Communication • MPI AllReduce MPI predefined operations: max, min, sum, product, and, or, xor  all processes compute identical results  number of communication steps: log(N)  number of messages: N*log(N)  Any global function which f ( x , x ,..., x ) can be approximated well − 0 1 N 1 using linear combinations. x 4 x 0 x 1 x 2 x 3 x 5 x 6 x 7 Dr. G. Di Fatta 20

Fault-Tolerance and Robustness • The parallel approach is not fault tolerant. • Even a single node or link failure cannot be tolerated. • A delay on a single communication link has an effect on all nodes. node ode failur ure • In large-scale and dynamic distributed systems we require the protocols to be decentralised and fault-tolerant. Dr. G. Di Fatta 21

The Push-Sum Protocol (PSP) • Each node i holds and updates the local sum s t,i and a weight w t,i . • Initialisation: – Node i sends the pair <x i ,w 0,i > to itself. • At each cycle t: • Update at node i: <½s t,i , ½w t,i > <½s t,j , ½w t,j > j i s t+1,i = ½s t,j + ½s t,i + ½s t,z <½s t,i , ½w t,i > w t+1,i = ½w t,j + ½w t,i + ½w t,z variance reduction step z u Dr. G. Di Fatta 22

Epidemic Protocols in Peer-to-Peer Computing Dr. r. G Giuse - PowerPoint PPT Presentation

NexTech 2011 - AP2PS 2011 The Third International Conference on Advances in P2P Systems, November 20-25, 2011, Lisbon, Portugal Keynote Presentation: Epidemic Protocols in Peer-to-Peer Computing Dr. r. G Giuse iuseppe pe D Di i Fat atta

Serverless networking (peer-to-peer computing) Peer-to-peer models Client-server computing

The Obesity Epidemic The Obesity Epidemic John Ganser, MD, FACS John Ganser, MD, FACS

Peer-to-Peer Computing Peer-to-Peer (P2P) employ distributed resources to perform function in a

THE PEER-TO-PEER NETWORK JOHN NEWBERY @jfnewbery github.com/jnewbery THE PEER-TO-PEER NETWORK

Peer-to-Peer Networks 09 Random Graphs for Peer-to-Peer-Networks Christian Ortolf Technical

Comparing Hybrid Peer-to-Peer Hybrid peer-to-peer systems Systems Beverly Yang and Hector

Epidemic Techniques Milo Polte Summary of First Paper Epidemic Algorithms For Replicated

THE SILENT EPIDEMIC OF TBIS: LISTENING FOR DEPRESSION AND SUICIDE THE SILENT EPIDEMIC OF

Epidemic Multicast (Mike) Yu Cheng Outline Definition, properties of epidemic multicast

Peer to Peer Learning & Support Aims and Objectives of this Workshop Workshop 3: Peer to

Peer-to-Peer Networking and Discovery Technologies Week 6 Whats Peer-to-Peer? A different

Secure Multi-Party Computation Lecture 17 GMW & BGW Protocols MPC Protocols MPC Protocols

From RPC to RMI Protocols for middleware services Protocols for middleware services

Analysis of Security Protocols Gavin Lowe Analysis of Security Protocols 02 Overview Brief

P2P: Distributed Hash Tables Chord + Routing Geometries Nirvan Tyagi CS 6410 Fall16

SpamResist: Making Peer-to-Peer Tagging SpamResist: Making Peer-to-Peer Tagging Systems Robust to

Coaching for high integrity organisational politics and networking Tim Bright 23 rd Annual

BELONGS TO HIGHVIEW SINCE 2012 HIGHVIEW IS AN INDEPENDENT MEDIA COMPANY THAT OPERATES VARIOUS

The Architecture of Uber's Realtime System March 25, 2015 Amos Barreto Danny Yuan

buying a home IN TODAYS MARKE T COMPANY First Name | Lic# 9999999 LOGO CLIE NT

E ff ortless Eventual Consistency Gossip, CRDTs, and Weave Mesh weave works - Outline Theory

Beyond Replicated Storage: Eventually-Consistent Distributed Data Structures Konrad Iwanicki

Developers Now By Aaron Stannard, Founder & CEO Petabridge High Expectations The obvious

Bullying ullying: : Wha hat You u Ne Need d to Know ow Council cil Rock Education on

Epidemic Protocols in Peer-to-Peer Computing Dr. r. G Giuse - PowerPoint PPT Presentation

NexTech 2011 - AP2PS 2011 The Third International Conference on Advances in P2P Systems, November 20-25, 2011, Lisbon, Portugal Keynote Presentation: Epidemic Protocols in Peer-to-Peer Computing Dr. r. G Giuse iuseppe pe D Di i Fat atta

Serverless networking (peer-to-peer computing) Peer-to-peer models Client-server computing

The Obesity Epidemic The Obesity Epidemic John Ganser, MD, FACS John Ganser, MD, FACS

Peer-to-Peer Computing Peer-to-Peer (P2P) employ distributed resources to perform function in a

THE PEER-TO-PEER NETWORK JOHN NEWBERY @jfnewbery github.com/jnewbery THE PEER-TO-PEER NETWORK

Peer-to-Peer Networks 09 Random Graphs for Peer-to-Peer-Networks Christian Ortolf Technical

Comparing Hybrid Peer-to-Peer Hybrid peer-to-peer systems Systems Beverly Yang and Hector

Epidemic Techniques Milo Polte Summary of First Paper Epidemic Algorithms For Replicated

THE SILENT EPIDEMIC OF TBIS: LISTENING FOR DEPRESSION AND SUICIDE THE SILENT EPIDEMIC OF

Epidemic Multicast (Mike) Yu Cheng Outline Definition, properties of epidemic multicast

Peer to Peer Learning &amp; Support Aims and Objectives of this Workshop Workshop 3: Peer to

Peer-to-Peer Networking and Discovery Technologies Week 6 Whats Peer-to-Peer? A different

Secure Multi-Party Computation Lecture 17 GMW &amp; BGW Protocols MPC Protocols MPC Protocols

From RPC to RMI Protocols for middleware services Protocols for middleware services

Analysis of Security Protocols Gavin Lowe Analysis of Security Protocols 02 Overview Brief

P2P: Distributed Hash Tables Chord + Routing Geometries Nirvan Tyagi CS 6410 Fall16

SpamResist: Making Peer-to-Peer Tagging SpamResist: Making Peer-to-Peer Tagging Systems Robust to

Coaching for high integrity organisational politics and networking Tim Bright 23 rd Annual

BELONGS TO HIGHVIEW SINCE 2012 HIGHVIEW IS AN INDEPENDENT MEDIA COMPANY THAT OPERATES VARIOUS

The Architecture of Uber's Realtime System March 25, 2015 Amos Barreto Danny Yuan

buying a home IN TODAYS MARKE T COMPANY First Name | Lic# 9999999 LOGO CLIE NT

E ff ortless Eventual Consistency Gossip, CRDTs, and Weave Mesh weave works - Outline Theory

Beyond Replicated Storage: Eventually-Consistent Distributed Data Structures Konrad Iwanicki

Developers Now By Aaron Stannard, Founder &amp; CEO Petabridge High Expectations The obvious

Bullying ullying: : Wha hat You u Ne Need d to Know ow Council cil Rock Education on

Peer to Peer Learning & Support Aims and Objectives of this Workshop Workshop 3: Peer to

Secure Multi-Party Computation Lecture 17 GMW & BGW Protocols MPC Protocols MPC Protocols

Developers Now By Aaron Stannard, Founder & CEO Petabridge High Expectations The obvious