Gossip-based peer sampling Mateusz Fedoryszak on the base of M. - - PowerPoint PPT Presentation

gossip based peer sampling
SMART_READER_LITE
LIVE PREVIEW

Gossip-based peer sampling Mateusz Fedoryszak on the base of M. - - PowerPoint PPT Presentation

Gossip-based peer sampling Mateusz Fedoryszak on the base of M. Jelasity, S. Voulgaris, R. Guerraoui, A.-M. Kermarrec, and M. van Steen: Gossip - based peer sampling, ACM Transactions on Computer Systems, vol. 25, no. 3, August 2007,


slide-1
SLIDE 1

Gossip-based peer sampling

Mateusz Fedoryszak

  • n the base of
  • M. Jelasity, S. Voulgaris, R. Guerraoui, A.-M. Kermarrec, and
  • M. van Steen: “Gossip-based peer sampling,” ACM

Transactions on Computer Systems, vol. 25, no. 3, August 2007, article no. 8.

slide-2
SLIDE 2

The fount of all gossip

Node 1 Node 2 Node 3 Node 4 Node 5

  • Each node has a part of

the overal knowledge

  • Information periodically

exchanged

  • High scalability and

fault tolerance

slide-3
SLIDE 3

Who is my neighbour?

  • We need a method of sampling from a set of

nodes with an uniform distribution.

  • Implement as a gossip-based protocol itself.
  • Create a generic protocol, then instantiate and

evaluate variations.

slide-4
SLIDE 4

The knowledge

  • Each node has a list of c descriptors.
  • A descriptor is a pair of peer’s IP address and

descriptor’s age.

  • During information exchange, a node sends its
  • wn IP address with age 0 and c/2 – 1 of other

descriptors it has, randomly selected, not using H oldest.

slide-5
SLIDE 5

The cycle

  • During each cycle a node initiates just one

information exchange.

  • May receive many exchange requests.
  • At the end of the cycle, all descriptors’ ages

are incremented.

slide-6
SLIDE 6

Merging

  • Add recieved descriptors to your own view
  • Remove duplicates (leave fresher descriptors)
  • Remove at most H oldest items
  • Remove at most S items sent to a peer
  • A care is taken to make view eventually

contain exactly c items.

slide-7
SLIDE 7

Parameters

  • Peer selection

– selectPeer()

  • View selection

– S – swapping – H – healing

  • View propagation

– Push – Pull

slide-8
SLIDE 8

Is it cool enough?

  • Randomness
  • Load balancing
  • Fault tolerance
slide-9
SLIDE 9

Rolling the dice

  • Treat sampled peers as a number sequence
  • Test randomness using tests defined in

MARSAGLIA, G. 1995. The Marsaglia random number CDROM including the Diehard battery

  • f tests of randomness. Florida State

University.

  • Result: only one failing test
slide-10
SLIDE 10

The big picture

  • Treat the network as a directed graph
  • Vertices ≡ Nodes
  • There is an edge (a, b) iff a stores the

descriptor of b

slide-11
SLIDE 11

Convergence

  • Initial overlay

– Growing – Lattice – Random

  • All scenarios lead to a consistent network

except push protocol with growing scenario

slide-12
SLIDE 12

Do pull!

slide-13
SLIDE 13

Indegree standard deviation

slide-14
SLIDE 14

Converged indegree distribution

slide-15
SLIDE 15

Won’t it blow up?

  • Catastrophic failure
  • Churn

– 1% – 30%

  • Bootstrapping

– Central – Random

slide-16
SLIDE 16

Cluster partition

slide-17
SLIDE 17

Catastrophic failure recovery

slide-18
SLIDE 18

1% churn – degree standard deviation

slide-19
SLIDE 19

1% churn – dead links

slide-20
SLIDE 20

30% churn

slide-21
SLIDE 21

Findings

  • It works well
  • Use push and pull
  • Swapper for good load balancing
  • Healer for fault tolerance