gossip based peer sampling
play

Gossip-based peer sampling Mateusz Fedoryszak on the base of M. - PowerPoint PPT Presentation

Gossip-based peer sampling Mateusz Fedoryszak on the base of M. Jelasity, S. Voulgaris, R. Guerraoui, A.-M. Kermarrec, and M. van Steen: Gossip - based peer sampling, ACM Transactions on Computer Systems, vol. 25, no. 3, August 2007,


  1. Gossip-based peer sampling Mateusz Fedoryszak on the base of M. Jelasity, S. Voulgaris, R. Guerraoui, A.-M. Kermarrec, and M. van Steen: “Gossip - based peer sampling,” ACM Transactions on Computer Systems, vol. 25, no. 3, August 2007, article no. 8.

  2. The fount of all gossip • Each node has a part of Node 1 the overal knowledge • Information periodically Node 5 Node 2 exchanged • High scalability and Node 4 Node 3 fault tolerance

  3. Who is my neighbour? • We need a method of sampling from a set of nodes with an uniform distribution. • Implement as a gossip-based protocol itself. • Create a generic protocol, then instantiate and evaluate variations.

  4. The knowledge • Each node has a list of c descriptors. • A descriptor is a pair of peer’s IP address and descriptor’s age. • During information exchange, a node sends its own IP address with age 0 and c/2 – 1 of other descriptors it has, randomly selected, not using H oldest.

  5. The cycle • During each cycle a node initiates just one information exchange. • May receive many exchange requests. • At the end of the cycle, all descriptors’ ages are incremented.

  6. Merging • Add recieved descriptors to your own view • Remove duplicates (leave fresher descriptors) • Remove at most H oldest items • Remove at most S items sent to a peer • A care is taken to make view eventually contain exactly c items.

  7. Parameters • Peer selection – selectPeer() • View selection – S – swapping – H – healing • View propagation – Push – Pull

  8. Is it cool enough? • Randomness • Load balancing • Fault tolerance

  9. Rolling the dice • Treat sampled peers as a number sequence • Test randomness using tests defined in MARSAGLIA, G. 1995. The Marsaglia random number CDROM including the Diehard battery of tests of randomness . Florida State University. • Result: only one failing test

  10. The big picture • Treat the network as a directed graph • Vertices ≡ Nodes • There is an edge ( a , b ) iff a stores the descriptor of b

  11. Convergence • Initial overlay – Growing – Lattice – Random • All scenarios lead to a consistent network except push protocol with growing scenario

  12. Do pull!

  13. Indegree standard deviation

  14. Converged indegree distribution

  15. Won’t it blow up? • Catastrophic failure • Churn – 1% – 30% • Bootstrapping – Central – Random

  16. Cluster partition

  17. Catastrophic failure recovery

  18. 1% churn – degree standard deviation

  19. 1% churn – dead links

  20. 30% churn

  21. Findings • It works well • Use push and pull • Swapper for good load balancing • Healer for fault tolerance

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend