Heterogeneity and Load Balance in Distributed Hash Tables
Brighten Godfrey, joint work with Ion Stoica
Computer Science Division, UC Berkeley
IEEE INFOCOM, March 15, 2005
The goals
- Distributed Hash Tables partition an ID space among n nodes
  – Typically: each node picks one random ID
  – A node owns the region between its predecessor's ID and its own
  – Some nodes get log n times their fair share of the ID space
- Goal 1: Fair partitioning of the ID space
  – If load is distributed uniformly in the ID space, this yields a load-balanced system
  – Handle the case of heterogeneous node capacities
- Goal 2: Use heterogeneity to our advantage to reduce route length in the overlay that connects the nodes
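Why the single-random-ID scheme is unbalanced: with n uniform IDs on the ring, the largest arc is about (ln n)/n, i.e. roughly ln n times the fair share. A minimal Python simulation (illustrative, not from the talk) reproduces this:

```python
import math, random

def max_share_single_id(n, trials=20):
    """Estimate the maximum share when each of n nodes picks a single
    uniform-random ID on the unit ring (the 'typical' DHT scheme)."""
    worst = 0.0
    for _ in range(trials):
        ids = sorted(random.random() for _ in range(n))
        # Node i owns the arc from its predecessor's ID to its own ID.
        arcs = [ids[0] + 1.0 - ids[-1]] + \
               [ids[i] - ids[i - 1] for i in range(1, n)]
        worst = max(worst, max(arcs) * n)   # normalize by fair share 1/n
    return worst

# The largest arc is ~(ln n)/n, so this prints a value near ln(4096) = 8.3.
print(round(max_share_single_id(4096), 1), "vs ln(n) =", round(math.log(4096), 1))
```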
Model & performance metric
- n nodes
- Each node v has a capacity c_v (e.g. bandwidth)
- Average capacity is 1, so total capacity is n
- Share of node v is
  share(v) = (fraction of ID space that v owns) / (c_v / n)
- Want low maximum share
- Perfect partitioning has maximum share = 1
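As a concrete reading of the share metric, here is a tiny Python helper (illustrative, not from the talk; the numbers in the usage line are made up):

```python
def share(owned_fraction, capacity, n):
    """share(v) = (fraction of ID space v owns) / (c_v / n).
    A node owning exactly its capacity-weighted portion has share 1."""
    return owned_fraction / (capacity / n)

# Hypothetical 100-node system: a capacity-2 node owning 3% of the ID
# space has share 1.5, i.e. 50% more space than its fair portion.
print(share(0.03, 2.0, 100))  # 1.5
```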
Basic Virtual Server Selection
- Standard homogeneous case
  – Each node picks Θ(log n) IDs (like simulating Θ(log n) nodes)
  – Maximum share is O(1) with high probability (w.h.p.) in a homogeneous system
- Heterogeneous case
  – Node v simulates Θ(c_v log n) nodes (discarding low-capacity nodes)
  – Maximum share is O(1) w.h.p. for any capacity distribution
- [Figure: each node's ownership consists of multiple disjoint segments; high- and low-capacity nodes shown]
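A minimal sketch of Basic-VSS ID selection as the slide describes it. Hashing (address, index) so that IDs are verifiable is an assumption here (consistent with the security point on the next slide), and the constant inside Θ(c_v log n) is illustrative:

```python
import hashlib, math

def basic_vss_ids(node_addr, capacity, n):
    """Basic-VSS sketch: a node of capacity c_v simulates
    Theta(c_v log n) virtual servers with independent random IDs."""
    k = max(1, round(capacity * math.log2(n)))  # number of virtual servers
    ids = []
    for i in range(k):
        # Hash (address, index) so IDs can be recomputed and verified.
        h = hashlib.sha1(f"{node_addr}/{i}".encode()).digest()
        ids.append(int.from_bytes(h[:8], "big") / 2.0**64)  # map into [0, 1)
    return ids
```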
Basic-VSS: Problems
- To route between nodes, construct an overlay network
- With Θ(log n) IDs, a node must maintain Θ(log n) times as many overlay connections!
- Other proposals use one ID per node, but...
  – all require reassignment of IDs in response to churn, and load movement is costly
  – none handles heterogeneity directly
  – some can't compute node IDs as a hash of the IP address for security
  – some are limited in the achievable quality of load balance
  – some are complicated
Low Cost Virtual Server Selection
- Pick Θ(c_v log n) IDs for a node of capacity c_v as before...
- ...but cluster them in a random fraction Θ((c_v log n)/n) of the ID space
  – Random starting location r
  – Pick Θ(c_v log n) IDs spaced at intervals of ≈ 1/n (with random perturbation)
- Ownership of the ID space is still in disjoint segments
- Why does this help?
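A minimal sketch of the LC-VSS placement just described; the exact constants and the width of the perturbation are illustrative assumptions, not the paper's precise parameters:

```python
import math, random

def lc_vss_ids(capacity, n, rng=None):
    """LC-VSS sketch: Theta(c_v log n) IDs clustered in a random fraction
    Theta(c_v log n / n) of the unit ring, spaced ~1/n apart with random
    perturbation, so ownership stays in disjoint segments."""
    rng = rng or random.Random()
    k = max(1, round(capacity * math.log2(n)))  # number of IDs
    r = rng.random()                            # random starting location
    spacing = 1.0 / n
    return sorted((r + i * spacing + rng.uniform(-0.5, 0.5) * spacing) % 1.0
                  for i in range(k))
```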
LC-VSS: Overlay Topology
- When building the overlay network, simulate ownership of a contiguous fraction Θ((c_v log n)/n) of the ID space
- [Figure: simulated vs. real ownership; a message is routed to the node simulating ownership of the target ID]
- Routing ends at the node simulating ownership of the target ID, not the real owner
- But clustering of IDs ⇒ the real owner is nearby in ID space ⇒ the route can be completed in O(1) more hops using successor links
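A sketch of the final routing phase under these assumptions. The `Node` class and its fields are hypothetical stand-ins; overlay routing on the simulated ownership (phase 1) is not shown:

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass
class Node:
    """Hypothetical ring node: the disjoint segments it really owns,
    plus its successor on the ID ring."""
    owned: List[Tuple[float, float]]
    successor: Optional["Node"] = None

    def really_owns(self, x: float) -> bool:
        # A segment (a, b) may wrap around the 1.0/0.0 ring boundary.
        return any((a <= x < b) if a <= b else (x >= a or x < b)
                   for a, b in self.owned)

def complete_route(target: float, node: "Node"):
    """Phase 2: overlay routing has delivered the message to the node
    simulating ownership of `target`; because a node's real IDs are
    clustered inside its simulated region, walking successor links
    reaches the real owner in O(1) hops."""
    hops = 0
    while not node.really_owns(target):
        node = node.successor
        hops += 1
    return node, hops
```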
LC-VSS: Theoretical Properties
- Works for any ring-based overlay topology
– Y0: LC-VSS applied to Chord
- Compared to the single-ID case:
  – Node outdegree increases by at most a constant factor
  – Route length increases by at most an additive constant
- Goal 1: Load balance
– Achieves maximum share of 1 + ε for any ε > 0 and any capacity distribution
  ∗ ...under some assumptions: a sufficiently good approximation of n and of the average capacity, and a sufficiently low capacity threshold below which nodes are discarded
– Tradeoff: outdegree depends on ε
Max Share Proof
Lemma 1. If node v has at least one ID in the ring and α = Θ(log n), then (1) v has between αc_v/(γ_c γ_u) − O(1) and αc_v γ_c γ_u + O(1) IDs w.h.p., and (2) v has at least γ_d α(n) − O(1) IDs w.h.p.

Proof: (1) Note that, due to the estimation error parameters, the factor-γ_c lazy update of c̃_v, and the factor-2 lazy update of ñ, we always have c̃_v within a factor γ_c γ_u of c_v and ñ within a factor 2γ_n of n w.h.p. Thus, for some constant k, the number of IDs that v chooses is at most

  ⌊0.5 + c̃_v α(ñ)⌋ ≤ c̃_v α(ñ) + O(1) ≤ γ_c γ_u c_v · k log(2γ_n n) + O(1) ≤ γ_c γ_u c_v α(n) + O(1).

The lower bound follows similarly, noting that we are not concerned with nodes that have been discarded. (2) Similarly, if v has decided to stay in the ring, we must have c̃_v ≥ γ_d, and the bound follows by the above technique.

We now break the ring into frames of length equal to the smallest spacing parameter s_min used by any node. The following lemma implies that s_min ≥ 1/(2γ_n n) w.h.p.

Lemma 2. Let β = (1 − γ_c γ_u γ_d)/(γ_c γ_u). When α ≥ (8γ_n/(βε²)) ln n, each frame contains at least (1 − ε)βαn·s_min − O(1) IDs w.h.p. for any ε > 0.

Proof: Assume that no node has more than one ID in any frame; if this is not the case, we can break the high-capacity nodes for which it is false into multiple "virtual nodes" without disturbing the rest of the proof. Consider any particular frame f. Let X_v be the indicator variable for the event that node v chooses an ID in f, and let X = Σ_v X_v. We wish to lower-bound X. Suppose v chooses m_v points. Since f covers a fraction s_min of the ID space, we have E[X_v] = m_v s_min. By Lemma 1, m_v ≥ αc_v/(γ_c γ_u) − O(1) for nodes v in the ring (call this set R). Thus,

  E[X] = Σ_{v∈R} E[X_v]
       ≥ Σ_{v∈R} s_min (αc_v/(γ_c γ_u) − O(1))    (Lemma 1)
       ≥ −O(1) + (s_min α/(γ_c γ_u)) Σ_{v∈R} c_v
       ≥ −O(1) + (s_min α/(γ_c γ_u)) · (1 − γ_c γ_u γ_d) n    (Claim ??)
       = βαn·s_min − O(1),

with β defined as in the lemma statement. (Note that although Claim ?? was stated in the context of Chord, it applies to our partitioning scheme without modification.) A Chernoff bound tells us that

  Pr[X < (1 − ε)E[X]] < e^{−(βαn·s_min − O(1))ε²/2} = O(e^{−βαn·s_min ε²/2}) < e^{−βαε²/(4γ_n)}    (Lemma ??)
                      = O(n^{−2})

when α ≥ (8γ_n/(βε²)) ln n. Again by Lemma ??, there are at most 2γ_n n frames, so the lemma follows from a union bound over them.

Proof (of Theorem ??): If node v is discarded, its share is 0, so we need only consider nodes in the ring. Such a node v chooses one ID in each of m ≤ αc_v γ_c γ_u + O(1) frames (Lemma 1). We first fix the nodes' choices of the frames in which they place their IDs. Let X_1, ..., X_m be the fractions of the ID space owned by each of node v's IDs. The randomness in the X_i's is over the intra-frame positions of the nodes' IDs, which are chosen independently and uniformly at random. By Lemma 2, we may assume that each frame has at least one ID. Thus, the interval assigned to the i-th ID may span at most one frame boundary, so X_i depends only on the locations of the IDs in its frame and in the counterclockwise-preceding frame. Thus, the odd-indexed X_i's are mutually independent, as are the even-indexed X_i's. We will bound the share of these two groups in the same way, one at a time.

Consider first the odd-indexed X_i's. Break each frame into d buckets of equal size; we'll pick d later. A bucket is occupied when some node other than v chooses an ID inside it, and is empty otherwise. To analyze node v's share of the ID space, we'll count the number of empty buckets counterclockwise-following v's chosen IDs. Define an infinite sequence of random variables Y_j, each of which will be the indicator variable for the event that a particular bucket is occupied. Y_1 corresponds to the bucket counterclockwise-following v's first odd-indexed ID. Suppose Y_j corresponds to the k-th bucket following v's ℓ-th ID. Then we have two cases. (1) If Y_j = 0, then Y_{j+1} corresponds to the next bucket following the same ID. (2) Otherwise, Y_{j+1} corresponds to the first bucket following the next odd-indexed ID, i.e. the (ℓ+2)-th one. If m/2 < ℓ + 2 then we simply set Y_{j+1} = 1. Thus, the number of zeros in the sequence of Y_j's is the number of buckets entirely owned by v's m/2 odd-indexed IDs.

With the goal of upper-bounding the number of zeros, we first deal with dependence among the Y_j's. By Lemma 2 we may assume that each frame has at least r = (1 − ε)β·s_min·n·α(n) − O(1) IDs for sufficiently large α. View Y_1, Y_2, ... as a process. If Y_{j−1} = 1, then we are in Case (2) and Y_j corresponds to a frame independent of those of Y_1, ..., Y_{j−1}, so there are at least r IDs distributed u.a.r. in the frame which may occupy Y_j's bucket. If we are in Case (1), then Y_j's bucket is in the same frame as that of Y_{j−1}, which implies that some of the buckets in that frame are empty, in which case there are at least r IDs distributed u.a.r. in a subset of the frame including Y_j's bucket. This discussion implies that, regardless of the history of the Y_j's, the probability that Y_j = 1 is at least 1 − (1 − 1/d)^r. Formally, we define another sequence of variables Z_j which are independent Poisson trials with success probability p, to be picked below. For any indices j_1, ..., j_k, we have

  Pr[Y_{j_1} = ··· = Y_{j_k} = 1] = Π_{ℓ=1}^{k} Pr[Y_{j_ℓ} = 1 | Y_{j_1} = ··· = Y_{j_{ℓ−1}} = 1]
                                  ≥ Π_{ℓ=1}^{k} (1 − (1 − 1/d)^r)
                                  ≥ (1 − e^{−r/d})^k
                                  = Pr[Z_{j_1} = ··· = Z_{j_k} = 1],

where we have chosen the success probability for the Z_j's to be p = 1 − e^{−r/d}. This implies that an upper bound on the number of 0's in the independent Z_j sequence is also an upper bound on the number of 0's in the dependent Y_j sequence, a fact which we use next.

If we see m/2 ones in the first x Y_j's, then by the definition of the sequence we have seen all the zeros, of which there are at most x − m/2. Thus node v will own at most x − m/2 complete buckets, plus 2·(m/2) partial buckets (one at each end of the m/2 contiguous sequences of complete buckets), for a total of at most x + m/2 buckets due to its m/2 odd-indexed IDs. We now show that we see the required m/2 successes w.h.p. when x = m/(2p(1 − δ)). Let P be the number of 1's in the first x Y_j's, and let P′ be the corresponding value for the Z_j's. By the above discussion we have Pr[P < m/2] ≤ Pr[P′ < m/2], and E[P′] = xp = m/(2(1 − δ)), so

  Pr[P < m/2] ≤ Pr[P′ < m/2] = Pr[P′ < (1 − δ) · m/(2(1 − δ))]
              ≤ e^{−mδ²/(4(1−δ))}    (Chernoff bound)
              ≤ O(e^{−γ_d α δ²/(4(1−δ))})    (Lemma 1, part (2))
              = O(n^{−2})

when α ≥ (8(1 − δ)/(γ_d δ²)) ln n. In this case, counting now both odd- and even-indexed points, node v owns at most m + m/(p(1 − δ)) buckets, each of size s_min/d. Normalizing by v's fair share c_v/n, we have

  share(v) ≤ (1/(c_v/n)) · (m·s_min/d + m·s_min/(d·p(1 − δ))).

Recall that d is arbitrary. Taking the limit as d → ∞, we have d·p → r = (1 − ε)β·s_min·n·α(n) − O(1), so

  share(v) ≤ (1/(c_v/n)) · m·s_min / ((1 − δ)((1 − ε)β·s_min·n·α(n) − O(1)))
           ≤ (1/c_v) · m / ((1 − δ)(1 − ε)(1 − ε′)βα(n))
           ≤ (1/c_v) · (α(n)·c_v·γ_c γ_u + O(1)) / ((1 − δ)(1 − ε)(1 − ε′)βα(n))    (Lemma 1, part (1))
           ≤ (1 + ε″)(γ_c γ_u)² / ((1 − δ)(1 − ε)(1 − γ_c γ_u γ_d))

with probability 1 − O(n^{−2}) for any ε′, ε″ > 0 and sufficiently large n, so by a union bound this is true of all nodes w.h.p. Finally, we require that α is the maximum of the requirement given above and that of Lemma 2; setting δ = ε for convenience of presentation, we have

  max{ 8(1 − ε) ln n/(γ_d ε²), 8γ_n γ_c γ_u ln n/((1 − γ_c γ_u γ_d)ε²) } ≤ 8γ_n γ_c γ_u ln n/((1 − γ_c γ_u γ_d)γ_d ε²),

as required by the theorem.
Simulation
- The contestants
  – Chord: Basic Virtual Server Selection
  – Y0: LC-VSS applied to Chord's overlay topology
- Static simulator
  – Important simplification: nodes know n and the average capacity
  – These would actually be estimated, with some "lazy update" to provide hysteresis
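A sketch of the core measurement such a static simulator performs, reusing the earlier `lc_vss_ids` sketch; the helper assumes the model's convention that average capacity is 1, so node v's fair share is c_v/n:

```python
def max_share(assignments, capacities):
    """Given each node's list of ring IDs and its capacity, compute the
    maximum share over all nodes (ownership = arc from predecessor ID
    to own ID, wrapping around the unit ring)."""
    n = len(capacities)
    owner_ids = sorted((i, v) for v, ids in enumerate(assignments) for i in ids)
    owned = [0.0] * n
    for k, (pos, v) in enumerate(owner_ids):
        prev = owner_ids[k - 1][0] - (1.0 if k == 0 else 0.0)  # wrap at k=0
        owned[v] += pos - prev
    return max(owned[v] / (capacities[v] / n) for v in range(n))

# Usage with the earlier sketch, e.g.:
#   caps = [1.0] * 1024
#   print(max_share([lc_vss_ids(c, 1024) for c in caps], caps))
```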
Simulation: Maximum share
[Plot: maximum share vs. number of nodes (10 to 100,000); curves for Chord with α = log n, Y0 with α = 2 log n, and Y0 with α = 3 log n]
- Parameter: α = number of virtual servers per unit capacity
- Homogeneous capacities shown here
- Chord with α = 1 increases to maximum share ≈ 13.7.
Max Share/Degree Tradeoff
[Plot: maximum share vs. average normalized degree (32 to 1024); curves for Chord and Y0 under homogeneous and SGG capacity distributions]
n = 2048. Points achieved with α ∈ {1, 2, 4, 8, 16} for Chord, and α ∈ {1, 2, 4, ..., 128} for Y0.
Goal 2: Exploit Heterogeneity
- Even high-capacity nodes have a single set of overlay links
- Make use of the unused capacity: pick a denser set of links
- In Chord with α = 1: Θ(c_v log n) total outlinks
  – Θ(log n) links in each of Θ(c_v) finger tables (one per virtual server)
- In our scheme: Θ(c_v log n) total outlinks
  – ...all in one dense finger table (see the sketch below)
  – More structured topology ⇒ reduced route length
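One plausible way to realize a dense finger table, sketched in Python. The geometric base b = 2^(1/c_v) is my illustrative assumption, not the paper's exact construction; it merely shows how a capacity-c_v node ends up with ~c_v log₂ n fingers in a single table:

```python
import math

def dense_finger_targets(my_id, capacity, n):
    """Ordinary Chord puts fingers at ring distances 1/2, 1/4, ..., ~1/n
    (about log2 n of them). Sweeping distances with base b = 2**(1/c_v)
    instead yields roughly c_v * log2(n) fingers in one dense table."""
    b = 2.0 ** (1.0 / max(capacity, 1e-9))
    targets, dist = [], 1.0 / n
    while dist < 1.0:                       # geometric sweep of distances
        targets.append((my_id + dist) % 1.0)
        dist *= b
    return targets
```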
Simulation: Effect of heterogeneity
[Plot: average route length vs. degree of power-law capacity distribution (1.5 to 3.5); curves for Chord with α = 1 and Y0 with α = 2 log n]
Route length vs. capacity distribution in a 16,384-node system.
Simulation: Effect of heterogeneity
[Plot: average route length vs. number of nodes (10 to 10,000); curves for Chord and Y0 under homogeneous and SGG capacity distributions]
- SGG capacity distribution from real Gnutella hosts
- Asymptotic route lengths compared to the homogeneous case:
  – Chord: ≤ 23% shorter
  – Y0: ≥ 55% shorter
Simulation: Congestion
[Plot: maximum congestion vs. number of nodes (10 to 10,000); curves for Chord and Y0 under homogeneous and SGG capacity distributions]
Conclusion
- Costs
– Some additional overhead, especially when particularly good balance is desired
– Incurs additional load movement when the number of nodes or the average capacity changes by a constant factor
– Requires estimates of n and the average capacity
– Assumes a uniform distribution of load in the ID space
- Benefits
– A simple way to achieve good load balance at low cost
– Compatible with any ring-based overlay
– Adds flexibility in neighbor selection to any overlay
– Takes advantage of heterogeneity to reduce route length
Backup slides
Simulation: Degree
[Plot: average normalized degree vs. number of virtual servers α (5 to 30); curves for Chord and Y0]
Degree of a node = number of links to other nodes
Simulation: Max Share vs. Capacity Distribution
[Plot: maximum share vs. exponent of power-law capacity distribution (1.5 to 3.5); curves for Chord with α = log n and Y0 with α ∈ {log n, 2 log n, 3 log n}]