Characterizing Load Imbalance in Real-World Networked Caches Qi - - PowerPoint PPT Presentation

characterizing load imbalance in real world networked
SMART_READER_LITE
LIVE PREVIEW

Characterizing Load Imbalance in Real-World Networked Caches Qi - - PowerPoint PPT Presentation

Characterizing Load Imbalance in Real-World Networked Caches Qi Huang Cornell U, Facebook Helga Gudmundsdottir Emory U, Reykjavik U Ymir Vigfusson Emory U, Reykjavik U Daniel A. Freedman Technion Ken Birman Cornell U Robbert van Renesse


slide-1
SLIDE 1

Characterizing Load Imbalance in Real-World Networked Caches

Qi Huang Cornell U, Facebook Helga Gudmundsdottir Emory U, Reykjavik U Ymir Vigfusson Emory U, Reykjavik U Daniel A. Freedman Technion Ken Birman Cornell U Robbert van Renesse Cornell U

slide-2
SLIDE 2

Networked Caches in Real-World Web Stack

Web Servers Backend Store Caches

Get X Get X X X

slide-3
SLIDE 3

Partitioning Data

Hash(A)

id: A type: USER name: Helga

slide-4
SLIDE 4

Partitioning Data

Hash(1)

Shard 1

A

slide-5
SLIDE 5
  • Hashing schemes balance quantity of content per server.
  • However, content popularity varies!
  • Need to provision servers to handle peak load.
  • Lack of real-world data to verify suspicions.

Causes of Load Imbalance

slide-6
SLIDE 6

Questions

  • What is the state of load imbalance?
  • What contributes to load imbalance?
  • How effective are current techniques?
  • How might they be improved?

Sampled production traffic in TAO, serving Facebook’s social graph.

slide-7
SLIDE 7

What is the State of Load Imbalance?

Significant load imbalance observed in TAO.

slide-8
SLIDE 8

Skewed content popularity observed at both shard and object level.

Possible Causes: Content Popularity

slide-9
SLIDE 9

Possible Causes: Hot Objects

Very hot objects alone are not a major cause of load imbalance.

slide-10
SLIDE 10

Popular shards receive significantly higher load, compared to hot objects.

Possible Causes: Hot Shards

slide-11
SLIDE 11

How Effective are Current Techniques?

  • Hashing: Balance number of shards across servers.
  • Replication: Divide load of popular shards among servers.
  • Metric: Maximum load versus average.

Replication Hashing libketama TAO Perfect None TAO Perfect

slide-12
SLIDE 12
  • Where We Are Today

Replication Hashing libketama TAO Perfect None 1.53 TAO Perfect

slide-13
SLIDE 13
  • Where We Are Today

Replication Hashing libketama TAO Perfect None 1.53 TAO Perfect 1.00

slide-14
SLIDE 14
  • Where We Are Today

Replication Hashing libketama TAO Perfect None 1.53 TAO 1.25 Perfect 1.00

slide-15
SLIDE 15
  • Hashing Schemes w/o Replication

Replication Hashing libketama TAO Perfect None 1.53 TAO 1.25 Perfect 1.00

slide-16
SLIDE 16
  • Hashing Schemes w/o Replication

Replication Hashing libketama TAO Perfect None 1.53 1.46 TAO 1.25 Perfect 1.00

slide-17
SLIDE 17
  • Hashing Schemes w/o Replication

Replication Hashing libketama TAO Perfect None 1.53 1.46 1.34 TAO 1.25 Perfect 1.00

slide-18
SLIDE 18
  • libketama Hashing

Replication Hashing libketama TAO Perfect None 1.53 1.46 1.34 TAO 1.25 Perfect 1.00

slide-19
SLIDE 19
  • libketama Hashing

Replication Hashing libketama TAO Perfect None 1.53 1.46 1.34 TAO 1.53 1.25 Perfect 1.00

slide-20
SLIDE 20
  • libketama Hashing

Replication Hashing libketama TAO Perfect None 1.53 1.46 1.34 TAO 1.53 1.25 Perfect 1.41 1.00

slide-21
SLIDE 21
  • TAO: Room For Improvement

Replication Hashing libketama TAO Perfect None 1.53 1.46 1.34 TAO 1.53 1.25 Perfect 1.41 1.00

slide-22
SLIDE 22
  • TAO: Room For Improvement

Replication Hashing libketama TAO Perfect None 1.53 1.46 1.34 TAO 1.53 1.25 Perfect 1.41 1.18 1.00

slide-23
SLIDE 23
  • TAO: Room For Improvement

Replication Hashing libketama TAO Perfect None 1.53 1.46 1.34 TAO 1.53 1.25 1.17 Perfect 1.41 1.18 1.00

slide-24
SLIDE 24
  • Streaming Replication

Replication Hashing libketama TAO Perfect None 1.53 1.46 1.34 TAO 1.53 1.25 1.17 Perfect 1.41 1.18 1.00

slide-25
SLIDE 25
  • Streaming Replication

Replication Hashing libketama TAO Perfect None 1.53 1.46 1.34 TAO 1.53 1.25 1.17 Perfect 1.41 1.18 1.00 Streaming with Perfect Hashing achieves max/avg 1.12

slide-26
SLIDE 26

Summary and Future Work

  • Characterized how hashing and replication

affect load imbalance.

  • Can streaming algorithms replicate content

before its popularity surges?

  • Can we predict popularity spikes and

prevent hotspots?

Questions?