A Content Propagation Metric for Efficient Content Distribution Ryan - - PowerPoint PPT Presentation

a content propagation metric for efficient content
SMART_READER_LITE
LIVE PREVIEW

A Content Propagation Metric for Efficient Content Distribution Ryan - - PowerPoint PPT Presentation

A Content Propagation Metric for Efficient Content Distribution Ryan S. Peterson ! , Bernard Wong ! , and Emin Gn Sirer ! Department of Computer Science, Cornell University School of Computer Science, University of Waterloo !


slide-1
SLIDE 1

A Content Propagation Metric for Efficient Content Distribution

Ryan S. Peterson†!, Bernard Wong‡!, and Emin Gün Sirer†!

† Department of Computer Science, Cornell University ‡ School of Computer Science, University of Waterloo ! United Networks, LLC

August 18, 2011

slide-2
SLIDE 2

Content Distribution

  • rigin server

cache servers end users

slide-3
SLIDE 3

BW in Client-Server

  • rigin server

cache servers end users

slide-4
SLIDE 4

BW in

  • rigin server

cache servers end users

Peer-to-Peer

in swarms

slide-5
SLIDE 5

BW in

  • rigin server

cache servers end users in swarms

Antfarm

slide-6
SLIDE 6
  • rigin server

cache servers end users in swarms

Goal

Efficiently use all available bandwidth

slide-7
SLIDE 7

Problem Definition

  • The general multi-swarm content

distribution problem

  • given: hosts, swarms, and swarm memberships
  • find: allocation of each host’s upload

bandwidth among its swarms that maximizes system-wide bandwidth

slide-8
SLIDE 8

Approach

Enables each host to measure its impact on each swarm and adjust its bandwidth allocations accordingly New metric that steers hosts toward a globally efficient allocation of resources

slide-9
SLIDE 9

Approach

Content Propagation Metric

New metric that steers hosts toward a globally efficient allocation of resources

slide-10
SLIDE 10

Outline

The CPM

V-Formation

Evaluation

slide-11
SLIDE 11

Benefit of a Block

p s1 s2 p’s choice: upload the next block to s1 or s2? Which swarm will benefit more?

slide-12
SLIDE 12

Determining Benefit

  • What block p uploads
  • Distribution of blocks in the swarms
  • Sizes of the swarms
  • Network conditions among peers
  • The direct recipient of p’s block

Use history to predict the future

slide-13
SLIDE 13

Intuition

p s1 s2 Measure how “fast” p’s blocks propagate in each swarm Use the result as an estimate of the benefit that the swarms derive from p’s blocks

slide-14
SLIDE 14

Content Propagation Metric

p s1 s2 Block propagation bandwidth: rate that an uploaded block propagates in a fixed time interval τ CPM: rolling average of a peer’s recent block propagation bandwidths for a swarm

9/τ 17/τ

slide-15
SLIDE 15

Using the CPM

  • Each host measures random uploaded blocks

to maintain a CPM value for each swarm

  • Hosts upload to swarms with the largest CPM

values when faced with competing requests

  • Hosts proactively probe new swarms and

swarms with stale CPM values

slide-16
SLIDE 16

CPM Case Study

s1 s2

CPM time

slide-17
SLIDE 17

CPM Case Study

p1 p2 s1 s2 competition for block propagation

CPM time

slide-18
SLIDE 18

CPM Case Study

p1 p2 s1 s2

bandwidth from cache time

slide-19
SLIDE 19

CPM Overview

  • Identifies neediest swarms
  • Easy to measure
  • Can allocate bandwidth from a single

server

  • Accounts for interference from

competing hosts

slide-20
SLIDE 20

Outline

The CPM

V-Formation

Evaluation

slide-21
SLIDE 21

V-Formation

  • Based on our hybrid architecture
  • A logically centralized coordinator

provides efficient bookkeeping

  • A token protocol enables the coordinator

to track blocks and monitor peers

slide-22
SLIDE 22

Coordinator

  • Measures swarm dynamics
  • tracks block transfers based on spent tokens
  • Computes peers’ CPM values
  • periodically sends updates to peers
  • Provides accountability
  • detects and blocks misbehaving peers
slide-23
SLIDE 23

Wire Protocol Goals

  • Track block transfers among peers
  • Disseminate CPM values and peer lists
  • Enforce peer behavior
slide-24
SLIDE 24

s1

Wire Protocol

coordinator

join s1 peerlist

slide-25
SLIDE 25

s1

Wire Protocol

coordinator

get tokens tokens

slide-26
SLIDE 26

s1

Wire Protocol

coordinator

want block block

slide-27
SLIDE 27

s1

Wire Protocol

coordinator

token deposit tokens

slide-28
SLIDE 28

coordinator’s state b1

time

s1

Wire Protocol

coordinator

slide-29
SLIDE 29

coordinator’s state b1

time

s1

Wire Protocol

coordinator

CPM value announce

slide-30
SLIDE 30

Coordinator Design

web server distributed, shared state web server web server processor processor processor processor

handle peer requests, record block propagation data stores membership info, propagation data, and CPMs continuously process block propagation data

slide-31
SLIDE 31

Coordinator State

  • Soft state stored in memcached
  • Swarm: peers, number of blocks
  • Peers: addr, swarms, block propagation

bandwidths, CPMs

  • Blocks: swarm, propagation graph with

timestamped, peer-identified nodes

  • Updated via atomic CAS operations
slide-32
SLIDE 32

Outline

The CPM

V-Formation

Evaluation

slide-33
SLIDE 33

Evaluation

  • Built and deployed

V-Formation as a video- sharing service called FlixQ

  • Uses the CPM to achieve high performance
  • Coordinator scales to large deployments
slide-34
SLIDE 34

Experimental Setup

  • Coordinator on Amazon EC2
  • 380 peers on PlanetLab with realistic

bandwidth capacities

  • 200 swarms based on IMDb movie

popularities and sizes

  • 20% of peers belong to multiple swarms
  • 2 caches with different subsets of content
slide-35
SLIDE 35

End-to-End Performance

BitTorrent Antfarm V-Formation

1500 3000 4500 6000 aggregate bandwidth (KB/s) time (s) 1600 1200 800 400

slide-36
SLIDE 36

Performance of Heuristics

2000 4000 6000 8000 aggregate bandwidth (KB/s)

V-Formation Largest swarm Global rarest Random Smallest swarm

time (s) 1000 2000 3000

slide-37
SLIDE 37

100 200 300 400 500 2500 5000 7500 10000 coordinator bandwidth (KB/s) number of peers

Scalability

bandwidth state size

30 25 20 15 10 5 coordinator state (MB)

slide-38
SLIDE 38

Related Work

  • Content Distribution Networks
  • Antfarm, Akamai, CoBlitz, CoDeeN, ECHOS, Coral, Slurpie,

YouTube, Hulu, GridCast, Tribler, Joost, Huang et al. 2008, Freedman et al. 2008, ...

  • P2P Swarming
  • BitTorrent, BitTyrant, PropShare, BitTornado, BASS,

Annapureddy et al. 2007, Guo et al. 2005, Pouwelse et al. 2005, Zhang et al. 2011, OneSwarm, ...

  • Incentives and microcurrencies
  • Dandelion, BAR Gossip, Samsara, Karma, SHARP

, PPay, Kash et al. 2007, Levin et al. 2009, iOwe, ...

slide-39
SLIDE 39

Conclusions

  • New hybrid approach for efficient

bandwidth allocation

  • Decentralized metric enables hosts to

measure their global benefit

  • Centralized implementation drives hosts

toward globally efficient use of resources

http://flixq.com