Exploring and Improving BitTorrent Topologies Christian Decker ETH - - PowerPoint PPT Presentation

exploring and improving bittorrent topologies
SMART_READER_LITE
LIVE PREVIEW

Exploring and Improving BitTorrent Topologies Christian Decker ETH - - PowerPoint PPT Presentation

Exploring and Improving BitTorrent Topologies Christian Decker ETH Zurich Distributed Computing Group www.disco.ethz.ch BitTorrent Filesharing protocol Peers form Ad-Hoc networks (swarm) Trackers to join the swarm Trading


slide-1
SLIDE 1

Exploring and Improving BitTorrent Topologies

Christian Decker

ETH Zurich – Distributed Computing Group – www.disco.ethz.ch

slide-2
SLIDE 2

BitTorrent

  • Filesharing protocol
  • Peers form Ad-Hoc networks (swarm)
  • Trackers to join the swarm
  • Trading pieces between peers
slide-3
SLIDE 3

BitTorrent: handshake

  • Peers exchange handshakes before trading
  • Protocol identifier
  • Protocol extensions
  • peer id
  • Torrent info hash
slide-4
SLIDE 4

Exploring

slide-5
SLIDE 5

Exploring Swarm Topologies

Random topology:

  • Tracker return random peers
  • Peers chose random neighbors
slide-6
SLIDE 6

Exploring Swarm Topologies

Random topology:

  • Tracker return random peers
  • Peers chose random neighbors

Studies to explore BitTorrent topologies:

  • Experimental setup
  • Traffic log
  • Instrumented clients
slide-7
SLIDE 7

Exploring Swarm Topologies

Random topology:

  • Tracker return random peers
  • Peers chose random neighbors

Studies to explore BitTorrent topologies:

  • Experimental setup
  • Traffic log
  • Instrumented clients
  • Live swarms
slide-8
SLIDE 8

Scanning method

A B ? S

” H e l l

  • I

’ m A ”

slide-9
SLIDE 9

Challenges when moving to real swarms

  • Scanning takes time

500 1000 1500 2000 2500 3000 3500 0.0 0.2 0.4 0.6 0.8 1.0

CDF of connection uptimes

Uptime [s] CDF of connections

slide-10
SLIDE 10

Challenges when moving to real swarms

  • Scanning takes time
  • Invisible part of a swarm

500 1000 1500 2000 2500 3000 3500 0.0 0.2 0.4 0.6 0.8 1.0

CDF of connection uptimes

Uptime [s] CDF of connections

slide-11
SLIDE 11

Challenges when moving to real swarms

  • Scanning takes time
  • Invisible part of a swarm
  • Error detection

500 1000 1500 2000 2500 3000 3500 0.0 0.2 0.4 0.6 0.8 1.0

CDF of connection uptimes

Uptime [s] CDF of connections

slide-12
SLIDE 12

Evaluation: topology sample size

58% peers cannot be scanned

42% 58%

slide-13
SLIDE 13

Evaluation: topology sample size

58% peers cannot be scanned

42% 58%

But we can scan either endpoint of a connection:

67% 33%

slide-14
SLIDE 14

Evaluation: coverage

How fast can we scan for all possible connections?

200 400 600 800 1000 Swarm Size 20 40 60 80 100 Coverage (%)

Concurrency 1 Concurrency 2 Concurrency 3 Concurrency 5

slide-15
SLIDE 15

Evaluation: coverage

How fast can we scan for all possible connections?

100 200 300 400 500 Scan time [s] 20 40 60 80 100 Coverage (%)

Concurrency 1 Concurrency 2 Concurrency 3 Concurrency 4 Concurrency 5

slide-16
SLIDE 16

Locality

Trading with random peers, that may be halfway around the globe. Closer peers may be available. σ(a, b) =

  • 1

a and b are connected

  • therwise
slide-17
SLIDE 17

Locality

Trading with random peers, that may be halfway around the globe. Closer peers may be available. σ(a, b) =

  • 1

a and b are connected

  • therwise

L =

  • a,b σ(a, b) · d(a, b)
  • a,b σ(a, b)

· E[D]−1

slide-18
SLIDE 18

Locality

Trading with random peers, that may be halfway around the globe. Closer peers may be available. σ(a, b) =

  • 1

a and b are connected

  • therwise

L =

  • a,b σ(a, b) · d(a, b)
  • a,b σ(a, b)

· E[D]−1 ? ≈ 1

slide-19
SLIDE 19

Locality: evaluation

  • σ(a, b) provided from scanning method
  • d(a, b) provided by MaxMind GeoIP Database
slide-20
SLIDE 20

Locality: evaluation

  • σ(a, b) provided from scanning method
  • d(a, b) provided by MaxMind GeoIP Database
  • 33 recently uploaded torrents
  • International torrents
  • 50-500 peers
slide-21
SLIDE 21

Locality: evaluation

  • σ(a, b) provided from scanning method
  • d(a, b) provided by MaxMind GeoIP Database
  • 33 recently uploaded torrents
  • International torrents
  • 50-500 peers

BitTorrent is not locality aware! L = 1.062 > 1

slide-22
SLIDE 22

Improving

slide-23
SLIDE 23

Suggesting

Peer Exchange (PEX):

  • Reduce tracker load
  • Increase trading partners
slide-24
SLIDE 24

Suggesting

Peer Exchange (PEX):

  • Reduce tracker load
  • Increase trading partners
  • Suggest nearby peers

B S

”You’ll like A”

slide-25
SLIDE 25

Suggesting peers with PEX

1 Identify new peers 2 Find nearby peers 3 Connect to new peers 4 Send suggestions as PEX message

slide-26
SLIDE 26

Suggesting peers with PEX

+ No special access + No shaping or blocking + Widely supported + ISPs get information for free

slide-27
SLIDE 27

Suggesting peers with PEX

+ No special access + No shaping or blocking + Widely supported + ISPs get information for free

  • Limited influence
slide-28
SLIDE 28

How are we doing?

  • L: 1.062 → 0.994.
  • 6.3% improvement

Connection length comparison

Connection length [km] Probability 2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 0.00 0.05 0.10 0.15 0.20 0.25 0.30

  • Influenced Swarms

Untouched Swarms

slide-29
SLIDE 29

Conclusion

A B ? S

”Hello I’m A”

slide-30
SLIDE 30

Conclusion

A B ? S

”Hello I’m A”

B S

”You’ll like A”

Connection length comparison

Connection length [km] Probability 2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 0.00 0.05 0.10 0.15 0.20 0.25 0.30

  • Influenced Swarms

Untouched Swarms

slide-31
SLIDE 31

Thank you, questions?