CompSci 514: Computer Networks Lecture 10: BGP problems Xiaowei - - PowerPoint PPT Presentation

compsci 514 computer networks lecture 10 bgp problems
SMART_READER_LITE
LIVE PREVIEW

CompSci 514: Computer Networks Lecture 10: BGP problems Xiaowei - - PowerPoint PPT Presentation

CompSci 514: Computer Networks Lecture 10: BGP problems Xiaowei Yang 1 Today Known problems of BGP Multi-homing Instability Delayed convergence Slow failover Discussing fixes Root cause, ghost flushing etc. 2


slide-1
SLIDE 1

1

CompSci 514: Computer Networks Lecture 10: BGP problems

Xiaowei Yang

slide-2
SLIDE 2

2

Today

  • Known problems of BGP

– Multi-homing – Instability – Delayed convergence

  • Slow failover
  • Discussing fixes

– Root cause, ghost flushing etc.

slide-3
SLIDE 3

Background on the paper

  • Another ACM SIGCOMM test of time

award paper

  • A first large scale study of BGP traffic
  • Motivated much improvement to BGP

3

IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 6, NO. 5, OCTOBER 1998 515

Internet Routing Instability

Craig Labovitz, Student Member, IEEE, G. Robert Malan, Student Member, IEEE, and Farnam Jahanian, Member, IEEE

Abstract—This paper examines the network interdomain rout- ing information exchanged between backbone service providers at the major U.S. public Internet exchange points. Internet rout- ing instability, or the rapid fluctuation of network reachability information, is an important problem currently facing the In- ternet engineering community. High levels of network instability can lead to packet loss, increased network latency and time to

  • convergence. At the extreme, high levels of routing instability

have led to the loss of internal connectivity in wide-area, national

  • networks. In this paper, we describe several unexpected trends

in routing instability, and examine a number of anomalies and pathologies observed in the exchange of inter-domain routing

  • information. The analysis in this paper is based on data collected

from BGP routing messages generated by border routers at five

  • f the Internet core’s public exchange points during a nine month
  • period. We show that the volume of these routing updates is

several orders of magnitude more than expected and that the majority of this routing information is redundant, or pathological. Furthermore, our analysis reveals several unexpected trends and ill-behaved systematic properties in Internet routing. We finally posit a number of explanations for these anomalies and evaluate their potential impact on the Internet infrastructure. flaps have led to the transient loss of connectivity for large portions of the Internet. Overall, instability has three primary effects: increased packet loss, delays in the time for network convergence, and additional resource overheard (memory, CPU, etc.) within the Internet infrastructure. The Internet is comprised of a large number of intercon- nected regional and national backbones. The large public exchange points are often considered the “core” of the Internet, where backbone service providers peer, or exchange traffic and routing information with one another. Backbone service providers participating in the Internet core must maintain a complete map, or default-free routing table, of all globally vis- ible network-layer addresses reachable throughout the Internet. The Internet is divided into a large number of different re- gions of administrative control commonly called autonomous

  • systems. These autonomous systems (AS’s) usually have dis-

tinct routing policies and connect to one or more remote AS’s at private or public exchange points. AS’s are traditionally composed of network service providers or large organizational

slide-4
SLIDE 4

4

Failover

  • BGP is designed for scaling more than fast

failover

– Many mechanisms favor this balance – Route flap damping, for example.

  • If excess routing changes (flapping), ignore for some

time.

  • Has unexpected effects on convergence times.

– Route advertisement/withdrawal timers in the 30 second range – Effect: tens of seconds to many minutes to recover from simple failures. – 15-30 minute outages not uncommon.

slide-5
SLIDE 5

5

Multi-homing

  • Connect to multiple providers

– Goal: Higher availability, more capacity

  • Problems:

– Provider-based addressing breaks – Everyone needs their own address space

slide-6
SLIDE 6

6

Multi-homing increases routing table size

Mutil-home.com 128.0.0.0/8 204.0.0.0/8 204.1.0.0/16 ISP2 ISP1 You can reach 128.0.0.0/8 And 204.1.0.0/16 via ISP1 ISP3 204.1.0.0/16 ISP1 204.1.0.0/16 128.0.0.0/8 ISP1 204.1.0.0/16 ISP2 204.0.0.0/8 ISP2

slide-7
SLIDE 7

7

Global routing tables continue to grow

Source: http://bgp.potaroo.net/as6447/

slide-8
SLIDE 8

8

Other BGP problems

  • Convergence: BGP may explore many

routes before finding the right new one.

– Labovitz et al., SIGCOMM 2000

  • Correctness: routes may not be valid, visible,
  • r loop-free.
  • Security: There is none!

– Some providers filter what announcements their customers can make. Not all do. – See paper discussion site for pointers

slide-9
SLIDE 9

9

Measurement studies

  • Two papers (measurement)

– End-to-end traffic – Routing messages

  • Experimental techniques
  • Results
slide-10
SLIDE 10

10

Internet Routing Instability

  • Goals: how often BGP sends updates to change

routes

  • Methodology:

– Analyzing BGP logs for a long time

slide-11
SLIDE 11

Terms

  • WADiff: withdrawal à announcement
  • AADiff: announcement à announcement
  • WADup: same route withdrawal à

announcement

  • AADup: same route announcement à

announcement

  • WWDup: same route withdrawal à

withdrawal

10/2/18 CPS 214 11

slide-12
SLIDE 12

Observed pathologies

  • Repeated WWDup, WADup, AADup
  • Why are they pathologies?

10/2/18 CPS 214 12

slide-13
SLIDE 13

10/2/18 13

  • Majority of BGP updates are WWDup
  • WWDup belong to ASes that never announce them
  • Why?

– Stateless BGP, does not remember what have sent to peers – Send withdrawals to all peers

slide-14
SLIDE 14

Possible origins of instability

  • Stateless BGP
  • Physical link errors
  • Unjittered timers
  • IGP, BGP interactions
  • Conflicting routing policies

10/2/18 14

slide-15
SLIDE 15

10/2/18 15

Data analysis techniques

  • Time series analysis
  • Frequency analysis

– Fast Fourier transform – Maximum entropy spectral estimation

  • Different estimation methods, but both

find significant frequencies at seven days, and 24 hours

slide-16
SLIDE 16

Main results

  • Much more updates than expected

– 99% is pathological. Impressive! – A taxonomy to analyze pathologies

  • Speculation of causes

– Configuration errors, router bugs – Correlate with traffic load, perhaps due to router architectures – Open research questions: root cause of updates

  • Motivated much follow-up work

10/2/18 16

slide-17
SLIDE 17

17

IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 9, NO. 3, JUNE 2001 293

Delayed Internet Routing Convergence

Craig Labovitz, Member, IEEE, Abha Ahuja, Member, IEEE, Abhijit Bose, and Farnam Jahanian

Abstract—This paper examines the latency in Internet path failure, failover, and repair due to the convergence properties of interdomain routing. Unlike circuit-switched paths which exhibit failover on the order of milliseconds, our experimental mea- surements show that interdomain routers in the packet-switched Internet may take tens of minutes to reach a consistent view of the network topology after a fault. These delays stem from temporary routing table fluctuations formed during the operation of the Border Gateway Protocol (BGP) path selection process on Internet backbone routers. During these periods of delayed convergence, we show that end-to-end Internet paths will experience intermittent loss of connectivity, as well as increased packet loss and latency. We present a two-year study of Internet routing convergence through the experimental instrumentation of key portions of the Internet infrastructure, including both passive data collection and fault-injection machines at major Internet exchange points. Based on data from the injection and measurement of several

  • tions. For example, transient disruptions in backbone networks

that previously impacted a handful of scientists may now cause enormous financial loss and disrupt hundreds of thousands of end users. Since its commercial inception in 1995, the Internet has lagged behind the public switched telephone network (PSTN) in availability, reliability, and quality of service (QoS). Factors contributing to these differences between the commercial Internet infrastructure and the PSTN have been discussed in various literature [26], [18]. Although recent advances in the IETF’s Differentiated Services working group promise to improve the performance of application-level services within some networks, across the wide-area Internet these QoS algorithms are usually predicated on the existence of a stable

slide-18
SLIDE 18

10/2/18 18

Delayed Internet Convergence

Measurement Problem discovery Modeling & analysis Improvement

  • Methodologies
slide-19
SLIDE 19

10/2/18 19

Experiments setup

  • Actively inject BGP faults

– How is fault injected?

  • Passively listen at peering sessions, and use NTP

synchronized machines to calculate the convergence time

  • Actively send probe packets to observe end-to-end

packet loss and latency

  • Much BGP work later uses similar measurement

techniques.

slide-20
SLIDE 20

10/2/18 20

Results show delayed convergence

  • Bad news travels slow.
slide-21
SLIDE 21

10/2/18 21

Slow routing convergence results in poor end-to-end performance

slide-22
SLIDE 22

10/2/18 22

What causes the delayed routing convergence?

  • A simple BGP convergence model reveals that in

the worse case, all possible paths are explored before a prefix is withdrawn.

  • No minimum advertisement timer: synchronized

network, global message queue

1 2 R (*0R, ∞,, 2R) (∞, ∞, *2R) (∞, ∞, *20R) (*0R, 1R, ∞,) (01R, *1R, ∞) (*01R, 10R, ∞,) (∞, *1R, 2R) (∞, ∞, *2R) (∞, ∞, ∞) 01R 01R 10R 10R 20R 20R

slide-23
SLIDE 23

10/2/18 23

Min router Advertisement interval timer(MRAI) reduces message count

  • Why?

– MATI introduces synchronization. Multiple announcements are combined into one announcement, reducing the total message count.

  • However, the convergence time

becomes proportional to timer_interval * (n-3)

slide-24
SLIDE 24

10/2/18 24

Lets brain storm…

  • How can we fix the slow convergence

problem?

– What is the solution proposed by the authors?

  • Sender-side loop detection. When a sender detects

a loop, it sends a withdrawal to a neighbor

  • immediately. Since withdrawal is not subject to

MATI delay, this improvement reduces both message count and convergence time.

– What exactly is the root cause of BGPs slow convergence problem? – Can you come up with any solution?

slide-25
SLIDE 25

10/2/18 25

Sender-side loop detection

  • Without sender-side loop detection:

– AS3 à AS1: 301R – This announcement is sent out when MRAI timer expires

  • With sender-side loop detection:

– AS3->AS1: withdrawal – Withdrawal is sent out immediately. AS1 knows it has no path.

slide-26
SLIDE 26

10/2/18 26

BGP assertion

  • Detect path inconsistency between different neighbors
  • If inconsistency is found, give path learned from direct

neighbors high priority

  • Sensitive to topology
  • Does not eliminate all invalid paths

N1 R D X Y N2

X

N1à R: N1 X Y D R: N1 D N2 N1 D

slide-27
SLIDE 27

10/2/18 27

Ghost flushing

  • If new path is worse than last announced path, and

router advertisement timer has not expired yet, send a withdrawal immediately.

  • The withdrawal flushes ghost information.
  • Reduces the convergence time because withdrawals

are not delayed by MRAI, but does not help much with Tlong.

N1 R D X Y N2

X

N1à R: withdrawal R: N1 D N2 N1 D

slide-28
SLIDE 28

10/2/18 28

BGP root cause notification

  • Neither BGP-assertion nor ghost flushing works well in this

topology.

– Why? – BGP-assertion: 3 and 6 are both direct neighbors to 5, but their announcements may be inconsistent – BGP ghost flushing: the newer path is subject to MATI delay

  • Explicitly send out link up/down information
  • Essentially adds link-state information into BGP
  • Sequence number is used to order the notifications.
  • Open research problem: can you get rid of sequence number?

5 3 4 2 1

X

6

slide-29
SLIDE 29

29

slide-30
SLIDE 30

Background

  • Route import/export follow business

relationship between ASes

  • Routing advertisements include AS path

information

  • à Use this information to infer AS-level

topology

30

slide-31
SLIDE 31

An annotated AS Graph

  • Sibling: provide transit services for each
  • ther.

– May belong to the same company

31

AS5 AS1 AS2 AS3 AS4 peer-to-peer sibling-to- sibling edge edge AS6 AS7 provider-to

  • customer edge
slide-32
SLIDE 32

The valley-free property

  • Valley-free: After traversing a provider-

to-customer or peer-to-peer edge, the AS path can not traverse a customer-to- provider or peer-to-peer edge

32

slide-33
SLIDE 33

What type of paths are legitimate?

33

slide-34
SLIDE 34

The basic algorithm

  • The node with the largest number of

neighbors is the top-level provider on an AS path

34

slide-35
SLIDE 35

35

slide-36
SLIDE 36

Taken into configuration errors

36

slide-37
SLIDE 37

Inferring peering relationships

37

slide-38
SLIDE 38

Evalution

  • Verified using AT&T internal information

38

slide-39
SLIDE 39

10/2/18 39

Summary

  • BGPs slow convergence problem and
  • ther problems
  • It represents a message overhead,

processing overhead, and latency tradeoff

  • How to infer AS-level topology
slide-40
SLIDE 40

Comments

  • Measurement paper

– Data – Data collection techniques – Data analysis

  • When to write a measurement paper

10/2/18 40