iToM: An Internet Topology Mapping Project Kamil Sarac - - PowerPoint PPT Presentation

itom an internet topology mapping project
SMART_READER_LITE
LIVE PREVIEW

iToM: An Internet Topology Mapping Project Kamil Sarac - - PowerPoint PPT Presentation

iToM: An Internet Topology Mapping Project Kamil Sarac (ksarac@utdallas.edu) Department of Computer Science The University of Texas at Dallas Internet topology measurement/mapping Need for Internet topology measurement Help with


slide-1
SLIDE 1

iToM: An Internet Topology Mapping Project

Kamil Sarac (ksarac@utdallas.edu) Department of Computer Science The University of Texas at Dallas

slide-2
SLIDE 2

 Need for Internet topology measurement

  • Help with network management or surveillance
  • Robustness with respect to failures/attacks
  • Comprehend spreading of worms/viruses
  • Relevant in active defense scenarios
  • Scientific discovery
  • Scale-free (power-law), Small-world, Rich-club,

Disassortativity,…

Internet topology measurement/mapping

2

slide-3
SLIDE 3

Subnets as first class citizens in network layer Internet topology maps

3

slide-4
SLIDE 4

Network layer Internet topology maps

 A sample IP network segment view at Layer 3

  • A number of routers connected via subnets

R1 R2 R3 R6 R5 R7 R4

4

slide-5
SLIDE 5

A router level map at layer 3

R1 R2 R3 R6 R5 R7 R4

 A corresponding router level map view

5

slide-6
SLIDE 6

How to build router level network maps ?

 Involves topology data collection and topology

construction

 How to collect topology data ?

Traceroute – a network debugging and diagnostic tool

End-to-end traces from k vantage points to n destinations where (typically) k << n

 How to construction topology maps?

Resolving alias IP addresses

Resolving anonymous routers

6

slide-7
SLIDE 7

A network layer view incl. routers & subnets

R1 R2 R3 R6 R5 R7 R4

 Not all subnets are created equal !  Can we discover layer 3 view of subnets ?  List of alive IP addresses  Subnet number as a.b.c.d/x

7

slide-8
SLIDE 8

How to discover a subnet?

 ExploreNET – an active probing based tool

  • Given an IP address , discovers the subnet

hosting

  • Labels with its observable subnet mask
  • A black box using a set of heuristics for subnet

inference

R R

Vantage V

S

1 3

R2 a b c

8

slide-9
SLIDE 9

How to discover a subnet?

 ExploreNET accuracy rates (experimental)

  • 94.9% for Internet2
  • 97.3% for GEANT
  • 93.0% for global public Internet (w.r.t. mrinfo data)

 Probing cost is within to

9

slide-10
SLIDE 10

Why know subnets?

S1 S2 S3 S4 S1 S2 S3 S4

/30 /29 /31 /29

  • 1. A more complete network layer picture of

the underlying network

  • 2. An alternative layer 3 view of the Internet

map where

 subnets are nodes  routers are links

10

slide-11
SLIDE 11

Statistical sampling for studying characteristics of networks

11

slide-12
SLIDE 12

Why statistical sampling?

 Difficult to collect complete topology map

  • Internet/ISP topologies (eg. subnet level maps)
  • Social network graphs (eg. Facebook)

 Statistical sampling as a viable solution  Challenges in statistical sampling

  • Sampling error vs. non-sampling error
  • Unresponsive units
  • Discrepancy between sampling & observation units

 Goal: develop good (unbiased) estimators

12

slide-13
SLIDE 13

Statistical sampling of subnets in a network

 Subnet characteristics of interest

  • Number of subnets
  • Subnet prefix length distribution
  • Mean subnet prefix length
  • IP address utilization ratio

13

slide-14
SLIDE 14

Subnet sampling

 Non-uniform sampling of subnets due to degree

discrepancy

is twice as likely to be sampled as compared to

1 S 2 S 3 S 4 S

Subnets we want to sample from

IP addresses we can sample from Target Network

14

slide-15
SLIDE 15

Challenges in network sampling

 How to design an effective sampling

scheme?

  • What is the impact of the characteristics under

study?

  • What sampling/entity selection method to use ?
  • Random selection, crawling, forest fire, etc
  • What objects to sample ?
  • Nodes, links, cliques, end-to-end paths, etc
  • How to overcome application domain specific

limitations to sampling ?

  • Mismatch between selection units and observation

units in sampling

15

slide-16
SLIDE 16

Thank you

16