trends in wide area networking
play

Trends in Wide Area Networking Phil DeMar (remotely) ISGC - PowerPoint PPT Presentation

Trends in Wide Area Networking Phil DeMar (remotely) ISGC Symposium Wednesday, April 16, 2008 1 Hybrid networks & end-to-end data circuits What are hybrid networks? No single, concise definition My definition: Networks that


  1. Trends in Wide Area Networking Phil DeMar (remotely…) ISGC Symposium Wednesday, April 16, 2008 1

  2. Hybrid networks & end-to-end data circuits What are hybrid networks? � No single, concise definition � My definition: Networks that mix underlying network technologies, � protocols, and/or administrative boundaries, to support network connections in a non-classic, routed-IP manner What is an end-to-end (E2E) circuit? � Again, no single, concise definition � My definition: a data circuit between two sites that appears at the IP � level to be a direct connection E2E circuits are established over hybrid networks � Usually crossing multiple network boundaries & administrative domains • 2

  3. Why end-to-end circuits? Convergence of need, capability, & strategic direction � And sometimes just because our stakeholders ask for them… � Need: � Emerging high impact data � movement needs for LHC Traffic projections call for rapid • increase in traffic levels Predictable network performance � requirements: Distributed experiment data storage • Distributed analysis model • Isolation of high impact traffic from general internet traffic � Interactive applications don’t mingle well with bulk data transfers • 3

  4. Why end-to-end circuits? - Capability Emerging Research & Education (R&E) optical network infrastructure � offers flexible & economic growth options In US, Internet-2 and ESnet backbones based on leased fiber and dense � wave division multiplexing (DWDM) equipment Additional data channels just require adding DWDM transponders • Facilitates wider & more flexible service offerings, such as E2E circuits • Similar evolution of R&E networks occurring across the globe � Site connectivity migrating along similar paths, where feasible � Fermilab leased dark fiber to nearby StarLight international optical � network exchange in 2004 Initial configuration: 1x10GE & 2x1GE channels • Today, its a cornerstone of a metropolitan area network (MAN) � Fermilab offsite network capacity = 8 x 10GE, with redundant fiber paths • Fiber connectivity at StarLight offers wide scope of network connection options • 4

  5. Why end-to-end circuits? - Strategic Direction ESnet strategic direction: � High bandwidth, scalable, reliable � production IP network service Very high-bandwidth network for � large scale science data flows ESnet Science Data Network • MANs for National Labs local access � The future: � Routed IP service needs met by 10Gb/s � SDN service grows to 20-60Gb/s � MAN bandwidth follows SDN � 5

  6. Fermilab End-to-End Circuits Deployed starting in 2004 � Serve a wide spectrum of � experiments CMS Tier-2s = heavy users � Implemented on multiple � technologies But based on end-to-end � layer-2 paths Usefulness has varied � WestGrid & Apache Pt circuits decommissioned � IP routed path performance was perfectly acceptable • 6

  7. Local Topology for Circuit Support Four 10GE channels for offsite traffic: � One reserved for routed IP service � One supports LHC traffic with CERN � Two support E2E circuits to CMS sites � Addtl infrastructure for legacy 1GE � connections Circuits based on end-to-end vLANs � Direct peering with remote site � Multiple provider domains is the norm � Technology varies by domains involved � Complexity higher than IP service � 7

  8. Rerouting E2E circuit Traffic Identify high impact traffic flows: � Based on source/dest. netblock pairs � Deploy alternate path border router for � E2E`circuits Peer directly across circuits, advertising only � source netblock Implement alternate forwarding: � Policy route outbound on source/dest pairs: � Coordinate similar policy of remote end � Implement inbound policy routing on � alternate path border router Unexpected traffic rerouted to general IP path � 8

  9. Usefulness of E2E Circuits Traffic characterization shows � FNAL E2E circuit traffic dominates Particularly during peak traffic � periods July 2007 peak in excess of 2PB � Equivalent to sustained 7Gb/s for • the month A few sites capable of 7-8Gb/s � Ratio of circuit-based traffic to routed traffic also a reflection of � performance capability US Tier-2s (circuit-based) currently sustaining 2-3 Gb/s and higher � European & Asian Tier-2s (routed) typically sustain sub-gigabit rates � Performance tuning at longer round trip times still a bit of an art… • 9

  10. Types of End-to-End Circuits � Static topology, dedicated link Likely evolution of E2E circuit technology Likely evolution of E2E circuit technology � Most of our E2E circuits today � Scalability & cost concerns We are somewhere � Static topology, dynamically scheduled in here � Scheduling policy issues hinder use � Static topology, dynamic provisioning � Emerging services becoming increasingly available � Dynamic topology; dynamic provisioning � Nirvana, but a long way from being feasible 10

  11. A Static Hybrid Network – The LHCOPN LHCOPN = LHC Optical Private � Network A dedicated 10Gb/s network for � LHC T0/T1 data movement Based on layer-2 technologies � Routing only at the end points � Pass-thru routing at CERN � facilitates Tier-1/Tier-1 transfers Redundancy can be difficult/costly � Each LHCOPN circuit is a � concatenation of layer 2 links: Involves multiple � administrative domains 11

  12. Emerging Dynamic Circuit Services Internet2 & ESnet now offering integrated dynamic circuit services � Integrated control plane via functional Inter- Domain Controller (IDC) � Independent, local domain controllers (DCs) manage within domains � Dynamic circuit provisioning makes � network paths potentially a service Path setup & selection servers: � Fermilab’s Lambda Station � Brookhaven’s TeraPaths � Dynamic circuit trials underway: � FNAL/Univ. of Nebraska � BNL/Boston Univ. � 12

  13. FNAL / Nebraska Dynamic Circuit IDC/ESNet IDC/Internet2 Flow analysis Lambda Station Server Lambda Station Server ESnet OSCARs Internet2 DCS Flow data Routed R&E Network Network infrastructure Network University infrastructure of Nebraska Fermilab Control plane Default network path FtWatch request for circuit A path across DCN Inter-Lambda Station coordination Circuit call setup & teardown LAN reconfiguration to use circuit 13

  14. Large-scale data recover via DCN � Shortly after deploying their dynamic circuit, Nebraska lost their Tier-2 data cache � 50TB Data recovered by data transfer from FNAL Tier-1 � Largely via Internet2/ESNet Dynamic Circuit � Completed in 32 hours � Graceful cutover to & fall back from the dynamic circuit 14

  15. Operational issues with circuits � E2E circuit failure modes are different than for IP service � They are more complex � Impact of the failure may be severely felt elsewhere � Operational failures can be “creative” and difficult to troubleshoot � Monitoring infrastructure is critical, but requires new effort � Asymmetric paths will occur and will be difficult to detect � We’re working on flow data analysis to detect this UNL CMS � Unexpect consequences traffic for changes � UNL moves several T2 ESnet IP path systems to a new subnet 15

  16. PerfSonar for Monitoring Capabilities � Motivation for PerfSonar � Need for integrated path monitoring capability across multiple administrative domains Not specifically for E2E circuits, but definitely a stronger need • � What is PerfSonar? � A global collaboration � An architecture and a set of protocols � Several interoperable software implementations Multi-Domain Monitoring (MDM) utility {Dante} • PerfSonar-PS {US-based collaboration headed by Internet2} • � A measurement infrastructure 16

  17. MP = measurement point 17 PerfSonar Architecture

  18. E2E Link Monitoring (CERN-LHCOPN-FNAL-001) LHCOPN link status monitored by E2ECU (Dante service) � PerfSonar monitoring within each administrative domain � Independently-managed Measurement Points (MPs) • Centralized collection (E2ECU) of circuit status from each domain � Location of faults identified & displayed � Appropriate alarms & notifications sent out • 18

  19. PerfSonar Active Performance Monitoring High Level (near term) Objectives: � Detect & react to changes in the underlying network � Quickly identify network problems: � Even if not noticeably impacting applications • Distinguish between application level & network level problems • Types of active measurements: � One-way delay (OWAMP & Hades) � Round trip time (Pinger) � Bandwidth measurement (BWCTL) � Path monitoring (Traceroute) � Less interesting within E2E circuit environment • Pending implementations: � LHCOPN to be monitored by centrally-managed MDM appliance {GEANT} � (Some) LHC Tier-1/Tier-2 paths with PerfSonar-PS � 19

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend