
Network activity in EGEE-III SA2, Xavier Jeannin (CNRS/UREC)


  1. Network activity in EGEE-III SA2
     Enabling Grids for E-sciencE
     Xavier Jeannin (CNRS/UREC), SA2 Activity Manager
     7th NRENs and Grids Workshop (Dublin), 1/2 September 2008
     www.eu-egee.org
     EGEE and gLite are registered trademarks
     EGEE-III INFSO-RI-222667

  2. Agenda
     • EGEE size and statistics
     • SA2 Network activity
       – Technical Network Liaison Committee (TNLC)
       – EGEE Network Operations Center (ENOC)
       – EGEE-III Projects
         * LHCOPN support / operational model
         * Trouble matching and correlation
         * Tools for troubleshooting
         * Grid site networking needs
         * Advanced network services
         * IPv6
         * Trouble Ticket standardization
     • European Grid Initiative, National Grid Initiative
       – Lessons learnt from EGEE
       – Network activity in EGI/NGI
     • Conclusion

  3. EGEE: the largest multi-disciplinary research Grid infrastructure in the world
     [Chart: No. Cores, axis 0 to 80000, April 2004 to April 2008]
     [Chart: No. Sites, axis 0 to 300, April 2004 to April 2008]

  4. Users and resources distribution
     [Figure: users and resources distribution, Feb’08. With the courtesy of Bob Jones.]

  5. Highlights of EGEE-II - Applications
     • >270 VOs from several scientific domains
       – Astronomy & Astrophysics
       – Civil Protection
       – Computational Chemistry
       – Comp. Fluid Dynamics
       – Computer Science/Tools
       – Condensed Matter Physics
       – Earth Sciences
       – Fusion
       – High Energy Physics
       – Life Sciences
     • Further applications under evaluation
     Applications are moving from testing to routine and daily usage
     With the courtesy of Erwin Laure

  6. SA2 in EGEE-III
     • Total of 375 FTEs in EGEE-III
       – 9010 person months (vs. 11165 PMs in EGEE-II; ~20% less)
       – Grand total combining funded and unfunded contributions
         * No difference for execution of program of work!
     • Network activity SA2 = 14 persons + TNLC, 159 PMs
     [Pie chart, share of effort per activity: NA1 2%, JRA1 5%, NA2 5%, SA3 9%, NA3 8%, SA2 2%, NA4 19%, NA5 1%, SA1 49%]
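The effort figures on this slide can be cross-checked with a few lines of arithmetic. The sketch below uses only the person-month numbers quoted above; the 24-month project duration is an assumption that the slide does not state, and the function names are illustrative.

```python
# Person-month (PM) figures quoted on the slide.
EGEE_II_PM = 11165
EGEE_III_PM = 9010
SA2_PM = 159

# Assumption (not stated on the slide): EGEE-III runs for 24 months.
PROJECT_MONTHS = 24


def percent_reduction(before, after):
    """Relative decrease from `before` to `after`, in percent."""
    return 100.0 * (before - after) / before


def share(part, total):
    """Share of `part` in `total`, in percent."""
    return 100.0 * part / total


reduction = percent_reduction(EGEE_II_PM, EGEE_III_PM)
sa2_share = share(SA2_PM, EGEE_III_PM)
average_ftes = EGEE_III_PM / PROJECT_MONTHS

print(f"Effort reduction vs. EGEE-II: {reduction:.1f}%")
print(f"SA2 share of EGEE-III effort: {sa2_share:.1f}%")
print(f"Average FTEs over {PROJECT_MONTHS} months: {average_ftes:.0f}")
```

Under that duration assumption the results are consistent with the rounded values on the slide (about 20% less effort, about 2% for SA2, 375 FTEs).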

  7. SA2 Global view
     [Diagram of SA2 tasks in EGEE-III and responsible partners; labels as extracted:]
     • ENOC running
     • Overall networking coordination
     • Support for the ENOC
     • Operational procedures (CNRS)
     • LCG support (CNRS)
     • Operational tools and maintenance (RRC-KI, CNRS)
     • IPv6 (GARR, CNRS)
     • TT exchange standard (GRNET)
     • Advanced network services (GRNET)
     • Monitoring (DFN)
     • Troubleshooting (DFN)
     • Site networking needs (RedIRIS)
     • TNLC

  8. Technical Network Liaison Committee
     • Technical Network Liaison Committee – TNLC
       – Facilitate cooperation between EGEE on the one hand and GÉANT2 and the NRENs on the other hand
       – Members: CERN; CNRS, France; DANTE, UK - the GÉANT2 operator; RRC KI, Russia; DFN-Verein, Germany; GARR, Italy; GRNET, Greece; RedIRIS, Spain...
     • Main themes
       – Monitoring (E2ECU, monitoring LHCOPN/EGI)
       – Standardization of network trouble tickets (assessment of the impact of a trouble ticket on the grid)
       – Advanced network services (AMPS/SLA, new advanced network services)

  9. EGEE’08 conference
     • NRENs are invited to take part in the TNLC

  10. Role of the ENOC
     [Diagram: end-to-end path Grid site 1, RC 1 (operated by NOC of RC1), NREN A (operated by NOC of NREN A), GÉANT2 (operated by DANTE), NREN B (operated by NOC of NREN B), RC 2 (operated by NOC of RC2), Grid site 2; the ENOC ensures E2E connectivity for Grid sites on the whole path]
     • ENOC ensuring E2E connectivity for Grid sites
     • Assess the impact on the Grid of network trouble
     • Troubleshoot problems
       – Provide support to users
       – Identify the faulty domain
     • Assess the network connectivity of the Grid sites

  11. The ENOC
     – A single point of contact between EGEE and the NRENs where EGEE and the network can exchange operational information
     – A Network support unit in GGUS (trouble ticket system of EGEE)
     [Diagram labels: GGUS, ENOC, Support Units, Users, Sites, EGEE Network, NRENs, GÉANT2]
     • Interface with the EGEE user support:
       – Receive tickets assigned to ENOC by the GGUS 1st-level support
       – Troubleshoot them provided that the ENOC has access to suitable monitoring tools
       – Contact identified faulty domains or reassign the ticket to the associated site if this is a local network issue
     • Interface with network providers:
       – Collect tickets from NRENs
       – Assess impact on the grid infrastructure
       – Forward to GGUS tickets that seem relevant
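The network-provider side of this workflow (collect NREN tickets, assess the impact on the grid, forward the relevant ones to GGUS) can be illustrated with a minimal sketch. Everything here is hypothetical: the `NrenTicket` fields, the `SITE_DEPENDENCIES` map, and the rule "a ticket is relevant if at least one Grid site depends on an affected element" are assumptions for illustration, not the ENOC's actual data model or procedure.

```python
from dataclasses import dataclass


@dataclass
class NrenTicket:
    """Hypothetical, simplified view of a trouble ticket received from an NREN."""
    ticket_id: str
    nren: str
    affected_elements: set  # identifiers of links or routers named in the ticket


# Hypothetical map: network element -> Grid sites that depend on it.
SITE_DEPENDENCIES = {
    "link-A": {"site-1", "site-2"},
    "link-B": {"site-3"},
}


def impacted_sites(ticket, dependencies):
    """Collect every Grid site that depends on an element named in the ticket."""
    sites = set()
    for element in ticket.affected_elements:
        sites |= dependencies.get(element, set())
    return sites


def triage(tickets, dependencies):
    """Split tickets into those to forward (with impacted sites) and those to ignore."""
    forward, ignored = [], []
    for ticket in tickets:
        sites = impacted_sites(ticket, dependencies)
        if sites:
            forward.append((ticket.ticket_id, sorted(sites)))
        else:
            ignored.append(ticket.ticket_id)
    return forward, ignored


if __name__ == "__main__":
    incoming = [
        NrenTicket("T1", "NREN-A", {"link-A"}),
        NrenTicket("T2", "NREN-B", {"link-Z"}),
    ]
    to_forward, to_ignore = triage(incoming, SITE_DEPENDENCIES)
    print("Forward to GGUS:", to_forward)
    print("No known grid impact:", to_ignore)
```

The sketch keeps impact assessment separate from the forwarding decision, so a different relevance rule could be swapped in without touching the ticket handling.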

  12. Assess the network connectivity of the Grid sites
     • Specific tools developed: Downcollector, see https://ccenoc.in2p3.fr/
     [Chart: number of connectivity troubles detected on EGEE Grid certified sites, sorted per supposed location (WAN/MAN; LAN / non-network (power…); unknown), together with the number of sites with at least one network trouble; annotation: 282 certified Grid sites; August 2007 to March 2008]

  13. Support of LHCOPN
     • The LHC Optical Private Network
     • 15 PB of data per year generated by the LHC
     http://ccenoc.in2p3.fr/ASPDrawer/

  14. Support of LHCOPN
     • SA2 objectives in the LHCOPN context are:
       – Define the operational model
         * Define accurately the responsibilities of each actor
         * Ensure that problem resolution is not delayed by an unsuitable operational model
         * Ensure the LHCOPN is well monitored
       – Set up communication channels between this network and the EGEE Grid (scheduled downtimes, incidents etc.)
     • LHCOPN operational model:
       – Federative model, responsibility shared by the Tier 1s and the Tier 0
       – Approach: define the actors and their relationships, where to find the information, and the procedure
         * All actors agree on the operational model and are aware of their role and of the procedure they should apply
       – Draft: Operational model WIKI

  15. LHCOPN Operational model

  16. LHCOPN Operational model

  17. Trouble matching and correlation (RRC-KI)
     • Trouble matching and correlation for the ENOC
       – From a discovered incident, find the related network trouble ticket
       – Better trouble localisation
       – Different methods will be tested
     • First method
       – Another monitoring tool (SmokePing) has been set up, located in Russia
       – The results of this tool and those from the ENOC (Downcollector, Lyon) are matched up
       – The two tools are located in two different places in order to improve the knowledge of the network topology
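One way to picture the first method is to compare, site by site, what the two vantage points observe. The sketch below is an illustration under stated assumptions, not the RRC-KI implementation: the input format (a reachability flag per site for each vantage point) and the localisation rule are hypothetical. The rule assumes that a site unreachable from both places probably has a problem close to the site, while a site unreachable from only one place probably has a problem on the path from that place.

```python
def localise(reachable_from_lyon, reachable_from_russia):
    """Hypothetical rule of thumb for where a connectivity trouble may sit."""
    if reachable_from_lyon and reachable_from_russia:
        return "no trouble"
    if not reachable_from_lyon and not reachable_from_russia:
        return "near the site"
    if not reachable_from_lyon:
        return "on the Lyon-side path"
    return "on the Russia-side path"


def match_up(lyon_results, russia_results):
    """Compare both result sets; only sites probed from both places are matched.

    Each argument maps a site name to True (reachable) or False (unreachable).
    """
    common_sites = sorted(lyon_results.keys() & russia_results.keys())
    return {
        site: localise(lyon_results[site], russia_results[site])
        for site in common_sites
    }


if __name__ == "__main__":
    # Hypothetical observations from the two monitoring tools.
    lyon = {"site-1": True, "site-2": False, "site-3": False}
    russia = {"site-1": True, "site-2": False, "site-3": True, "site-4": False}
    for site, verdict in match_up(lyon, russia).items():
        print(site, "->", verdict)
```

A site seen from only one vantage point (here `site-4`) is left out, since nothing can be matched for it.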

  18. Network Operational Database

  19. Tools for troubleshooting (DFN)
     • Tools for efficient troubleshooting
       – Launch tests on demand from the Grid site under central server control: ping, traceroute, DNS lookup, nmap and bandwidth measurements.
     [Diagram: numbered steps 1 to 5 linking the site administrator, the ENOC supervisor, the ENOC, the central ENOC monitoring server, and the local light perfSONAR sensors at Grid site A and Grid site B]
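As a rough illustration of "tests on demand under central server control", the sketch below shows how a site-side agent might accept only a fixed set of test names and turn a request into a command line. It is not the DFN tool or a perfSONAR plug-in: the names `ALLOWED_TESTS`, `build_command` and `run_test` are hypothetical, the commands are the standard Unix utilities (`ping -c`, `traceroute`, `nslookup`, `nmap`), and bandwidth measurements are left out because the slide does not name a tool for them.

```python
import re
import subprocess

# Hypothetical whitelist: test name -> base command (standard Unix utilities).
ALLOWED_TESTS = {
    "ping": ["ping", "-c", "4"],
    "traceroute": ["traceroute"],
    "dns": ["nslookup"],
    "nmap": ["nmap"],
}

# Conservative host name / IPv4 pattern; rejects anything that looks like an option.
_HOST_RE = re.compile(r"[A-Za-z0-9](?:[A-Za-z0-9.-]{0,251}[A-Za-z0-9])?")


def build_command(test, target):
    """Return the argument list for a requested test, or raise ValueError."""
    if test not in ALLOWED_TESTS:
        raise ValueError(f"test not allowed: {test!r}")
    if not _HOST_RE.fullmatch(target):
        raise ValueError(f"invalid target: {target!r}")
    return ALLOWED_TESTS[test] + [target]


def run_test(test, target, timeout=60):
    """Run the test locally and return (exit code, captured output)."""
    completed = subprocess.run(
        build_command(test, target),
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    return completed.returncode, completed.stdout


if __name__ == "__main__":
    # Only builds the command line; call run_test() to actually execute it.
    print(build_command("ping", "example.org"))
```

Passing an argument list to `subprocess.run` (rather than a shell string) and validating the target keeps a remotely requested test from injecting extra options or commands.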

  20. Tools for troubleshooting (DFN)
     • Active measurements on demand, lightweight perfSONAR version with a specific plug-in
     • Looking for beta-tester sites
     • NRENs can take advantage of the deployment of this software
       – To troubleshoot their own grid nodes

  21. Grid site networking needs (RedIRIS)
     • Establish empirically each site's network needs according to the type of
       – Site (Tier 0, 1, 2, 3)
       – Experiment computed at the site
     • Working plan
       – Review of the status of Tier 2 / Tier 3 sites in Spain
       – Translate the requirements and needs into network parameters to be measured
       – Brief review of the different network performance and monitoring tools that the tiers agree to deploy
       – Pilot / service definition for deploying perfSONAR
       – Definition of performance and monitoring tests
       – Test phase, results and conclusions
