The Network Operation Centre of a RREN: Anella Científica

SLIDE 1

The Network Operation Centre of a RREN: Anella Científica

Maria Isabel Gandía Carriedo
Communications Area, Systems & Networks Department, CESCA
TF-NOC Preparation Meeting
NORDUnet A/S, Kastrup, 3/5/2010

SLIDE 2

Agenda

About CESCA and Anella Científica

Anella Científica/CESCA NOC:

  • Communication with the users
  • How we manage the network
  • How we manage dedicated circuits

Tools

  • Communications database
  • Ad-hoc scripts
  • Cacti & its plugins
  • PerfSonar
  • SMARTxAC
  • NAM
  • Other tools

Conclusions

SLIDE 3

About CESCA and Anella Científica

Public consortium

  • Created in 1991
  • Formed by:
    – Generalitat de Catalunya
    – Talència
    – 9 Catalan universities
    – Consejo Superior de Investigaciones Científicas

Anella Científica created in 1993
CATNIX created in 1999

SLIDE 4

Our Services

SLIDE 5

About Anella Científica

Anella Científica is the research and education network in Catalonia
Managed by CESCA
Connected to RedIRIS
With more than 80 points of access of institutions related to research

SLIDE 6

Anella Científica: Evolution

[Figure: number of points of access and aggregated capacity (Mbps) per year, 1993–2010, grouped by access speed: ≤ 10 Mbps, 10–90 Mbps, 100–990 Mbps, ≥ 1,000 Mbps]

[Figure: traffic (TB) per year, 2002–2010]

SLIDE 7

Anella Científica: Architecture

Some local dark fibre links
L2 Gigabit Ethernet network
Flexible and easily scalable
Different points of access & connections:

  • Ethernet: 10, 34, 100, 1,000 and 10,000 Mb/s
  • ADSL, SHDSL

Core is a full mesh, redundancy in the links between nodes
Access is a “ring”: dual homing
Redundancy of the provider network and the WDM network
Customizable CIR + EIR
QoS capabilities at L2 network
…but the model will probably change

SLIDE 8

Anella Científica: projects

PIC participates in LHC (10 Gbps)
i2CAT participates in FEDERICA, Phosphorus, HDVIPER (10 Gbps)
UPC-CCABA participates in EuQoS, MUPBED,… (1 Gbps)
CESCA, i2CAT & UPC participate in PASITO (10 Gbps)
BSC participates in RES (1 Gbps)
Liceu transmits the course Opera Oberta

SLIDE 9

Anella Científica: L3

CESCA, as the manager of the Regional Research and Education Network (RREN) in Catalonia and as a Local Internet Registry (LIR), has:

  • Addresses for the connected institutions:
    – IPv4: 84.88.0.0/15
    – IPv6: 2001:40B0::/32
  • An Autonomous System (AS):
    – AS13041

CESCA controls all the L3, some L2 and some L1, so our monitoring is mostly L3-based.
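As an illustration of working with the address blocks on this slide, the following is a minimal sketch that checks whether an address belongs to one of the two prefixes. The function name `in_anella` is an assumption made for this example; only the prefixes come from the slide.

```python
import ipaddress

# Prefixes listed on the slide for the connected institutions.
ANELLA_PREFIXES = [
    ipaddress.ip_network("84.88.0.0/15"),
    ipaddress.ip_network("2001:40B0::/32"),
]


def in_anella(address: str) -> bool:
    """Return True if the address falls inside one of the prefixes above.

    `in_anella` is a hypothetical helper name, not part of any CESCA tool.
    """
    ip = ipaddress.ip_address(address)
    # Testing an IPv4 address against an IPv6 network (or vice versa)
    # evaluates to False, so no explicit version check is needed.
    return any(ip in net for net in ANELLA_PREFIXES)
```

A /15 spans two consecutive /16 blocks, so here it covers 84.88.0.0 through 84.89.255.255.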

SLIDE 10

Anella Científica: topology

  1. Public and private non-profit Universities
  2. Official Bodies of Research
  3. Other non-profit Research centres
  4. Hospital Research centres

  1. Official bodies of R+D management
  2. Relevant Digital contents institutions
  3. R+D+i participants
  4. Special interest for R+D institutions

  1. Science and technological parks
  2. Other hospital units

[Diagram labels: A, B, C; C. Nord, Telvent, Operator, Internet]

SLIDE 11

Anella Científica: circuits

Permanent circuits & services:

  • Each point of access has one circuit to each core node for redundancy (using L3 routing)
  • An institution can have more than one VLAN with other points of access that usually belong to the same institution (internal traffic)
  • An institution can have a dedicated virtual router, managed by CESCA, to aggregate some connections

[Diagram labels: A, B, C; C. Nord, Telvent, Operator]

SLIDE 12

Anella Científica: points of access

[Diagram: core with two backbone nodes and an access ring with two access nodes; link lengths labelled 10~70 km, 10~40 km and 10~70 km]

SLIDE 13

Agenda

About CESCA and Anella Científica

Anella Científica/CESCA NOC:

  • Communication with the users
  • How we manage the network
  • How we manage dedicated circuits

Tools

  • Communications database
  • Ad-hoc scripts
  • Cacti & its plugins
  • PerfSonar
  • SMARTxAC
  • NAM
  • Other tools

Conclusions

SLIDE 14

The NOC: Communications Area

Some numbers:

  • 85 points of access
  • 2 core nodes
  • 76 institutions connected to Anella Científica
  • 22 entities connected to CATNIX
  • 4 network engineers & 1 student
  • 20 engineers for the weekend monitoring

Help from the Operations & Security Area for cabling, installations, etc.
We have a technical and an administrative contact for each institution; they channel all the requests (IP address assignments, routing, dedicated circuits, incidents), but we can talk to relevant users beforehand to learn their needs.
Some technical contacts have a meeting once a year (CTAC).
We organize a Meeting/Workshop (TAC) once a year to present new institutions and projects (for instance, this year, Cloud Computing)

SLIDE 15

Communication institutions -> CESCA

Addresses (RT):

  • ac-noc@suport.cesca.cat
    – Routing
    – Network incidents
  • ac-nic@suport.cesca.cat
    – Address requests
    – Reverse DNS
  • anella.serveis@suport.cesca.cat
    – Services (Multicast, ftp-mirror,…)
  • eduroam@suport.cesca.cat
    – Eduroam
  • eriac@suport.cesca.cat
    – Security incidents

Telephone

SLIDE 16

Communication CESCA -> institutions

Distribution lists:

  • ctac@cesca.cat
    – Members of the Commission
  • anella-t@cesca.cat:
    – Technical representatives
  • anella-a@cesca.cat:
    – Other technical staff
    – Generic addresses

RT queues
Telephone & e-mail
TAC Aula (New Technologies and Seminars)

SLIDE 17

If there is an incident...

During our working hours (9.00-18.00 Mo-Th, 9.00-14.30 Fr, 8.00-15.00 Jul/Aug):

  • They call us
  • They send a message to ac-noc@suport.cesca.cat
  • We try to be very proactive

Outside our working hours, an external company provides a 24x7 reactive service for the institutions.
The external company can check the state of our routers and switches and, if the problem is external, call our provider.
Second-level support from our technicians during the weekend.
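The split between the in-hours NOC and the out-of-hours 24x7 service can be expressed as a small time check. This is only a sketch under stated assumptions: the function name is hypothetical, the July/August timetable is assumed to apply Monday to Friday, and public holidays are ignored.

```python
from datetime import datetime, time


def in_working_hours(moment: datetime) -> bool:
    """Return True if `moment` falls inside the NOC working hours.

    Assumptions (not stated on the slide): the Jul/Aug timetable applies
    Monday to Friday, and public holidays are not taken into account.
    """
    weekday = moment.weekday()  # Monday is 0, Sunday is 6
    if weekday >= 5:
        return False  # weekends are covered by the 24x7 service
    if moment.month in (7, 8):
        start, end = time(8, 0), time(15, 0)
    elif weekday == 4:
        start, end = time(9, 0), time(14, 30)
    else:
        start, end = time(9, 0), time(18, 0)
    return start <= moment.time() < end
```

A monitoring script could use such a check to decide whether an alarm goes to the NOC staff or to the external 24x7 service.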

SLIDE 18

How we manage the network

Inventory of circuits using “our” Communications database
Ad-hoc scripts and alarms
Statistics via SNMP with Cacti
UPC-CCABA has developed a passive monitoring system using real-time analysis: SMARTxAC
Our NOC is subscribed to the Dante E2ECU (End to end coordination unit) mailing list for dedicated circuits
perfSONAR node through RedIRIS for LHC
NAM
Other tools

SLIDE 19

How we manage dedicated circuits

Special circuits & services:

  • If the circuit is between two institutions connected to Anella Científica, we ask both if they want the connection. We have a special range of VLANs for these connections.
  • If the circuit is external, RedIRIS provides a form that the institutions fill in and send to RedIRIS and CESCA, indicating the name of the project, description, responsible entity, kind of connection, etc.
  • For modifications, institutions can ask us directly and we contact RedIRIS
  • RedIRIS and CESCA have agreed on two ranges of VLANs for special projects, one range for each type of encapsulation
  • We use Request Tracker (RT) to handle all the requests, arrange a VLAN number, etc.
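Arranging a VLAN number from a reserved range can be sketched as follows. The range boundaries and the function name are hypothetical; the slides do not give the actual VLAN ranges.

```python
# Hypothetical range reserved for circuits between two connected
# institutions; the real values are not given in the slides.
INTERNAL_CIRCUIT_VLANS = range(3000, 3100)


def next_free_vlan(vlan_range: range, in_use: set) -> int:
    """Return the lowest VLAN ID in `vlan_range` that is not in `in_use`."""
    for vlan_id in vlan_range:
        if vlan_id not in in_use:
            return vlan_id
    raise RuntimeError("no free VLAN left in the reserved range")
```

Keeping one range per circuit type, as described above, means the VLAN number alone already tells the NOC what kind of circuit it is.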

SLIDE 20

For our users:

Listen to their needs first
For each new connection, there are some stress tests before going to a production environment
They can choose static routing or dynamic routing (BGP)
We ping their interface from the other end of the /30 and from our monitoring machine
We apply anti-spoofing filters… Some insist on using the infrastructure address for VPNs
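A /30 point-to-point link has exactly two usable addresses, one per end, which is why pinging "from the other end of the /30" tests the link itself. The sketch below uses documentation prefixes (192.0.2.0/30, 198.51.100.0/24) as stand-ins; which end belongs to whom, and the function name, are assumptions for illustration.

```python
import ipaddress

# Documentation prefix used as a stand-in for a real infrastructure /30.
link = ipaddress.ip_network("192.0.2.0/30")

# A /30 has four addresses: network, two usable hosts, broadcast.
# Assigning the first host to the NOC side is an assumption.
noc_side, institution_side = link.hosts()


def passes_antispoofing(source: str, assigned_prefixes: list) -> bool:
    """Accept a packet only if its source is inside the assigned prefixes."""
    ip = ipaddress.ip_address(source)
    return any(ip in net for net in assigned_prefixes)
```

With this kind of filter, traffic sourced from an address outside the institution's assigned ranges is rejected at the point of access.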

SLIDE 21

Agenda

About CESCA and Anella Científica

Anella Científica/CESCA NOC:

  • Communication with the users
  • How we manage the network
  • How we manage dedicated circuits

Tools

  • Communications database
  • Ad-hoc scripts
  • Cacti & its plugins
  • PerfSonar
  • SMARTxAC
  • NAM
  • Other tools

Conclusions

SLIDE 22

“Our” Communications database

SLIDE 23

“Our” Communications database

We store all the information of our institutions:

  • Points of access
  • Addresses
  • Technical and executive contacts e-mails and telephones
  • Assigned IP addresses
  • Infrastructure addresses (point to point)
  • Equipment
  • Bandwidth
  • Technology
  • Comments, special cases for the 24x7 service

It makes our life easier, as we have many “special” cases:

  • More than one point of access per institution
  • More than one institution per point of access
  • Different circuits intra and inter-institutions

SLIDE 24

“Our” Communications database

All the information from an institution/circuit/person is linked
Every time we need to contact an institution, we find the related information here
It’s not accessible from external networks
It’s programmed by our engineers
It also stores information of the Neutral Internet Exchange, CATNIX
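The linking of institutions, points of access and contacts can be pictured with a small data model. This is a sketch only: the class and field names are assumptions for illustration and do not describe the actual CESCA database schema.

```python
from dataclasses import dataclass, field


@dataclass
class Contact:
    name: str
    role: str  # "technical" or "administrative"
    email: str


@dataclass
class Institution:
    name: str
    contacts: list = field(default_factory=list)


@dataclass
class PointOfAccess:
    name: str
    bandwidth_mbps: int
    # A point of access may serve more than one institution, and an
    # institution may appear under more than one point of access.
    institutions: list = field(default_factory=list)


def contacts_for(poa: PointOfAccess, role: str) -> list:
    """Collect the contacts with the given role behind a point of access."""
    return [c for inst in poa.institutions for c in inst.contacts if c.role == role]
```

Because the records reference each other, a failing point of access leads directly to the people who need to be called.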

SLIDE 25

“Our” Communications database

Pros

  • All the information is together
  • We don’t have to maintain separate files for the assignment of VLANs, IP addresses, etc.
  • Easy creation of new instances
  • When there is a change in the technical/administrative contacts, it’s changed “almost” automatically

Cons

  • Each change requires programming
  • Sometimes the initial programmer is not the same person who makes the changes

SLIDE 26

“Our” ad-hoc scripts

They send e-mails and messages to our mobile phone when a connection fails.
The institution and the problem are in the subject
It’s the best way to be “proactive”
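The slides say these scripts are written in shell; the following Python sketch only illustrates the idea of putting the institution and the problem in the subject line. The sender address, subject format and function name are assumptions, and actually sending the message (for example with `smtplib`) is left out.

```python
from email.message import EmailMessage


def build_alert(institution: str, problem: str, recipient: str) -> EmailMessage:
    """Build an alert whose subject names the institution and the problem."""
    msg = EmailMessage()
    msg["From"] = "noc-alerts@example.org"  # hypothetical sender address
    msg["To"] = recipient
    # Everything needed for a first diagnosis is in the subject, so it can
    # be read on a mobile phone without opening the message.
    msg["Subject"] = f"[ALERT] {institution}: {problem}"
    msg.set_content(f"Connection check failed for {institution}.\nProblem: {problem}\n")
    return msg
```

Usage would look like `build_alert("Example University", "no ping reply", "noc@example.org")`, with the result handed to whatever mail or SMS gateway is available.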

SLIDE 27

“Our” ad-hoc scripts

Pros

  • They are extremely useful to quickly detect problems and to learn about them during the weekend
  • Easy to program (shell)

Cons

  • We need to remember to add the institutions each time there is a new connection (separate maintenance)

SLIDE 28

Cacti

Front-end for RRDtool, a high-performance tool that stores and graphs series of data.
It’s used to monitor:

  • CPU, temperature and memory of the routers
  • Anella Científica: points of access
  • Voice calls
  • Remote and direct access services
  • CATNIX (Internet Exchange)
  • Warnings
  • Automatic monthly statistics
  • BGP prefixes
  • Ping
  • Power consumption
  • RedIRIS & Orange Business Services graphs integrated

SLIDE 29

Cacti: one for users, one for management

[Diagram: two Cacti instances, PRIVATE and PUBLIC; labels: SCP, Contact information]

SLIDE 30

Plugins

Thold

Useful to detect:

  • Down links
  • Congestion
  • High temperature
  • High CPU
  • Excess of BGP prefixes…

ReportIt

Useful to generate monthly reports
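Threshold-based detection of the kind listed above boils down to comparing the latest sample of each metric with a limit. The sketch below is not how the Cacti plugin is implemented or configured; the metric names and limits are hypothetical values chosen for illustration.

```python
# Hypothetical limits; real thresholds depend on the equipment and links.
THRESHOLDS = {
    "cpu_percent": 80,
    "temperature_c": 45,
    "link_utilisation_percent": 90,
    "bgp_prefixes": 100,
}


def breached(samples: dict, thresholds: dict) -> list:
    """Return the sorted names of metrics whose sample exceeds its limit.

    Metrics missing from `samples` are treated as 0 and never trigger.
    """
    return sorted(
        name for name, limit in thresholds.items()
        if samples.get(name, 0) > limit
    )
```

Each name returned would correspond to one warning, such as a congested link or a router receiving more BGP prefixes than expected.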

SLIDE 31

Superlinks

It allows us to add new tabs
Useful to integrate RedIRIS graphs in the same environment

Boost

It stores the visited graphs in the cache for 5 minutes
It doesn’t generate all the graphs

SLIDE 32

Link2BDCOPS

It adds an icon next to each graph; clicking it shows the data of the technical and administrative contacts
Programmed by our engineers
Linked to our database
Only the internal Cacti has access to it

SLIDE 33

Some weather maps: occupation of the lines

[Weather map image; label: Provider]

SLIDE 34

Some weather maps: Link and occupation

SLIDE 35

Creating a new code

[Screenshot with steps numbered 1 to 5]

SLIDE 36

Cacti

Pros

  • It’s very useful to detect problems
  • It’s very useful to “see” the network while it’s working
  • It makes the 24x7 service easier
  • It simplifies the generation of monthly reports
  • Graph templates are useful

Cons

  • Groups of users are hard to manage
  • The creation of Graph Templates requires time and dedication
  • The user interface works better if you don’t have a large amount of data.

SLIDE 37

PerfSonar

We’re beginning to use it.
Initially installed for the LHC project
Uses the installable DVD version from RedIRIS
Coordinated through RedIRIS
Other tools, like NDT, also installed, for the measurement of the network by our users
Our NOC is subscribed to the Dante E2ECU (End to end coordination unit) mailing list

SLIDE 38

PerfSonar

Pros

  • Good for inter-domain monitoring of L2 circuits (LHC)
  • Very powerful if all the tools are used

Cons

  • Installing it wasn’t easy at all…

SLIDE 39

SMARTxAC

Traffic Monitoring System for Anella Científica (Sistema de Monitorització de Tràfic per l’Anella Científica).
It’s a passive monitoring and analysis system, tailor-made for Anella Científica by the Advanced Broadband Communications Service of the Technical University of Catalonia (UPC-CCABA).
Usable for other high-speed networks.
Since 2003, SMARTxAC has been used for continuously monitoring Anella Científica.
Passive splitters and cards for every external link.

SLIDE 40

SMARTxAC: Topology and splitters

[Diagram: splitters feeding capture servers (Endace cards) for analysis and monitoring. Labelled elements: Campus Nord, Telvent, Operator, Special projects, Local connections, Juniper M320 Level 3 (RedIRIS), Nortel Level 2 (RedIRIS), Catalyst 6500 Level 2/3, Catalyst 6500 Level 2, Catalyst 6500 Level 3]

SLIDE 41

SMARTxAC

Pros

  • It captures ALL the headers through the regular traffic links
  • Very useful to detect problems that happened hours ago
  • Traffic is classified
  • It can detect different types of application

Cons

  • The 10 Gbps cards are very expensive
  • New interfaces require more programming and more cards

SLIDE 42

The NAM, Network Analysis Module

It’s a module of the Catalyst 6500
Similar to a SPAN port + server with ethereal/wireshark
It allows us to capture all the traffic in a certain period
The results help us to find the origin of attacks or security problems, black holes, etc.
2 simultaneous captures

Source: http://www.cisco.com

SLIDE 43

The NAM, Network Analysis Module

Pros

  • Very easy to use (web-based interface)
  • Analysis in real-time of what’s happening on the network
  • The capture can be saved in “ethereal format”
  • It can monitor physical and logical interfaces, like VLANs
  • It monitors ALL the traffic
  • Filters can be applied before the capture

Cons

  • It’s a proprietary solution
  • It can only monitor interfaces of 1 Gbps or less
  • It’s used once a problem has started

SLIDE 44

Other tools

MGEN to send large amounts of traffic over the links and check whether we can fill them with UDP traffic
Direct access to some tools that our providers give us:

  • HP Openview
  • Cacti statistics
  • Management of VLAN

Iperf, Netmate, Pathrate, Nagios, Zabbix, MTR

SLIDE 45

The most common incidents & requests

Incidents:

  • Power cuts at the institution
  • Radiolinks & ADSL
  • Last mile fibre cuts
  • Crazy firewalls…
  • DoS attacks

Other requests

  • Multicast tests
  • New circuits
  • Routing
  • DNS
  • Redundancy

SLIDE 46

Agenda

About CESCA and Anella Científica

Anella Científica/CESCA NOC:

  • Communication with the users
  • How we manage the network
  • How we manage dedicated circuits

Tools

  • Communications database
  • Ad-hoc scripts
  • Cacti & its plugins
  • PerfSonar
  • SMARTxAC
  • NAM
  • Other tools

Conclusions

SLIDE 47

Conclusions

Our RREN has to face the problems of small entities, big universities and research centres, and very important projects with dedicated lambdas that traverse several domains
RT for incidents
At least a database for data
At least a monitoring tool
At least an analysis tool
New models with dark fibre require new management models for the NOC
No single tool

SLIDE 48

Thanks for your attention!

Questions? Suggestions?

igandia@cesca.cat