Introduction
- Harsh realities of network analytics
- netbeam
- Demo
- Technology Stack
- Alternative Approaches
- Lessons Learned
ESnet Data, Analytics and Visualization Architecture
The Harsh Realities of Network Analytics
- 1. It’s a mess
- 2. Things change
- 3. There’s always more
- 4. It’s never really done
- Your data isn’t neat and tidy
- Time and money are limited
- More devices & more telemetry
- What you need today may not be what you need tomorrow.
Coping strategies
- 1. It’s a mess
- 2. Things change
- 3. There’s always more
- 4. It’s never really done
- Design knowing things won’t be tidy
- “What” not “How”
- Rely on the cloud for scaling
- Keep raw data to keep your options open
netbeam
Network Analytics in Google Cloud

Three Pillars
1. Real time analytics
○ Low latency, incomplete
2. Offline analytics
○ High latency, complete
3. Flexible data model
○ Changing needs? Recompute from raw data!
Secret sauce: Apache Beam
What is Apache Beam?
1. The Beam Programming Model
2. SDKs for writing Beam pipelines
3. Runners for existing distributed processing backends
○ Apache Apex
○ Apache Flink
○ Apache Spark
○ Google Cloud Dataflow
○ Local runner for testing
Slide courtesy of the Apache Beam Project
The Evolution of Apache Beam
(Lineage diagram: MapReduce → BigTable, Dremel, Colossus, Flume, Megastore, Spanner, PubSub, Millwheel → Google Cloud Dataflow → Apache Beam)
Slide courtesy of the Apache Beam Project
Architecture Diagram
(Diagram components: SNMP collection system, Apache Beam (Stream Processing), BigQuery (immutable), Bigtable (realtime), Apache Beam (Batch Processing), BigQuery (historical), API, Client, old SNMP system importing via avro)
Architecture Diagram: Google Pub/Sub
- Uses Python outside of Google Cloud to poll devices and write to a Pub/Sub topic
- Code within Google Cloud subscribes to the topic to process data
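As a minimal sketch of the poller side: Pub/Sub messages carry opaque bytes, so the external Python poller just needs a stable serialization the Beam pipeline can decode. The field names below are hypothetical, not netbeam's actual schema.

```python
import json

def encode_sample(device, interface, timestamp, in_octets, out_octets):
    """Serialize one SNMP counter sample as JSON bytes for a Pub/Sub message.

    Field names are illustrative; any stable schema the downstream
    Beam pipeline can decode will do.
    """
    record = {
        "device": device,
        "interface": interface,
        "timestamp": timestamp,   # epoch seconds of the poll
        "inOctets": in_octets,    # raw counter values, not rates
        "outOctets": out_octets,
    }
    return json.dumps(record, sort_keys=True).encode("utf-8")

# The poller would hand these bytes to the Pub/Sub publisher client.
payload = encode_sample("router1", "xe-0/0/0", 1504000000, 123456789, 987654321)
```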
Architecture Diagram: Apache Beam / Google Dataflow
- Stream processing
- Subscribes to Pub/Sub topic
Architecture Diagram: Apache Beam / Google Dataflow
- Stream processing
- Subscribes to Pub/Sub topic
- Raw data is written to BigQuery
Architecture Diagram: Apache Beam / Google Dataflow
- Stream processing
- Subscribes to Pub/Sub topic
- Raw data is written to BigQuery
- Real time transformed data (e.g. aligned data rates) written to Bigtable
- Writes and makes use of metadata in Bigtable (not shown)
Architecture Diagram: Cloud Bigtable
- Like HBase
- Write to cells in rows, indexed by keys
- We write 1 day of data to a single row (columns are the time of day, key is metric and day)
- Fast access to row by key, can serve data from here
- Store one year
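The one-row-per-day layout can be sketched in plain Python. The key and qualifier formats here are hypothetical stand-ins for netbeam's actual scheme; the point is that a (metric, day) pair names one row and the time of day names the cell within it.

```python
from datetime import datetime, timezone

def row_key(metric, ts):
    """Hypothetical row key: one Bigtable row per metric per UTC day."""
    day = datetime.fromtimestamp(ts, tz=timezone.utc).strftime("%Y-%m-%d")
    return f"{metric}::{day}"

def column_qualifier(ts):
    """Column is the offset into the day, so a 30s poll cadence
    yields at most 2880 cells per row."""
    return str(ts % 86400)

ts = 1504000000  # an example poll time (2017-08-29 UTC)
key = row_key("snmp::router1::xe-0/0/0::in", ts)
```

A whole day of a metric is then a single key lookup, which is what makes serving real-time charts from Bigtable fast.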
Architecture Diagram: BigQuery
- Data warehousing solution
- Cheap storage, SQL access, but not suitable for real-time access
- Allows SQL queries for ad hoc investigation
- We store our source of truth here
Architecture Diagram: BigQuery
- Data warehousing solution
- Cheap storage, SQL access, but not suitable for real-time access
- Allows SQL queries for ad hoc investigation
- We store our source of truth here
- Also store historical data (7 years), imported via avro files
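The kind of ad hoc investigation BigQuery enables can be sketched as building a standard SQL query over the raw table. The table and column names below are hypothetical, not netbeam's actual schema.

```python
def daily_total_query(table, interface, day):
    """Build an ad hoc BigQuery standard SQL query summing raw octet
    counters for one interface on one day.

    `table`, `interface`, `ts` and `in_octets` are illustrative names.
    """
    return (
        "SELECT TIMESTAMP_TRUNC(ts, DAY) AS day, SUM(in_octets) AS total_in "
        f"FROM `{table}` "
        f"WHERE interface = '{interface}' AND DATE(ts) = '{day}' "
        "GROUP BY day"
    )

sql = daily_total_query("project.dataset.snmp_raw", "xe-0/0/0", "2017-08-01")
```

Because the raw data stays in BigQuery, a question like this can be answered years later without having planned for it at ingest time.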
Architecture Diagram: Apache Beam / Google Dataflow
- Batch processing
- Run with cron job
Architecture Diagram: Apache Beam / Google Dataflow
- Batch processing
- Run with cron job
- Recalculate Bigtable data each night from source of truth in BigQuery
Architecture Diagram: Apache Beam / Google Dataflow
- Batch processing
- Run with cron job
- Recalculate Bigtable data each night from source of truth in BigQuery
- Process Bigtable rows into new rows of 5 min, 1 hr and 1 day aggregations
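The aggregation step can be illustrated in plain Python: bucket timestamped samples into fixed windows and average each bucket. This is a sketch of the idea only; the real job does this inside a Beam pipeline over Bigtable rows.

```python
def rollup(samples, window_seconds):
    """Average (timestamp, value) samples into fixed-width windows.

    E.g. window_seconds=300 produces the 5 min rollup; 3600 and 86400
    produce the 1 hr and 1 day rollups the same way.
    """
    buckets = {}
    for ts, value in samples:
        bucket = (ts // window_seconds) * window_seconds
        buckets.setdefault(bucket, []).append(value)
    return {b: sum(v) / len(v) for b, v in sorted(buckets.items())}

samples = [(0, 10.0), (30, 20.0), (300, 40.0)]
five_min = rollup(samples, 300)   # {0: 15.0, 300: 40.0}
```

Recomputing these nightly from the BigQuery source of truth means a bug in the rollup logic can always be fixed retroactively.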
Architecture Diagram: Apache Beam / Google Dataflow
- Batch processing
- Run with cron job
- Recalculate Bigtable data each night from source of truth in BigQuery
- Process Bigtable rows into new rows of 5 min, 1 hr and 1 day aggregations
- Additional pre-computed views, e.g. percentiles for traffic distribution over a month
Architecture Diagram: API
- Currently runs on App Engine
- Node.js
- Serves data out of Bigtable
- Timeseries data is served as ‘tiles’; each tile is one row
- Would like to use Cloud Endpoints and provide a gRPC service
- Looking forward to grpc-web solution
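The tile idea can be sketched simply: since each tile is one Bigtable row, serving a multi-day chart reduces to one key lookup per day. The key format below is hypothetical.

```python
def tile_key(metric, day):
    """Hypothetical tile address: one tile per metric per day,
    each tile backed by a single Bigtable row."""
    return f"tile::{metric}::{day}"

def tiles_for_range(metric, days):
    """A multi-day chart request becomes a small list of row keys
    the API can fetch directly from Bigtable."""
    return [tile_key(metric, d) for d in days]

keys = tiles_for_range("snmp::xe-0/0/0::in", ["2017-08-01", "2017-08-02"])
```

Tiling also makes responses cacheable: a completed day's tile never changes until the nightly recompute rewrites it.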
Use case example: Historical Trends
(Diagram: SNMP collection system streams to BigQuery (historical); per-day interface totals are rolled up into per-month totals in Bigtable; the Dataserver API (node.js) serves clients; old SNMP system imports via avro)

Bigtable rows:

snmp-daily::2017-08::$interface
| Jan 1  | Jan 2  | ... | Dec 31 |
| 1.8 Pb | 1.9 Pb | ... | 3.1 Pb |

snmp-monthly-totals
| Jan 1991 | Feb 1991 | ... | Sep 2017 |
| 28 Gb    | 29 Gb    | ... | 56 Pb    |
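The per-day to per-month rollup shown above can be sketched as a simple fold. This is illustrative only; the real computation runs in the Beam batch pipeline.

```python
def monthly_totals(daily_totals):
    """Fold per-day interface totals (date string -> total) into
    per-month sums, as fed to a row like snmp-monthly-totals."""
    months = {}
    for day, total in daily_totals.items():
        month = day[:7]           # "2017-08-29" -> "2017-08"
        months[month] = months.get(month, 0) + total
    return months

totals = monthly_totals({"2017-08-01": 5, "2017-08-02": 7, "2017-09-01": 3})
# {"2017-08": 12, "2017-09": 3}
```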
Use case: real time anomaly detection
(Diagram: SNMP collection system streams to BigQuery; baseline generation and anomaly detection jobs write Bigtable rows; the Dataserver API (node.js) serves clients)

Bigtable rows:

baseline::5m::avg::$interface
| Mon 12am | Mon 1am | Mon 2am | ... | Sun 11pm |
| 2.1      | 1.9     | 0.3     | ... | 0.5      |

anomaly::5m::avg
| iface-1 | iface-2 | ... | iface-n |
| +0.1    | +2.0    | ... | -1.5    |

- Baseline generation: generates the average for each interface over the past 3 months for that hour/day
- Anomaly detection: compares the baseline to real time values to generate the current deviation from normal
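The comparison step can be sketched as a one-liner: look up the baseline cell for the current hour/day-of-week slot and subtract. This is an illustration of the idea, not the production scoring logic.

```python
def deviation(baseline, current):
    """Current 5m average minus the 3-month baseline for the same
    hour/day-of-week slot; positive means traffic above normal."""
    return round(current - baseline, 2)

# Baseline cell for Mon 1am is 1.9; a current 5m average of 3.9
# gives a deviation of +2.0, like iface-2 in the example row above.
dev = deviation(1.9, 3.9)
```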
Use case example: Percentiles
(Diagram: SNMP collection system streams to Bigtable; daily rollups of 5m averages feed percentile views; the Dataserver API (node.js) serves clients)

Bigtable rows:

rollup-month-5m::2017-08::$interface::in
| 1      | 2      | ... | 8640   |
| 6 Gbps | 5 Gbps | ... | 2 Gbps |

percentiles::2017-08::$interface::in
| 1 pct    | 2 pct    | ... | 99 pct    |
| 0.1 Gbps | 0.3 Gbps | ... | 22.1 Gbps |
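A month of 5m averages (8640 samples) reduces to a percentiles row like the one above. As a sketch, using the nearest-rank method (the production job computes its views in the Beam batch pipeline and may use a different method):

```python
def percentile(values, pct):
    """Nearest-rank percentile: sort, then take the value at the
    ceiling of pct% of the sample count."""
    ordered = sorted(values)
    rank = max(1, -(-pct * len(ordered) // 100))  # integer ceiling
    return ordered[rank - 1]

month = list(range(1, 101))   # stand-in for 8640 5m averages
p99 = percentile(month, 99)
```

Precomputing these once per month means a client asking "what does a typical vs. peak day look like?" never triggers a scan of the raw samples.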
Example: Computing Total Traffic
# Python Beam SDK
import apache_beam as beam
from apache_beam.io import ReadFromText

# FormatCSVDoFn, group_by_device_interface and compute_rate are
# defined in the full code linked below.
pipeline = beam.Pipeline('DirectRunner')
(pipeline
 | 'read' >> ReadFromText('./example.csv')
 | 'csv' >> beam.ParDo(FormatCSVDoFn())
 | 'ifName key' >> beam.Map(group_by_device_interface)
 | 'group by iface' >> beam.GroupByKey()
 | 'compute rate' >> beam.FlatMap(compute_rate)
 | 'timestamp key' >> beam.Map(lambda row: (row['timestamp'], row['rateIn']))
 | 'group by timestamp' >> beam.GroupByKey()
 | 'sum by timestamp' >> beam.Map(lambda rates: (rates[0], sum(rates[1])))
 | 'format' >> beam.Map(lambda row: '{},{}'.format(row[0], row[1]))
 | 'save' >> beam.io.WriteToText('./total_by_timestamp'))
pipeline.run()
Full code available at: http://x1024.net/blog/2017/05/chinog-flexible-network-analytics-in-the-cloud/
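The 'compute rate' step above is where raw counters become rates. As a rough pure-Python sketch of what such a helper does (the function name and wire format here are hypothetical; the linked full code has the real one):

```python
def compute_rate_pairs(samples):
    """Turn consecutive (timestamp, counter) samples into (timestamp, rate).

    SNMP octet counters only ever increase, so the rate is the counter
    delta divided by the time delta, times 8 bits per octet.
    Counter-wrap handling is omitted in this sketch.
    """
    rates = []
    for (t0, c0), (t1, c1) in zip(samples, samples[1:]):
        rates.append((t1, (c1 - c0) * 8 / (t1 - t0)))
    return rates

rates = compute_rate_pairs([(0, 0), (30, 300), (60, 900)])
# [(30, 80.0), (60, 160.0)]
```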
Our Stack
- Apache Beam using Scio
- Google Cloud Platform
○ Dataflow
○ Bigtable
○ BigQuery
○ Pub/Sub
○ App Engine
- Languages
○ Scala
○ Javascript / Typescript
○ Python
Current Status & Future Plans
Current
Release candidate for SNMP data:
- Ingest to BigQuery is working
- Migration of historical data is complete
- Streaming ingest to Bigtable
- Early version of utilization visualization
- Simple data server can provide data to
clients, but gRPC API coming
- Interface time series charts functional
Future
More types of data:
- Flow data
- perfSONAR
- Machine Learning
- Anomaly Detection
- “Mash up” various data sources
Why not InfluxDB, Elastic or ${FAVORITE_DB}
- We have a data processing problem, not a data storage problem per se.
○ Beam and the ecosystem around it give a huge amount of flexibility -- can try new ideas as they occur to us
○ Ability to move to different platform components
○ Machine learning (TensorFlow and others)
- InfluxDB & Elastic
○ Require care and feeding -- have to think about disks and machines, etc.
○ At our last evaluation (a while ago now) InfluxDB wasn’t able to keep up with our load -- this may have changed but other benefits outweigh that.
○ Elastic doesn’t seem to be a good fit for long term storage -- everything is in the “hot” tier
Why the cloud? Why Google Cloud Platform?
Why the cloud?
- Focus on our problems not on infrastructure
- Scalability without needing to own lots of systems
- Managed services for databases and compute
Why Google Cloud?
- Apache Beam was Google Dataflow when we first encountered it
- More cohesive ecosystem than AWS in our experience
- Although we have used Google Cloud specific services, the approach is
portable to other environments
Lessons learned / Life in the cloud / Good & Bad
The Good
- Not a silver bullet, but makes many things easier
- Scaling! We processed 9,902,585,175 data points in 3.5 hours
- Focus on your services, not on infrastructure
- Scio and Scala allow working at a high level of abstraction
The Not So Good
- GCP tech support is pretty bad
- Python is a second class citizen in Beam for now
- Scala is powerful but challenging at times
- Learning curve is pretty steep in places
Thank you!
Jon Dugan <jdugan@es.net>
- MyESnet: https://my.es.net
- ESnet Open Source: http://software.es.net/
○ http://software.es.net/react-timeseries-charts/
○ http://software.es.net/pond/
○ http://software.es.net/react-network-diagrams/
- Scio: https://github.com/spotify/scio
- Beam: https://beam.apache.org
The ESnet netbeam team:
- Peter Murphy
- Monte Goode
- Sowmya Balasubramanian
- Scott Richmond