CSE Seminar Series Networks and Distributed Systems (+ Data as - - PowerPoint PPT Presentation

cse seminar series networks and distributed systems
SMART_READER_LITE
LIVE PREVIEW

CSE Seminar Series Networks and Distributed Systems (+ Data as - - PowerPoint PPT Presentation

CSE Seminar Series Networks and Distributed Systems (+ Data as bonus - mainly Big) 2019-03-07 Marina Papatriantafilou Networks and Systems Division CSE Department Chalmers & Gothenburg Un. 1 M. Papatriantafilou Networks,


slide-1
SLIDE 1
  • M. Papatriantafilou – Networks, Distributed Systems & Data

CSE Seminar Series Networks and Distributed Systems

(+ Data as ”bonus” - mainly Big)

2019-03-07 Marina Papatriantafilou

Networks and Systems Division CSE Department Chalmers & Gothenburg Un.

1

slide-2
SLIDE 2
  • M. Papatriantafilou – Networks, Distributed Systems & Data

What’s a distributed system?

“A distributed system is one in which the failure of a computer you didn't even know existed can render your own computer unusable.”

Leslie Lamport

What is a Distributed System?

slide-3
SLIDE 3
  • M. Papatriantafilou – Networks, Distributed Systems & Data

A Distributed System

A set of computing&communicating processes, collaborating for acheiving local and/or global goals

slide-4
SLIDE 4
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Distributed Systems?

4

Figs: Computer Networking: A Top Down Approach by Kurose&Ross; robocup.org; Chalmers Gulliver prj by E. Schilller; ebgames.ca

slide-5
SLIDE 5
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Layered system perception

Interconnection network (can be sharing memory if in single box) Computing+ communicating unit

slide-6
SLIDE 6
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Distributed Systems vs. Networks

  • Networking is worried about

– Sending a message from here to there – Not what you do with the message – We teach you how networks are built and how they function

  • Distributed Systems

– Assume: There is a way to communicate – Focus: How you build a system using those messages – We teach you what things to do with a network

slide-7
SLIDE 7
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Inter-net-working, Data processing and Distributed Computing in interplay in IoT

A lot of data to be communicated, distributed, processed

Example study topics in these domains

  • Send, share data
  • Aggregate-data/monitor @local-level
  • Learn data-patterns @data-center, @local-level
  • Ensure consistency/synchronization among

copies @updates

Figs:://www.iebmedia.com; Vincenzo Gulisano / Rocio Rodriguez

7

slide-8
SLIDE 8
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Let’s hit the road

8

Overview Some history Present and projection to the future Possibilities in our curriculum Some course-related info Our research team and highlights of results & projects

slide-9
SLIDE 9
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Distributed system synchronization:

  • nce upon a time…
  • [Dijkstra 1965]: Dining philosophers:

example problem in concurrent algorithms&systems to illustrate synchronization issues and techniques for resolving them

  • exam exercise , presented in terms
  • f computers competing for access

to tape drive peripherals

9

Fig Wikipedia

slide-10
SLIDE 10
  • M. Papatriantafilou – Networks, Distributed Systems & Data

”Internet”: once upon a time ….

10

Leonard Kleinrock (now prof Emeritus, UCLA) about the Internet:

slide-11
SLIDE 11
  • M. Papatriantafilou – Networks, Distributed Systems & Data

and later …

11

Adapted from slides on the Computer Systems and Networks Masters program by O. Landsieldel

slide-12
SLIDE 12
  • M. Papatriantafilou – Networks, Distributed Systems & Data

12

… and later …

slide-13
SLIDE 13
  • M. Papatriantafilou – Networks, Distributed Systems & Data

How was this enabled? (examples)

13

slide-14
SLIDE 14
  • M. Papatriantafilou – Networks, Distributed Systems & Data

14

How was this enabled? (examples cont)

70-80’s: foundations about time and coordination in distributed systems; concurrent R/W shared data; wait/lock-free algorithms [Courtois, Heymans, Parnas] [Misra] [Lamport] : asynchronous HW?! Leslie Lamport: Turing award winner 2013 for his work on distributed systems synchronization, consistency, robustness

slide-15
SLIDE 15
  • M. Papatriantafilou – Networks, Distributed Systems & Data

How was this enabled (examples cont):

15

Adapted from slides CSE Seminar 2018 by O. Landsieldel

slide-16
SLIDE 16
  • M. Papatriantafilou – Networks, Distributed Systems & Data

16

How was this enabled? (examples cont)

2000’s: p2p applications, social networks, Content Distribution Networks, … ; multi/many-core data processing; asynchronous hardware! Distributed (inclulind parallel, multicore) systems hold hands with Networks

slide-17
SLIDE 17
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Roadmap

17

Overview Some history Present and projection to the future Possibilities in our curriculum Some course-related info Our research team and highlights of results & projects

slide-18
SLIDE 18
  • M. Papatriantafilou – Networks, Distributed Systems & Data

The Future is Distributed

The Future is Distributed

Adapted from slides CSE seminar O. Landsiedel

Manycores,synchronous hardware

slide-19
SLIDE 19
  • M. Papatriantafilou – Networks, Distributed Systems & Data
  • Networks and Distributed Systems touch significant aspects of daily life!

– Integral building block for our networked society

The Future is Distributed

slide-20
SLIDE 20
  • M. Papatriantafilou – Networks, Distributed Systems & Data

What Makes a Smart City?

Multiple Applications Create BigData

Connected Factory

1 PB per day (0.2% transmitted)

Connected Plane

40 TB per day (0.1% transmitted)

Public Safety

50 PB per day (<0.1% transmitted)

IntelligentBuilding

275 GB per day (1% transmitted)

Smart Hospital

5 TB per day (0.1% transmitted)

SmartCar

70 GB per day (0.1% transmitted)

A city of

  • ne million

will generate 200 million gigabytes

  • f data per day

by 2020

SmartGrid

5 GB per day (1% transmitted)

Weather Sensors

10 MB per day (5% transmitted)

Back to Index

Source: Cisco Global Cloud Index, 2015–2020

slide-21
SLIDE 21
  • M. Papatriantafilou – Networks, Distributed Systems & Data

60 15 5 7 9 11 13 15

40 30 20 10 70 60 50

2015 2016 2017 2018 2019 2020

Useable Data Created per Year Data Center Traffic per Year

48 38 29 21

Data Created vs. Data Center Traffic

Data Created Outpaced

Zettabytes perYear

Opportunityfor Edge or Fog Computing

Source: Cisco Global Cloud Index, 2015–2020

Summary:

Networks & Data: Big => Distributed Computing and Systems: ”break” big problems into smaller, local ones

Michael Stonebraker: Turing award winner 2014 for his work on stream-data procsessing, enabling in-network data processing and revolutionizing database systems

slide-22
SLIDE 22
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Roadmap

22

Overview Some history Present and projection to the future Possibilities in our curriculum Some course-related info Our research team and highlights of results & projects

slide-23
SLIDE 23
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Putting things together

Offererd in our curriculum:

Continuous stream processing and analysis incl. data mining/applied ML

slide-24
SLIDE 24
  • M. Papatriantafilou – Networks, Distributed Systems & Data

e.g.: MS prorgam @CTH, Specialization options @GU

24

slide-25
SLIDE 25
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Courses

25

LP1, 7.5hec

slide-26
SLIDE 26
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Roadmap

26

Overview Some history Present and projection to the future Possibilities in our curriculum Some course-related info Our research team and highlights of results & projects

slide-27
SLIDE 27
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Operating Systems Course

27

slide-28
SLIDE 28
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Courses Distributed Systems

You learn: How to build large-scale distributed systems and the associated challenges, principles & practice

  • eg CAP thm [Brewer’s conjecture 1998; Gilbert&Lynch2002 proof]
  • Eg. applied in Spotify’s, Amazon’s systems: partitioning of servers happens!

=> eventual consistency in distributed state [CRDTs: Shapiro et-al]

28

slide-29
SLIDE 29
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Courses Computer Communication

You get knowledge to build the basis … … to follow continuous evolution ….

Multimedia

slide-30
SLIDE 30
  • M. Papatriantafilou – Networks, Distributed Systems & Data

data plane control plane

Software-Defined Networks: logically separated control plane

Remote Controller CA

CA CA CA CA

31

compute tables separately, in data-center/distributed system and distribute i.e. you learn how to support Networks with Distributed systems and Data Processing

Courses Computer Communication, example content

slide-31
SLIDE 31
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Project-based courses: eg ICT in data-driven cyberphysical systems

Example projects: e. in the context of smart grid systems

Context: paradigm shift from “adapt generation to demand “to “adapt consumption to availability +”

32

Power flow

Picture: G. Georgiadis

slide-32
SLIDE 32
  • M. Papatriantafilou – Networks, Distributed Systems & Data

33

The project: Adaptive, autonomous and collective load balancing

Supply increasing uncertainty Demand … … … … … …

The goal:

Shaping streaming demands to streaming supply, taking into account energy storage options and consumption/generation data

Project-based courses: eg ICT in data-driven cyberphysical systems

Example projects:

slide-33
SLIDE 33
  • M. Papatriantafilou – Networks, Distributed Systems & Data

34

The project: Reliability of RT object detection: Goal: understand limits of ML-processing with noisy data on embedded GPU platforms

Figs: report A. Mosshammer, C vRosen Johansson, M. Romain

Project-based courses: eg ICT in data-driven cyberphysical systems

Example projects:

slide-34
SLIDE 34
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Highlights

Guest lectures by: Ericsson research, Volvo, Spotify, FlexLink, ABB, Göteborg Energi, Svenska Kraftnät and more Example empolyments of graduates: Spotify, Volvo, Zenuity, Ericsson research, RISE, ABB, academic institutes internationally (Max Plank Inst, Purdue Un, EPFL, …)

35

  • Masters projects with relevance for industry and academia, including

publishable work

  • Comments of appreciaton in course evaluations
  • Continued contacts after graduation
slide-35
SLIDE 35
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Roadmap

36

Overview Some history Present and projection to the future Possibilities in our curriculum Some course-related info Our research team and highlights of results & projects

slide-36
SLIDE 36
  • M. Papatriantafilou – Networks, Distributed Systems & Data

On-the-fly/stream data processing & analysis

  • Data validation,

monitoring, …

  • -Security, privacy

Energy, production, transport

  • data-driven

distributed monitoring, resource matching

  • Microgrids demo

work Energy/efficient computation parallel/multicore/ processing, incl. on embedded processors Vehicular systems

  • data-driven situation-

awareness

  • communication

&coordination, e.g. virtual traffic-lights

  • Gulliver testbed

Our research DCS @NS division (approx 30 pers):

Security, dependa bility

Distributed

systems, IoT Parallel &stream computing

  • Fig. Giorgos Georgiadis/ Vincenzo Gulisano / Rocio Rodriguez / Chalmers Magazine Gulliver project/Elad Schiller

Strong publications records (incl. PODC, ICDE where Turing award talks have been delivered

slide-37
SLIDE 37
  • M. Papatriantafilou – Networks, Distributed Systems & Data

2015: DETERMINISTIC REAL-TIME ANALYTICS OF GEOSPATIAL DATA STREAMS THROUGH SCALEGATE OBJECTS http://www.chalmers.se/en/departments/cse/news/Pages/debs2015.aspx ACM DEBS 2015 Grand Challenge best solution award

  • Top k frequent routes @NY, profitable cells (near-real time window-based streaming)
  • > 110,000 tuples/sec throughput, < 46 msec latency, 1yr data processed in 11 min

Example projects/results: Geospatial monitoring

2017: Maximizing Determinism in Stream Processing Under Latency Constraints ACM DEBS 2017 best paper award [collab of our team with Athens Uni. of Business]

  • Fig. Vincenzo Gulisano / Rocio Rodriguez
  • Fig. Yiannis Nikolakopoulos
slide-38
SLIDE 38
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Example research projects/results: Shared Data Objects for ultra-efficient processing on many-core embedded systems

39

Cache Cache IA Core Shared Local

H/W Evolves

  • Networks on chip (NoC)
  • Limited support for

synchronization primitives New Software for Concurrent access data

  • bjects

Eg ScaleGate [CGNPT14]

LTE (FFT) Uplink benchmark

  • Focus on communication

patterns: n-to-1, n-to-n

  • Up to 42% improvement

in execution cycles

slide-39
SLIDE 39
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Distributed data pre-processing and validation

40

Data-driven alerts analysis NTL / risks

AMI: Advanced Metering Infrastructure

  • Fig. Vincenzo Gulisano / Rocio Rodriguez

STAMINA: Processing & analysis of data STreams in AMI for Awareness and Adaptiveness in electricity grids

Facilitate automated funtionality; ”lower-voltage SCADA”

Possibile quality correlation from distribution and AMI

slide-40
SLIDE 40
  • M. Papatriantafilou – Networks, Distributed Systems & Data
  • Lidar provides cloud of 3D points: high

rate sensor (MBps) – It requires different processing tools (filtering, clustering, segmentation, …)

  • New: We enabled possibilities to extract

useful information from raw data, in real-time streaming, even on embedded hardware [ICDCS2018]

Example recent work/results: Continuous parallel processing of LIDAR data

Fig: Hannah Najdataei

slide-41
SLIDE 41
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Recent & current related research projects of the team

1. EXAMINE: Extracting information out of big data in Advanced Metering Infrastructure (Göteborg Energii); 2. FiC: Future Factories in the Cloud (SSF); 3. EU/FP7 EXCESS on energy-efficient computation in embedded devices (EU; Chalmers’ team coordinator) 4. EU/FP7 KARYON on safety Kernels in vehicular systems; 5. EU/FP7 CRISALIS on security of critical infrastructures; 6. RICS: on resilient information and control systems (MSB); 7. EU Horizon 2020 United Grid - with CTH Elteknik 8. Gulliver: a test-bed for developing, demonstrating and prototyping vehicular systems (SAFER, AoA Trasnsport); 9. Scheme: on Software abstractions for heterogeneous multi-core computers (SSF);

  • 10. EPOC (Energy on Campus, Chalmers Area of Advance Energy and Building Future)
  • 11. iTRANSIT on intelligent traffic management systems (FFI);
  • 12. Fine-grain synchronization and memory consistency in parallel programming (VR);
  • 13. Big Data and IoT for Sustainable living (Sw. Energy Agency);
  • 14. Data-driven and Distributed Algorithms for Safe and Sustainable Traffic (SAFER and Chalmers Transport);
  • 15. Adaptive energy dispatch in Smart Grids (E.ON and Swedish Energy Agency);
  • 16. Concurrent Data Structures for Heterogeneous Parallel Programming (VR);
  • 17. D-SAS: Data summaries and stream processing for autonomous systems (Wallenburg Autonomous Systems and Software Programme - WASP);
  • 18. STAMINA: Processing & analysis of data STreams in AMI for Awareness and Adaptiveness in electricity grids (GE & WASP);
  • 19. OODIDA: On-board Off-board Distributed Data Analytics (Vinnova FFI);
  • 20. HARE: Self-deploying and Adaptive Data Streaming Analytics in Fog Architectures (VR);
  • 21. Models and Techniques for Energy-Efficient Concurrent Data Access Designs (VR)

EXCESS

AoA Building Futures, Energy, ICT, Transport

Faculty members responsible/involved: Magnus Almgren Vincenzo Gulisano Marina Papatriantafilou Elad Schiller Philippas Tsigas

Systems Security Distri- buted systems, IoT Parallel &stream computing

slide-42
SLIDE 42
  • M. Papatriantafilou – Networks, Distributed Systems & Data

Roadmap

43

Overview Some history Present and projection to the future Possibilities in our curriculum Some course-related info Our research team and highlights of results & projects Next: See you at the courses! 