Chapter 1: Communication in Distributed Systems Chapter 2: Basic - PowerPoint PPT Presentation

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Chapter 1: Communication in Distributed Systems Chapter 2: Basic Principles in Distributed Systems Chapter 3: Coordination • Time and Synchronization 3.1: Time and Synchonization • Coordination Algorithms • Universal Coordinated Time • Distributed Transactions • Network Time Protocol: NTP Chapter 4: Fault Tolerance • Logical time and Lamport and Performance Improvements Timestamps Chapter 5: Middleware • Causality and Vector Timestamps • Global states Chapter 3.1: Time and Synchronization 1 Chapter 4: Time and Synchronisation Page 1

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Cooperation and Coordination in Distributed Systems Communication Mechanisms for the communication between processes Naming for searching communication partners But... not enough for cooperation: • Time measurements for optimization of interactions • Synchronization • Ordering of events • Coordination algorithms • Detecting causality violations for mutual access, consensus, … • Consistency in transaction processing • Managing groups of replicated objects More complicated problems than in central systems! Chapter 3.1: Time and Synchronization 2 Chapter 4: Time and Synchronisation Page 2

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme The Role of Time A distributed system consists of a number of processes • Each process has a state (values of variables) • Each process takes actions to change its state, or to communicate with other processes (send, receive) • An event is the occurrence of an action • Events within a process can be ordered by the time of occurrence • In distributed systems, also the time order of events on different machines and between different processes has to be known Needed: concept of “global time”, i.e. local clocks of machines have to be synchronized • Synchronization based on actual (absolute) time • Synchronization by relative ordering of events • Distributed global states Chapter 3.1: Time and Synchronization 3 Chapter 4: Time and Synchronisation Page 3

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Clock Synchronization • Clocks in distributed systems are independent • Some (or even all) clocks are inaccurate • When each machine has its own clock, an event that occurred after another event may nevertheless be assigned an earlier time. • How to determine the right sequence of events? • Example Compiler – synchronization is needed considering the absolute time on all machines: How can we - synchronize clocks with real world? - synchronize clocks with each other? Chapter 3.1: Time and Synchronization 4 Chapter 4: Time and Synchronisation Page 4

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Clocks Necessary for synchronization: assign a timestamp with each event But... how to determine the own resp. all other times in the system? Network • Skew : the difference between the times on two clocks (at any instant) • Computer clocks are subject to clock drift (they count time at different speeds) • Clock drift rate : the difference per unit of time from some ideal reference clock • Ordinary quartz clocks drift by about 1 sec in 11-12 days. (10 -6 secs/sec). • High precision quartz clocks drift rate is about 10 -7 or 10 -8 secs/sec Chapter 3.1: Time and Synchronization 5 Chapter 4: Time and Synchronisation Page 5

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Universal Coordinated Time (UTC) • International Atomic Time is based on very accurate atomic clocks (drift rate 10 -13 ). Problem: “Atomic day” is 3 msec shorter than a solar day • UTC is an international standard for time keeping solving this problem • It is based on atomic time, but occasionally adjusted to astronomical time: when the difference to the solar time grows up 800 msec, an additional leap second is inserted • It is broadcasted from radio stations on land and satellite (e.g. GPS) • Computers with receivers can synchronise their clocks with these timing signals ( But: only a small fraction of all computers have such receivers! ) • Problem with received UTC: propagation delay has to be considered � Signals from land-based stations are accurate to about 0.1-10 milliseconds � Signals from GPS are accurate to about 1 microsecond Chapter 3.1: Time and Synchronization 6 Chapter 4: Time and Synchronisation Page 6

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Clock Synchronization Algorithms • Universal Coordinated Time (as reference time): t • Clock time on machine p : C p (t) • Perfect world: C p (t) = t , i.e. dC / dt = 1 ⇒ Reality: there is a clock drift so that a maximum drift rate can be specified: ρ : 1 - ρ ≤ dC / dt ≤ 1 + ρ • Needed for synchronization: definition of a tolerable skew, the maximum time drift δ • With this, re-synchronization has to be made in certain intervals: all δ /2 ρ seconds • How to make such a re-synchronization? Chapter 3.1: Time and Synchronization 7 Chapter 4: Time and Synchronisation Page 7

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Cristian's Algorithm • There is one central time server T with a UTC receiver • All other machines M are contacting the time server at least all δ /2 ρ seconds • T responds as fast as it can M computes current time: t send M • Hold time t send for sending the Both values are measured with request time? the same clock } • Measure time when response t response t UTC T with t UTC arrives ( t receive ) • Subtract service time t response of T t UTC • Divide by two to consider only M t receive the time since the reply was sent • Add 'delivery time' to the time t receive – t send – t response t UTC sent by T t synchronous = t UTC + 2 • Result t synchronous becomes new Consider message run-time, avoid M's time to be system time moved back Chapter 3.1: Time and Synchronization 8 Chapter 4: Time and Synchronisation Page 8

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme The Berkeley Algorithm 10:28 Another approach (Berkeley Unix): 10:28 1 2 d=0 T T • active time server d=-2 d=-6 10:28 10:28 • logical synchronization 10:28 d=4 M 1 M 3 1. time server sends its time to all M 1 M 3 machines 10:22 10:26 10:22 10:26 2. the machines answer with their M 2 M 2 current deviation from the time 10:32 10:32 server 10:28 3 3. the time server sums up all 10:28 , s.d. 4 d = -1 deviations and divides by the T T number of machines (including +5 +1 itself!) M 1 M 3 -5 M 1 M 3 4. the new time for each machine is given by the mean time 10:22 10:26 10:27 10:27 M 2 M 2 Important: fast clocks are not moved back, but instructed to 10:32 10:32 , slow down move slower Chapter 3.1: Time and Synchronization 9 Chapter 4: Time and Synchronisation Page 9

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Distributed Algorithms Problem with Cristian/Berkeley: use of a centralized server ; mainly used in Intranets Simple mechanism for decentralized synchronization (based on Berkeley Algorithm): • Divide time into fixed-length synchronization intervals • At the beginning of each interval all machines � Broadcast their current time � Collect all values of other machines arriving in a given time span � Compute the new time - by simply averaging all answers, or - by discarding the m highest and the m lowest answers before averaging (to protect against faulty clocks), or - by averaging values corrected by an estimation of their propagation time. • ... but: in large-scale networks, the broadcasting could become a problem widely used algorithm in the Internet: Network Time Protocol (NTP) Chapter 3.1: Time and Synchronization 10 Chapter 4: Time and Synchronisation Page 10

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Network Time Protocol (NTP) NTP is a time service designed for the Internet • Reliability by using redundant paths • Scalable to large number of clients and servers • Authenticates time sources to protect against wrong time data • NTP is provided by a network of time servers distributed across the Internet • Hierarchical structure: synchronization subnet tree Primary servers are connected to UTC sources Secondary servers are synchronized to primary servers (Synchronization subnet ) Lowest level servers in users’ computers, synchronised to 1 More accurate time secondary servers Note: this is 2 2 only an example, there can be more than three 3 3 3 layers Chapter 3.1: Time and Synchronization 11 Chapter 4: Time and Synchronisation Page 11

Lehrstuhl für Informatik 4 Kommunikation und verteilte Systeme Network Time Protocol (NTP) GPS Synchronized Atomic clock Secondary • LAN cluster • servers Stratum-4 Backup path Primary servers Client Stratum-1 Stratum-2 Stratum-3 • Exchange of timestamps between time servers and clients via UDP • Levels in the synchronization subtree also are called Stratum Chapter 3.1: Time and Synchronization 12 Chapter 4: Time and Synchronisation Page 12

Chapter 1: Communication in Distributed Systems Chapter 2: Basic - PowerPoint PPT Presentation

Lehrstuhl fr Informatik 4 Kommunikation und verteilte Systeme Chapter 1: Communication in Distributed Systems Chapter 2: Basic Principles in Distributed Systems Chapter 3: Coordination Time and Synchronization 3.1: Time and

Distributed Systems Goals of Distributed Systems 13A. Distributed Systems: Goals & Challenges

Distributed Systems Goals of Distributed Systems 13A. Distributed Systems: Goals & Challenges

Distributed Systems (ICE 601) Distributed Transactions Dongman Lee ICU Class Overview

SK Telecom 1 U U U U U U U- U - - communication - - - - - communication

Distributed File Systems Distributed File Systems A distributed file system (DFS) is a

Introduction to Distributed * Systems Introduction to Distributed * Systems Outline Outline

Introduction to Distributed Systems Introduction to Distributed Systems Outline Outline

Unleashing Talent in A Distributed Workforce C O R E N E T 2 0 2 0 HACKATHON: DISTRIBUTED W O R K

` James R. Wilcox Zach Tatlock Ilya Sergey Distributed Systems Distributed Infrastructure

Unicamp MC714 Distributed Systems Slides by Maarten van Steen, adapted from Distributed Systems,

Distributed Storage Systems part 1 Marko Vukoli Distributed Systems and Cloud Computing This

Coordinating distributed systems Marko Vukoli Distributed Systems and Cloud Computing Previous

Distributed File Systems Issues in Distributed File Service Case Studies: Sun

Session 12 Assessing and Developing Communication SECTION 4: 1 Communication Communication

Distributed Systems of distributed systems Types of Distributed Describe various

WHAT WE TALK ABOUT WHEN WE TALK ABOUT DISTRIBUTED SYSTEMS ALVARO VIDELA DISTRIBUTED SYSTEMS

Antonio Augusto Rocha Presented by: (Guto) 1 Presentation format 4 papers to present

This and upcoming lectures? Well focus on concepts relating to time Time as it can be

Equations, Synchrony, Time, and Collaborative with: Modes Adam Cataldo Patricia Derler

Time and Timers Portions of these overheads taken from: Unix Systems Programming Communication,

Seismic Acquisition with Ocean Bottom Nodes Providing full azimuth seismic images in busy

Netalyzr for Android: Challenges and opportunities Narseo Vallina-Rodriguez Nicholas Weaver

TDDD82 Secure Mobile Systems Lecture 1: Introductjon and Distributed Systems Models Mikael

Spanner : Google's Globally-Distributed Database James Sedgwick and Kayhan Dursun Spanner - A

Chapter 1: Communication in Distributed Systems Chapter 2: Basic - PowerPoint PPT Presentation

Lehrstuhl fr Informatik 4 Kommunikation und verteilte Systeme Chapter 1: Communication in Distributed Systems Chapter 2: Basic Principles in Distributed Systems Chapter 3: Coordination Time and Synchronization 3.1: Time and

Distributed Systems Goals of Distributed Systems 13A. Distributed Systems: Goals &amp; Challenges

Distributed Systems Goals of Distributed Systems 13A. Distributed Systems: Goals &amp; Challenges

Distributed Systems (ICE 601) Distributed Transactions Dongman Lee ICU Class Overview

SK Telecom 1 U U U U U U U- U - - communication - - - - - communication

Distributed File Systems Distributed File Systems A distributed file system (DFS) is a

Introduction to Distributed * Systems Introduction to Distributed * Systems Outline Outline

Introduction to Distributed Systems Introduction to Distributed Systems Outline Outline

Unleashing Talent in A Distributed Workforce C O R E N E T 2 0 2 0 HACKATHON: DISTRIBUTED W O R K

` James R. Wilcox Zach Tatlock Ilya Sergey Distributed Systems Distributed Infrastructure

Unicamp MC714 Distributed Systems Slides by Maarten van Steen, adapted from Distributed Systems,

Distributed Storage Systems part 1 Marko Vukoli Distributed Systems and Cloud Computing This

Coordinating distributed systems Marko Vukoli Distributed Systems and Cloud Computing Previous

Distributed File Systems Issues in Distributed File Service Case Studies: Sun

Session 12 Assessing and Developing Communication SECTION 4: 1 Communication Communication

Distributed Systems of distributed systems Types of Distributed Describe various

WHAT WE TALK ABOUT WHEN WE TALK ABOUT DISTRIBUTED SYSTEMS ALVARO VIDELA DISTRIBUTED SYSTEMS

Antonio Augusto Rocha Presented by: (Guto) 1 Presentation format 4 papers to present

This and upcoming lectures? Well focus on concepts relating to time Time as it can be

Equations, Synchrony, Time, and Collaborative with: Modes Adam Cataldo Patricia Derler

Time and Timers Portions of these overheads taken from: Unix Systems Programming Communication,

Seismic Acquisition with Ocean Bottom Nodes Providing full azimuth seismic images in busy

Netalyzr for Android: Challenges and opportunities Narseo Vallina-Rodriguez Nicholas Weaver

TDDD82 Secure Mobile Systems Lecture 1: Introductjon and Distributed Systems Models Mikael

Spanner : Google's Globally-Distributed Database James Sedgwick and Kayhan Dursun Spanner - A

Distributed Systems Goals of Distributed Systems 13A. Distributed Systems: Goals & Challenges

Distributed Systems Goals of Distributed Systems 13A. Distributed Systems: Goals & Challenges