SLIDE 1

Announcements

P4: Graded – will resolve all project grading issues this week

P5: File Systems

  • Test scripts available
  • Due: Wednesday 12/14 by 9 pm
  • Free extension due date: Friday 12/16 by 9 pm
  • Absolutely no extensions for any reason after that!
  • Fill out the form if you would like a new project partner

Final Exam: Saturday 12/17 at 10:05 am

  • Fill out the exam form if you have academic conflicts

Advanced Topics: Distributed File Systems (NFS, AFS, GFS)
Read as we go along: Chapters 47 and 48

Advanced Topics: Distributed Systems and NFS

Questions answered in this lecture:

  • What is challenging about distributed systems?
  • What is the NFS stateless protocol?
  • What is RPC?
  • How can a reliable messaging protocol be built on unreliable layers?
  • What are idempotent operations and why are they useful?
  • What state is tracked on NFS clients?

UNIVERSITY of WISCONSIN-MADISON Computer Sciences Department

CS 537 Introduction to Operating Systems Andrea C. Arpaci-Dusseau Remzi H. Arpaci-Dusseau

SLIDE 2

What is a Distributed System?

A distributed system is one where a machine I’ve never heard of can cause my program to fail.

— Leslie Lamport

Definition: more than one machine working together to solve a problem

Examples:

  • client/server: web server and web client
  • cluster: page rank computation

Why Go Distributed?

More computing power

  • throughput
  • latency

More storage capacity
Fault tolerance
Data sharing

SLIDE 3

New Challenges

System failure: need to worry about partial failure

Communication failure: network links are unreliable

  • bit errors
  • packet loss
  • link failure

Individual nodes crash and recover

Motivating example: why are network sockets less reliable than pipes?

[Diagram: Writer Process → Pipe (in the kernel) → Reader Process, both on one machine]

SLIDE 4

[Diagram repeated: Writer Process → Pipe → Reader Process on one machine]

SLIDE 5

[Diagram repeated: Writer Process → Pipe → Reader Process on one machine]

SLIDE 6

[Diagram repeated: Writer Process → Pipe → Reader Process on one machine]

SLIDE 7

[Diagram: Writer Process → Pipe → Reader Process on one machine]

write waits for space: if the pipe buffer is full, the kernel blocks the writer until the reader drains data, so nothing is lost.
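To make the pipe's built-in backpressure concrete, here is a minimal sketch (plain POSIX C, not from the slides) that fills a pipe with non-blocking writes until the kernel reports it is full; with the default blocking behavior, that last write would simply wait for the reader instead of failing or losing data.

  /* Sketch: a pipe gives the writer backpressure. With O_NONBLOCK we can
   * observe the "pipe full" condition (EAGAIN); a normal blocking write()
   * would instead just wait until the reader drains data. */
  #include <errno.h>
  #include <fcntl.h>
  #include <stdio.h>
  #include <unistd.h>

  int main(void) {
      int fds[2];
      if (pipe(fds) != 0) { perror("pipe"); return 1; }

      /* Make the write end non-blocking so we can detect a full buffer. */
      fcntl(fds[1], F_SETFL, O_NONBLOCK);

      char buf[4096] = {0};
      long total = 0;
      for (;;) {
          ssize_t n = write(fds[1], buf, sizeof buf);
          if (n < 0 && (errno == EAGAIN || errno == EWOULDBLOCK)) {
              printf("pipe full after %ld bytes; a blocking writer would wait here\n", total);
              break;
          }
          if (n < 0) { perror("write"); return 1; }
          total += n;
      }
      return 0;
  }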

SLIDE 8

[Diagram: the local setup (Writer Process → Pipe → Reader Process) contrasted with a networked setup: Writer Process on Machine A → Network Socket → Router → Reader Process on Machine B]

SLIDE 9

[Diagram: Writer Process on Machine A → Network Socket → Router → Reader Process on Machine B]

What if B's buffer is full? B can't tell the writer on Machine A to stop, and can't allocate more memory. Solution: drop arriving packets on Machine B.

What if the router's buffer is full?

SLIDE 10

[Diagram: Writer Process on Machine A → Network Socket → ?]

From A's view, the network and Machine B are largely a black box.

Messages may get dropped, duplicated, or re-ordered.

Distributed File Systems

File systems are a great use case for distributed systems.

Local FS (FFS, ext3/4, LFS): processes on the same machine access shared files.
Network FS (NFS, AFS): processes on different machines access shared files in the same way.

SLIDE 11

Goals for distributed file systems

Fast + simple crash recovery

  • both clients and file server may crash

Transparent access

  • can’t tell accesses are over the network
  • normal UNIX semantics

Reasonable performance

NFS: Network File System

Think of NFS as more of a protocol than a particular file system.

Many companies have implemented NFS since the 1980s: Oracle/Sun, NetApp, EMC, IBM.

We're looking at NFSv2

  • NFSv4 has many changes

Why look at an older protocol?

  • Simpler, focused goals (simplified crash recovery, stateless)
  • To compare and contrast NFS with AFS (next lecture)

SLIDE 12

Overview

  • Architecture
  • Network API
  • Caching

NFS Architecture

[Diagram: four Clients, each with a Cache, connect via RPC to a File Server that stores data in its Local FS]

RPC: Remote Procedure Call

Clients cache individual blocks of NFS files.

SLIDE 13

General Strategy: Export FS

[Diagram: the Client and the Server each run a Local FS; the Client reaches the Server's exported FS over NFS]

Example mounts on the client:

  • /dev/sda1 on /
  • /dev/sdb1 on /backups
  • AFS on /home/tyler

[Diagram: the client's directory tree: / contains backups (bak1, bak2, bak3), etc, bin, and home/tyler, which contains 537 (p1, p2) and .bashrc]

Mount: attach a device or a file-system protocol at a point in the namespace

SLIDE 14

General Strategy: Export FS

[Diagram: a read issued on the Client travels over NFS to the Server, which services it from its Local FS]

SLIDE 15

Overview

  • Architecture
  • Network API
  • Caching

Strategy 1

Attempt: Wrap regular UNIX system calls using RPC

  • open() on client calls open() on server
  • open() on server returns fd back to client
  • read(fd) on client calls read(fd) on server
  • read(fd) on server returns data back to client

[Diagram: the client's read is forwarded over NFS to the server's Local FS]

SLIDE 16

RPC

Remote Procedure Call

Motivation: what could be easier than calling a function?

Strategy: create wrappers so that calling a function on a remote machine looks like calling a local function

A very common abstraction

RPC

Machine A:

  int main(…) {
      int x = foo("hello");
  }

  int foo(char *msg) {
      send msg to B
      recv msg from B
  }

Machine B:

  int foo(char *msg) { … }

  void foo_listener() {
      while (1) { recv, call foo }
  }

How RPC appears to the programmer

SLIDE 17

RPC: Actual calls

[Same figure: main() and the foo() client wrapper on Machine A; the real foo() and foo_listener() on Machine B]

RPC: Wrappers

[Same figure, highlighting the client wrapper (foo on Machine A) and the server wrapper (foo_listener on Machine B)]

(ignore how messages are sent for now…)

SLIDE 18

RPC Tools

RPC packages help with two roles:

(1) Runtime library

  • Thread pool
  • Socket listeners call functions on server

(2) Stub/wrapper generation at compile time

  • Create wrappers automatically
  • Many tools available (rpcgen, thrift, protobufs)

[Same client/server figure as before]

Wrapper Generation

Wrappers must do conversions:

  • client arguments to message
  • message to server arguments
  • convert server return value to message
  • convert message to client return value

Need uniform endianness (the wrappers handle this)

This conversion is called:

  • marshaling/unmarshaling
  • serializing/deserializing

[Same client/server figure as before]

SLIDE 19

Wrapper Generation: Pointers

Why are pointers problematic? An address passed from the client is not valid on the server.

Solutions?

  • Smart RPC package: follow pointers and copy the data they point to

[Same client/server figure as before]

Back to NFS: Strategy 1

Attempt: wrap regular UNIX system calls using RPC

  • open() on client calls open() on server
  • open() on server returns fd back to client
  • read(fd) on client calls read(fd) on server
  • read(fd) on server returns data back to client

[Diagram: the client's read is forwarded over NFS to the server's Local FS]

SLIDE 20

File Descriptors

[Diagram: Client and Server; the server keeps a table of client fds in memory]

File Descriptors

[Diagram: the client's open() returns fd 2, which the server records in its in-memory client fd table]

  • open() = 2

SLIDE 21

File Descriptors

[Diagram: the client issues read(2); the server uses its remembered client fd table to service the read from its Local FS]

SLIDE 22

Strategy 1 Problems

What about server crashes? (and reboots)

  int fd = open("foo", O_RDONLY);
  read(fd, buf, MAX);
  read(fd, buf, MAX);
  …
  read(fd, buf, MAX);

Server crash! Goal: behave like a slow read.

[Diagram: client fds live in server memory; a read(2) arrives at the server]

Potential Solutions

  1. Run some crash recovery protocol upon reboot
     • Complex
  2. Persist fds on the server's disk
     • Slow for disks
     • How long to keep fds? What if a client crashes? Misbehaves?

[Diagram repeated: client fds in server memory, read(2) in flight]

SLIDE 23

Strategy 2: put all info in requests

Use a "stateless" protocol!

  • server maintains no state about clients
  • server can still keep other state (e.g., cached copies)
  • server can crash and reboot with no correctness problems (just performance)

Eliminate File Descriptors

[Diagram: Client and Server connected over NFS; no fd table on the server]

SLIDE 24

Strategy 2: put all info in requests

Use “stateless” protocol!

  • server maintains no state about clients

Need an API change. One possibility:

  pread(char *path, buf, size, offset);
  pwrite(char *path, buf, size, offset);

Specify the path and offset in each message; the server need not remember anything about clients.

Pros? Cons?

  • Pro: the server can crash and reboot transparently to clients
  • Con: too many path lookups
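A minimal sketch of what a stateless server-side handler for this path-based API might look like (hypothetical helper, not actual NFS server code): every request carries the path and offset, so the server opens, reads, and closes without remembering anything between requests — which is also where the extra path-lookup cost comes from.

  /* Sketch: stateless handling of pread(path, buf, size, offset).
   * The server keeps no per-client state; the price is a full path
   * lookup (open) on every request. Hypothetical helper, not NFS code. */
  #include <fcntl.h>
  #include <sys/types.h>
  #include <unistd.h>

  ssize_t handle_pread(const char *path, char *buf, size_t size, off_t offset) {
      int fd = open(path, O_RDONLY);            /* path lookup on every call */
      if (fd < 0)
          return -1;
      ssize_t n = pread(fd, buf, size, offset); /* offset comes from the client */
      close(fd);                                /* nothing remembered afterward */
      return n;
  }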

Strategy 3: inode requests

  inode = open(char *path);
  pread(inode, buf, size, offset);
  pwrite(inode, buf, size, offset);

This is pretty good! Any correctness problems?

If a file is deleted, its inode number could be reused

  • Inode not guaranteed to be unique over time

SLIDE 25

Strategy 4: file handles

  fh = open(char *path);
  pread(fh, buf, size, offset);
  pwrite(fh, buf, size, offset);

File Handle = <volume ID, inode #, generation #>

Opaque to the client (the client should not interpret its internals)
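A sketch of what such a handle could contain (field names are illustrative; the client treats the whole thing as opaque bytes):

  /* Sketch: an NFS-style file handle. The client never interprets these
   * fields; it just passes the handle back to the server on each request. */
  #include <stdint.h>

  struct file_handle {
      uint32_t volume_id;    /* which exported volume / file system */
      uint32_t inode_num;    /* inode number within that volume */
      uint32_t generation;   /* bumped when the inode is reused, so a stale
                                handle to a deleted file can be detected */
  };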

Can NFS Protocol include Append?

  fh = open(char *path);
  pread(fh, buf, size, offset);
  pwrite(fh, buf, size, offset);
  append(fh, buf, size);

Problem with append()? If the RPC library resends messages, what happens when append() is retried?

Background: why would RPC call the same procedure multiple times?

SLIDE 26

Communication Overview

How are RPCs built on top of messages?

  • How can RPC ensure the remote procedure is called exactly once?

Raw messages: UDP
Reliable messages: TCP

[Same client/server figure as before]

Raw Messages: UDP

UDP: User Datagram Protocol

API:

  • reads and writes over socket file descriptors
  • messages are sent from/to ports to target a process on a machine

Provides minimal reliability features:

  • messages may be lost
  • messages may be reordered
  • messages may be duplicated
  • only protection: checksums to ensure data is not corrupted
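For reference, a minimal sketch of sending one UDP datagram with the standard sockets API (the address and port are made up for the example); nothing here guarantees that the message arrives, arrives once, or arrives in order.

  /* Sketch: sending a single UDP datagram. No connection, no delivery
   * guarantee -- the message may be lost, duplicated, or reordered. */
  #include <arpa/inet.h>
  #include <netinet/in.h>
  #include <string.h>
  #include <sys/socket.h>
  #include <unistd.h>

  int main(void) {
      int s = socket(AF_INET, SOCK_DGRAM, 0);              /* UDP socket */

      struct sockaddr_in dst;
      memset(&dst, 0, sizeof dst);
      dst.sin_family = AF_INET;
      dst.sin_port = htons(9000);                          /* example port */
      inet_pton(AF_INET, "192.168.1.10", &dst.sin_addr);   /* example address */

      const char *msg = "hello";
      sendto(s, msg, strlen(msg), 0, (struct sockaddr *)&dst, sizeof dst);
      close(s);
      return 0;
  }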

SLIDE 27

Raw Messages: UDP

Advantages

  • Lightweight
  • Some applications make better reliability decisions themselves (e.g., video conferencing programs)

Disadvantages

  • More difficult to write applications correctly

Reliable Messages: Layering strategy

TCP: Transmission Control Protocol

Using software, build reliable, logical connections over unreliable connections

  • Make sure each message is received
  • Make sure messages are received in order
  • Make sure no duplicates are received

Techniques:

  • acknowledgment (ACK)

SLIDE 28

Technique #1: ACK

Sender:   [send message] → Receiver: [recv message] [send ACK] → Sender: [recv ACK]

The sender now knows the message was received (ACK).

Sender:   [send message] → … no ACK ever arrives.

The sender doesn't receive an ACK… What to do?

SLIDE 29

Technique #2: Timeout

Sender:   [send message] [start timer] … waiting for ACK … [timer goes off] [resend message] [recv ACK]
Receiver: [recv message] [send ACK]

Lost ACK: Issue 1

How long to wait?

Too long?

  • System feels unresponsive

Too short?

  • Messages needlessly re-sent (duplicates!!)
  • Messages may have been dropped due to an overloaded server; resending makes the overload worse!

[Same sender/receiver timeline as above]
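A sketch of the ACK-plus-timeout pattern on top of a UDP socket (the timeout length and retry count are arbitrary choices, and the ACK is assumed to be any small reply datagram): resend whenever the ACK does not arrive in time.

  /* Sketch: send a message and retry until an ACK arrives or we give up.
   * Assumes 'sock' is a UDP socket and 'dst' is the receiver's address;
   * the 1-second timeout and 5 retries are arbitrary. */
  #include <errno.h>
  #include <netinet/in.h>
  #include <sys/socket.h>
  #include <sys/time.h>

  int send_with_retry(int sock, struct sockaddr_in *dst, const char *msg, size_t len) {
      struct timeval tv = { .tv_sec = 1, .tv_usec = 0 };   /* receive timeout */
      setsockopt(sock, SOL_SOCKET, SO_RCVTIMEO, &tv, sizeof tv);

      char ack[16];
      for (int attempt = 0; attempt < 5; attempt++) {
          sendto(sock, msg, len, 0, (struct sockaddr *)dst, sizeof *dst);

          ssize_t n = recvfrom(sock, ack, sizeof ack, 0, NULL, NULL);
          if (n > 0)
              return 0;                       /* got the ACK: done */
          if (errno != EAGAIN && errno != EWOULDBLOCK)
              return -1;                      /* real error, not a timeout */
          /* Timed out: loop and resend. An adaptive sender would also
           * lengthen the timeout here before retrying. */
      }
      return -1;   /* gave up: the message may or may not have been received */
  }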

SLIDE 30

Lost Ack: Issue 1

How long to wait? One strategy: be adaptive.

  • Adjust the timeout based on how long ACKs usually take
  • For each missing ACK, wait longer between retries

[Same sender/receiver timeline as above]

Lost Ack: Issue 2

What does not receiving an ACK really mean?

SLIDE 31

Case 1: Sender: [send message] [timeout]; Receiver: (never received the message)
Case 2: Sender: [send message] [timeout]; Receiver: [recv message] [send ACK] (the ACK was lost)

Lost ACK: how can the sender tell these two cases apart?

ACK: the message was received exactly once.
No ACK: the message may or may not have been received.

What if the message is a command to increment a counter?

Proposed Solution

Proposal: the sender could send an AckAck so the receiver knows whether to retry sending an ACK.

Sound good?

[Case 2 again: the receiver got the message and sent an ACK, but the sender timed out]

SLIDE 32

Aside: Two Generals’ Problem

Suppose the generals agree after N messages.

Did the arrival of the N'th message change the decision?

  • if yes: then what if the N'th message had been lost?
  • if no: then why bother sending N messages?

[Diagram: general 1 and general 2, with the enemy between them]

Reliable Messages: Layering Strategy

Using software, build reliable, logical connections over unreliable connections

  • Make sure each message is received
  • Make sure messages are received in order
  • Make sure no duplicates are received

Techniques:

  • acknowledgment
  • timeout
  • remember received messages

SLIDE 33

Technique #3: Receiver Remembers Messages

Sender:   [send message] [timeout] [resend message] [recv ACK]
Receiver: [recv message] [send ACK] [ignore duplicate message] [send ACK]

How does the receiver know to ignore the duplicate?

Solutions

Solution 1: remember every message ever received

Solution 2: sequence numbers

  • sender gives each message an increasing, unique sequence number
  • receiver tracks N: it knows it has seen all messages numbered below N
  • receiver also remembers the sequence numbers of messages received after N

Suppose message K is received. TCP suppresses the message if:

  • K < N
  • message K is already buffered (potentially adjusting N)

Sequence numbers also enable TCP to sort messages so they are delivered in order.
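A sketch of the receiver-side bookkeeping with a small fixed window (real TCP uses byte-oriented sequence numbers and a sliding window, but the idea is the same): deliver a message only if its sequence number has not been seen, and advance N as the in-order prefix completes.

  /* Sketch: receiver-side duplicate suppression with sequence numbers.
   * 'next' is the lowest sequence number not yet delivered in order;
   * 'seen' buffers out-of-order arrivals within a small window. */
  #include <stdbool.h>
  #include <stdint.h>

  #define WINDOW 64

  struct receiver {
      uint64_t next;            /* all seq numbers < next already delivered */
      bool     seen[WINDOW];    /* seen[k % WINDOW] for next <= k < next+WINDOW */
  };

  /* Returns true if the message should be accepted, false if it is a
   * duplicate (or outside the window) and must be suppressed. */
  bool accept_message(struct receiver *r, uint64_t seq) {
      if (seq < r->next)
          return false;                       /* old duplicate: drop */
      if (seq >= r->next + WINDOW)
          return false;                       /* too far ahead: drop for now */
      if (r->seen[seq % WINDOW])
          return false;                       /* duplicate within the window */

      r->seen[seq % WINDOW] = true;
      /* Advance 'next' past any in-order prefix we have now completed. */
      while (r->seen[r->next % WINDOW]) {
          r->seen[r->next % WINDOW] = false;
          r->next++;
      }
      return true;
  }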

SLIDE 34

TCP

TCP: Transmission Control Protocol – very popular

  • Based on sequence numbers
  • Buffers and sorts messages so they arrive in order
  • Timeouts are adaptive

RPC Over TCP?

Sender:   [send message] [timeout] [resend message] [recv ACK]
Receiver: [recv message] [send ACK] [ignore duplicate message] [send ACK]

TCP suppresses the repeated message.

Problem: TCP tracks sequence numbers → stateful

If the server crashes, it forgets which RPCs have been executed! It might replay them!

SLIDE 35

RPC over TCP?

Sender:   [call] [tcp send]                               [recv] [ack]
Receiver:        [recv] [ack] [exec call] … [return] [tcp send]

Why wasteful? TCP sends its own ACKs even though the RPC reply itself already tells the caller that the request arrived.

RPC over UDP

Use function return as implicit ACK

  • Piggybacking technique

What if function takes long time?

  • Receiver sends separate ACK

How can receiver suppress duplicates?

  • Must do similar work as TCP…
  • Requires remembering state...

[Same sender/receiver timeline as above]

If the server crashes, it forgets which RPCs have been executed! It might replay them!

SLIDE 36

So: Can NFS Protocol include Append?

  fh = open(char *path);
  pread(fh, buf, size, offset);
  pwrite(fh, buf, size, offset);
  append(fh, buf, size);

Problem with append()? If the RPC library retries, what happens when append() is retried?

append() could wrongly execute multiple times if the server crashes and reboots.

Idempotent Operations

Solution: design the API so there is no harm in executing a function more than once.

If f() is idempotent, then:

f() has the same effect as f(); f(); … f(); f()
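The difference is easy to see with plain POSIX calls (the file name is just for illustration): repeating a pwrite leaves the file in the same state, while repeating an append grows the file each time.

  /* Sketch: pwrite to a fixed offset is idempotent -- retrying it leaves
   * the same bytes in the same place. Appending is not: every retry adds
   * another copy of the data to the end of the file. */
  #include <fcntl.h>
  #include <unistd.h>

  int main(void) {
      int fd = open("example.txt", O_CREAT | O_RDWR, 0644);
      if (fd < 0) return 1;

      /* Idempotent: same file contents no matter how many times this runs. */
      pwrite(fd, "BB", 2, 1);
      pwrite(fd, "BB", 2, 1);

      /* Not idempotent: the file grows by one byte per (re)try. */
      lseek(fd, 0, SEEK_END);
      write(fd, "B", 1);
      lseek(fd, 0, SEEK_END);
      write(fd, "B", 1);

      close(fd);
      return 0;
  }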

SLIDE 37

pwrite is idempotent

  file: AAAA AAAA
  pwrite → ABBA AAAA
  pwrite → ABBA AAAA
  pwrite → ABBA AAAA

append is NOT idempotent

  file: A
  append → AB
  append → ABB
  append → ABBB

SLIDE 38

What operations are Idempotent?

Idempotent

  • any sort of read that doesn’t change anything
  • pwrite

Not idempotent

  • append

What about these?

  • mkdir
  • creat

Strategy 4: file handles

  fh = open(char *path);
  pread(fh, buf, size, offset);
  pwrite(fh, buf, size, offset);
  append(fh, buf, size);

File Handle = <volume ID, inode #, generation #>

SLIDE 39

Strategy 5: client logic

Build the normal UNIX API on the client side on top of the idempotent, RPC-based API.

Client open() creates a local fd object. The local fd object contains:

  • file handle
  • current offset

File Descriptors

[Diagram: the application calls read(5, 1024); the client's local fd table maps fd 5 to {fh = <…>, offset = 123}; the client library issues pread(fh, 123, 1024) over RPC to the server's local FS]
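A sketch of that client-side bookkeeping (names are illustrative, and nfs_pread is a stub standing in for the pread RPC): the local fd object carries the opaque file handle plus the current offset, so an ordinary read() can be translated into an idempotent pread on the server.

  /* Sketch: client-side fd object for Strategy 5. The client keeps the
   * file handle and current offset locally, so the server stays stateless. */
  #include <stddef.h>
  #include <sys/types.h>

  struct file_handle { unsigned char data[32]; };   /* opaque, as in Strategy 4 */

  struct open_file {              /* one per client-side file descriptor */
      struct file_handle fh;      /* which file, as named by the server */
      off_t offset;               /* current offset, tracked on the client */
  };

  /* Stand-in for the pread RPC: a real client would marshal (fh, size,
   * offset) into a message and send it to the server. Here it just
   * pretends the server returned 'size' zero bytes. */
  static ssize_t nfs_pread(struct file_handle *fh, void *buf, size_t size, off_t offset) {
      (void)fh; (void)offset;
      for (size_t i = 0; i < size; i++) ((char *)buf)[i] = 0;
      return (ssize_t)size;
  }

  /* A local read(): translate (fd object, size) into an idempotent pread. */
  ssize_t client_read(struct open_file *f, void *buf, size_t size) {
      ssize_t n = nfs_pread(&f->fh, buf, size, f->offset);
      if (n > 0)
          f->offset += n;         /* only client-side state changes */
      return n;
  }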

SLIDE 40

Overview

  • Architecture
  • Network API
  • Caching

Cache Consistency

NFS can cache data in three places:

  • server memory
  • client disk
  • client memory

How to make sure all versions are in sync?

SLIDE 41

Write Buffers

[Diagram: the client's write passes through a client-side write buffer, over NFS, into a server-side write buffer in front of the server's Local FS]

Writes are often buffered to improve performance; the server might acknowledge a write before the write is pushed to disk. What happens if the server crashes?

Server Write Buffer Lost

  client:      write A to 0, write B to 1, write C to 2
  server mem:  A B C
  server disk: (empty so far)

The server acknowledges each write before it is pushed to disk.

SLIDE 42

Server Write Buffer Lost

  client:      write A to 0, write B to 1, write C to 2
  server mem:  A B C
  server disk: A B C

  client:      … then write X to 0
  server mem:  X B C
  server disk: A B C

The server acknowledges each write before it is pushed to disk.

SLIDE 43

Server Write Buffer Lost

  client:      write A to 0, write B to 1, write C to 2, write X to 0
  server mem:  X B C
  server disk: X B C

  client:      … then write Y to 1
  server mem:  X Y C
  server disk: X B C

The server acknowledges each write before it is pushed to disk.

SLIDE 44

Server Write Buffer Lost

  client:      write A to 0, write B to 1, write C to 2, write X to 0, write Y to 1
  server mem:  (crash! buffered contents lost, including Y)
  server disk: X B C

The server acknowledged each write before it was pushed to disk.

SLIDE 45

Server Write Buffer Lost

  client:      write A to 0, write B to 1, write C to 2, write X to 0, write Y to 1, write Z to 2
  server mem:  Z
  server disk: X B C  →  X B Z

Problem: no write failed, but the disk state (X B Z) doesn't match the file's contents at any point in time (Y is missing).

Solutions?

SLIDE 46

Write Buffers

[Diagram: the client's write travels over NFS to the server's write buffer in front of the Local FS]

Option 1: don't use a server write buffer (persist data to disk before acknowledging the write)

  • Problem: slow!
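A sketch of option 1 from the server's point of view (the handler name is hypothetical): force the data to disk with fsync before acknowledging, so an acknowledged write can never be lost in a crash, at the cost of waiting for the disk on every write.

  /* Sketch: a write handler that does not rely on a volatile write buffer.
   * Data is forced to disk before the write is acknowledged: safe across
   * server crashes, but every write pays for a disk flush. */
  #include <sys/types.h>
  #include <unistd.h>

  int handle_write(int fd, const void *buf, size_t size, off_t offset) {
      if (pwrite(fd, buf, size, offset) != (ssize_t)size)
          return -1;
      if (fsync(fd) != 0)          /* persist before replying to the client */
          return -1;
      return 0;                    /* only now send the acknowledgment */
  }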

Write Buffers

[Diagram: same setup, but the server's write buffer is battery-backed]

Option 2: use a persistent (battery-backed) write buffer

  • More expensive

SLIDE 47

Distributed Cache

[Diagram: Client 1, the Server (Local FS), and Client 2, each with their own cache]

[Diagram: Client 1 reads; block A is now cached on Client 1 and on the Server]

SLIDE 48

Cache

[Diagram: Client 2 also reads; Client 1, the Server, and Client 2 all cache A]

[Diagram: Client 1 writes! Its cached copy becomes B, while the Server and Client 2 still hold A]

"Update Visibility" problem: the server doesn't have the latest version.

What happens if Client 2 (or any other client) reads the data? It sees the old version (different semantics than a local FS).

SLIDE 49

Cache

[Diagram: Client 1 flushes; Client 1 and the Server now cache B, but Client 2 still caches A]

"Stale Cache" problem: Client 2 doesn't have the latest version.

What happens if Client 2 reads the data? It sees the old version (different semantics than a local FS).

[Diagram: after rechecking with the server, Client 2 reads and caches B]

SLIDE 50

Problem 1: Update Visibility

When a client buffers a write, how can the server (and other clients) see the update?

  • The client flushes the cache entry to the server

When should the client perform the flush? (3 reasonable options?)

NFS solution: flush on fd close

[Diagram: Client 1 writes; its cache holds B while the server still has A]

Problem 2: Stale Cache

Problem: Client 2 has a stale copy of the data; how can it get the latest?

One possible solution:

  • If NFS had state, the server could push the update out to the relevant clients

NFS solution:

  • Clients recheck whether their cached copy is current before using the data

[Diagram: the Server caches B; Client 2 still caches A]

SLIDE 51

Stale Cache Solution

The client cache records the time when each data block was fetched (t1).

Before using a data block, the client sends a STAT request to the server:

  • gets the last-modified timestamp for the file (t2) (for the whole file, not the block…)
  • compares it to the cache timestamp
  • refetches the data block if the file changed since it was cached (t2 > t1)

[Diagram: Client 2 (cache: A, fetched at t1) asks the Server (cache: B, last modified at t2) whether its copy is still current]
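A sketch of the client-side check (nfs_getattr stands in for the GETATTR/STAT RPC; for illustration it just stats a local file): compare the server's last-modified time against the time the cached block was fetched, and refetch only if the file changed since.

  /* Sketch: before using a cached block, compare the server's mtime (t2)
   * against the time our cached copy was fetched (t1). */
  #include <stdbool.h>
  #include <sys/stat.h>
  #include <time.h>

  struct cached_block {
      time_t fetched_at;      /* t1: when this block was read from the server */
      /* ... cached data would live here ... */
  };

  /* Stand-in for the GETATTR (STAT) RPC: returns the file's mtime.
   * A real client would send a message to the server instead. */
  static time_t nfs_getattr(const char *path) {
      struct stat st;
      return (stat(path, &st) == 0) ? st.st_mtime : 0;
  }

  /* True if the cached block may be used; false if it must be refetched
   * because the file changed after we cached it (t2 > t1). */
  bool cache_entry_is_fresh(const char *path, const struct cached_block *b) {
      time_t t2 = nfs_getattr(path);
      return t2 <= b->fetched_at;
  }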

Measure then Build

NFS developers found that STAT accounted for 90% of server requests.

Why? Because clients frequently recheck their caches.

SLIDE 52

Reducing Stat Calls

Solution: cache the results of STAT calls. What is the result?

  • Never see updates on the server!

Partial solution: make STAT cache entries expire after a given time (e.g., 3 seconds), i.e., discard t2 at Client 2. What is the result?

  • Could read data that is up to 3 seconds old

[Diagram: Client 2 (cache: A, t1) and the Server (cache: B, t2)]

NFS Summary

NFS handles client and server crashes very well; robust APIs are often:

  • stateless: servers don’t remember clients
  • idempotent: repeating operations gives same results

Caching and write buffering are harder in distributed systems, especially with crashes.

Problems:

  • Consistency model is odd (a client may not see updates until 3 seconds after the file is closed)

  • Scalability limitations as more clients call stat() on server

SLIDE 53

AFS Goals

Primary goal: scalability! (many clients per server)

More reasonable semantics for concurrent file access

AFS Design

NFS: the server exports a local FS.

AFS: the directory tree is stored across many server machines (this helps scalability!)

  • Break the directory tree into "volumes", i.e., partial subtrees

SLIDE 54

Volume Architecture

[Diagram: three Servers holding volumes V1–V6; the collection of servers stores different volumes that together form the directory tree]

[Diagram: same servers; volumes may be moved between servers by an administrator]

SLIDE 55

Volume Architecture

[Diagram: a Client in front of three Servers holding volumes V1–V6]

  • Volumes may be moved by an administrator
  • The client library gives a seamless view of the directory tree by automatically finding volumes
  • Communication is via RPC
  • Servers store data in their local file systems

SLIDE 56

AFS Cache Consistency

  • Update visibility
  • Stale cache

[Diagram: Client 1, the Server, and Client 2 all cache A]

Update Visibility

SLIDE 57

[Diagram: Client 1 writes! Its cache holds B; the Server and Client 2 still have A]

Update Visibility

"Update Visibility" problem: the server doesn't have the latest version.

Update Visibility

NFS solution is to flush blocks

  • on close()
  • other times too – e.g., when low on memory

Problems

  • flushes not atomic (one block at a time)
  • two clients flush at once: mixed data

SLIDE 58

Update Visibility

AFS solution:

  • also flush on close
  • buffer whole files on the local disk; update the file on the server atomically

Concurrent writes?

  • Last writer (i.e., last file closer) wins
  • Never get mixed data on server

[Diagram: Client 1 and the Server now hold B; Client 2 still caches A]

"Stale Cache" problem: Client 2 doesn't have the latest version.

Cache Consistency

SLIDE 59

Stale Cache

NFS rechecks cache entries against the server before using them, unless a check has been done "recently". How recent is "recent"? (about 3 seconds)

  • "Recent" is too long? Clients read old data.
  • "Recent" is too short? The server is overloaded with STAT calls.

Stale Cache

AFS solution: Tell clients when data is overwritten

  • Server must remember which clients have this file open right now

When clients cache data, ask for “callback” from server if changes

  • Clients can use data without checking all the time

The server is no longer stateless!

[Diagram: the Server caches B; Client 2 still caches A]

SLIDE 60

Callbacks: Dealing with STATE

  • What if a client crashes?
  • What if the server runs out of memory?
  • What if the server crashes?

Client Crash

What should the client do after a reboot? (remember, cached data can be on disk too…)

Concern: the client may have missed a notification that a cached copy changed.

  • Option 1: evict everything from the cache
  • Option 2: recheck entries before using them

[Diagram: the Server caches B; Client 2 still caches A]

SLIDE 61

Low Server Memory

Strategy: tell clients you are dropping their callbacks.

What should the client do?

  • Option 1: discard the entry from its cache
  • Option 2: mark the entry for recheck before the next use

[Diagram: the Server caches B; Client 2 still caches A]

Server Crashes

What if the server crashes?

  • Option: tell all clients to recheck all cached data before the next read

Handling server and client crashes without inconsistencies or race conditions is very difficult…

SLIDE 62

Prefetching

The AFS paper notes: "the study by Ousterhout et al. has shown that most files in a 4.2BSD environment are read in their entirety."

What are the implications for the client prefetching policy?

Aggressively prefetch whole files.

Whole-File Caching

Upon open, the AFS client fetches the whole file (even if huge), storing it in local memory or on local disk.

Upon close, the client flushes the file to the server (if the file was written).

Convenient and intuitive semantics:

  • AFS needs to do work only for open/close
  • Only check callback on open, not every read
  • reads/writes are local
  • Use same version of file entire time between open and close

SLIDE 63

AFS Summary

State is useful for scalability, but makes handling crashes hard

  • Server tracks callbacks for clients that have file cached
  • Lose callbacks when server crashes…

Workload drives design: whole-file caching

  • More intuitive semantics (you see the version of the file that existed when the file was opened)

AFS vs NFS Protocols

When will the server be contacted for NFS? For AFS?

What data will be sent? What will each client see?

SLIDE 64

NFS Protocol vs. AFS Protocol