Goals for Today Learning Objective: Learn how to achieve - PowerPoint PPT Presentation

Goals for Today • Learning Objective: • Learn how to achieve reliability + availability in storage!! • Announcements, etc: • MP3 is out! Due April 18th . • MP3 Walkthrough on Monday, slides up later today Reminder : Please put away devices at the start of class 1 CS 423: Operating Systems Design

CS 423   Operating System Design: Reliable Storage Professor Adam Bates Spring 2018 CS 423: Operating Systems Design

Storage is hard ; - ( “In each cluster's first year, it's typical that 1,000 individual machine failures will occur; thousands of hard drive failures will occur ; one power distribution unit will fail, bringing down 500 to 1,000 machines for about 6 hours; 20 racks will fail, each time causing 40 to 80 machines to vanish from the network; 5 racks will "go wonky," with half their network packets missing in action; and the cluster will have to be rewired once, affecting 5 percent of the machines at any given moment over a 2-day span, Dean said. And there's about a 50 percent chance that the cluster will overheat, taking down most of the servers in less than 5 minutes and taking 1 to 2 days to recover.” - Jeff Dean, Google Fellow (2008) CS 423: Operating Systems Design 3

Storage Goals Storage reliability: data fetched is what you stored ■ ■ Problem when machines randomly fail! Storage availability: data is there when you want it ■ ■ Problem when disks randomly fail! ■ More disks => higher probability of some disk failing ■ Data available ~ Prob(disk working)^k ■ If failures are independent and data is spread across k disks ■ For large k, probability system works -> 0 CS 423: Operating Systems Design 4

File System Reliability What can happen if disk loses power or machine software ■ crashes? ■ Some operations in progress may complete ■ Some operations in progress may be lost ■ Overwrite of a block may only partially complete File systems need durability (as a minimum!) ■ ■ Data previously stored can be retrieved (maybe after some recovery step), regardless of failure CS 423: Operating Systems Design 5

Storage Reliability Problem Single logical file operation can involve updates to multiple ■ physical disk blocks ■ inode, indirect block, data block, bitmap, … At a physical level, operations complete one at a time ■ ■ Want concurrent operations for performance How do we guarantee consistency regardless of when crash ■ occurs? CS 423: Operating Systems Design 6

Transaction Concept A transaction is a grouping of low-level operations that are related to a single ■ logical operation Transactions are atomic — operations appear to happen as a group, or not at ■ all (at logical level) At physical level of course, only a single disk/flash write is atomic ■ Transactions are durable — operations that complete stay completed ■ Future failures do not corrupt previously stored data ■ (In-Progress) Transactions are isolated — other transactions cannot see the ■ results of earlier transactions until they are committed Transactions exhibit consistency — sequential memory model ■ CS 423: Operating Systems Design 7

Reliability Attempt #1: Careful Ordering Sequence operations in a specific order ■ Careful design to allow sequence to be interrupted safely ■ Post-crash recovery ■ Read data structures to see if there were any operations in progress ■ Clean up/finish as needed ■ Approach taken in FAT, FFS (fsck), and many app-level ■ recovery schemes (e.g., Word) CS 423: Operating Systems Design 8

Reliability Attempt #1: Careful Ordering FAT: Append Data to File MFT Data Blocks Add data block ■ 0 1 Add pointer to data block ■ 2 3 fj le 9 block 3 Update file tail to point to 4 ■ 5 new MFT entry 6 7 Update access time at ■ 8 9 fj le 9 block 0 head of file 10 fj le 9 block 1 11 fj le 9 block 2 fj le 12 block 0 12 13 14 15 16 fj le 12 block 1 17 fj le 9 block 4 18 19 20 CS 423: Operating Systems Design 9

Reliability Attempt #1: Careful Ordering FAT: Append Data to File MFT Data Blocks Add data block ■ 0 1 Add pointer to data block ■ 2 3 fj le 9 block 3 Update file tail to point to 4 ■ 5 new MFT entry 6 7 Update access time at ■ 8 9 fj le 9 block 0 head of file 10 fj le 9 block 1 11 fj le 9 block 2 Recovery fj le 12 block 0 12 13 Scan MFT ■ 14 15 If entry is unlinked, delete 16 fj le 12 block 1 ■ 17 data block fj le 9 block 4 18 19 If access time is incorrect, ■ 20 update CS 423: Operating Systems Design 10

Reliability Attempt #1: Careful Ordering FAT: Create New File MFT Data Blocks Allocate data block ■ 0 Update MFT entry to point to data ■ 1 block 2 3 fj le 9 block 3 Update directory with ■ 4 5 file name -> file number 6 What if directory spans multiple 7 ■ 8 disk blocks? 9 fj le 9 block 0 10 fj le 9 block 1 Update modify time for directory ■ 11 fj le 9 block 2 fj le 12 block 0 12 13 14 15 16 fj le 12 block 1 17 fj le 9 block 4 18 19 20 CS 423: Operating Systems Design 11

Reliability Attempt #1: Careful Ordering FAT: Create New File MFT Data Blocks Allocate data block ■ 0 Update MFT entry to point to data ■ 1 block 2 3 fj le 9 block 3 Update directory with ■ 4 5 file name -> file number 6 What if directory spans multiple 7 ■ 8 disk blocks? 9 fj le 9 block 0 10 fj le 9 block 1 Update modify time for directory ■ 11 fj le 9 block 2 fj le 12 block 0 12 13 Recovery 14 15 Scan MFT ■ 16 fj le 12 block 1 If any unlinked files (not in any 17 ■ fj le 9 block 4 18 directory), delete 19 Scan directories for missing update 20 ■ times CS 423: Operating Systems Design 12

Reliability Attempt #1: Careful Ordering FFS: Create New File ■ Allocate data block Inode Array Triple Double ■ Write data block Indirect Indirect Indirect Inode Blocks Blocks Blocks ■ Allocate inode ■ Write inode block ■ Update bitmap of free blocks File Metadata ■ Update directory with file name -> file number Direct Pointer DP ■ Update modify time for directory DP DP DP DP DP DP DP DP DP Direct Pointer Indirect Pointer Dbl. Indirect Ptr. Tripl. Indirect Ptr. CS 423: Operating Systems Design 13

Reliability Attempt #1: Careful Ordering FFS: Create New File ■ Allocate data block Inode Array Triple Double ■ Write data block Indirect Indirect Indirect Inode Blocks Blocks Blocks ■ Allocate inode ■ Write inode block ■ Update bitmap of free blocks File Metadata ■ Update directory with file name -> file number Direct Pointer DP ■ Update modify time for directory DP DP DP Recovery DP ■ Scan inode table DP DP ■ If any unlinked files (not in any DP directory), delete DP DP ■ Compare free block bitmap against inode Direct Pointer trees Indirect Pointer ■ Scan directories for missing update/ Dbl. Indirect Ptr. Tripl. Indirect Ptr. access times Recovery time is proportional to size of disk! CS 423: Operating Systems Design 14

Reliability Attempt #1: Careful Ordering FFS: Move a File Inode Array Triple Double ■ Remove filename from old Indirect Indirect Indirect directory Inode Blocks Blocks Blocks ■ Add filename to new directory File Metadata Direct Pointer DP DP DP DP DP DP DP DP DP DP Direct Pointer Indirect Pointer Dbl. Indirect Ptr. Tripl. Indirect Ptr. CS 423: Operating Systems Design 15

Reliability Attempt #1: Careful Ordering FFS: Move a File Inode Array Triple Double ■ Remove filename from old Indirect Indirect Indirect directory Inode Blocks Blocks Blocks ■ Add filename to new directory File Metadata Recovery Direct Pointer DP ■ Scan all directories to DP DP determine set of live files DP DP ■ Consider files with valid DP DP inodes and not in any DP directory DP DP Direct Pointer New file being created? ■ Indirect Pointer File move? Dbl. Indirect Ptr. ■ Tripl. Indirect Ptr. File deletion? ■ CS 423: Operating Systems Design 16

Reliability Attempt #1: Careful Ordering Application Level ■ Write name of each open file to app folder ■ Write changes to backup file ■ Rename backup file to be file (atomic operation provided by file system) ■ Delete list in app folder on clean shutdown Recovery ■ On startup, see if any files were left open ■ If so, look for backup file ■ If so, ask user to compare versions CS 423: Operating Systems Design 17

Reliability Attempt #1: Careful Ordering FFS: Move and Grep ■ Observation — careful ordering is not a panacea… Process A moves file from x to y mv x/file y/ Process B greps across x and y grep x/* y/* ■ Will Process B always see the contents of the file? CS 423: Operating Systems Design 18

Reliability Attempt #1: Careful Ordering Pros ■ Works with minimal support from the disk drive ■ Works for most multi-step operations Cons ■ Can require time-consuming recovery after a failure ■ Difficult to reduce every operation to a safely-interruptible sequence of writes ■ Difficult to achieve consistency when multiple operations occur concurrently (e.g., FFS grep) CS 423: Operating Systems Design 19

Goals for Today Learning Objective: Learn how to achieve - PowerPoint PPT Presentation

Goals for Today Learning Objective: Learn how to achieve reliability + availability in storage!! Announcements, etc: MP3 is out! Due April 18th . MP3 Walkthrough on Monday, slides up later today Reminder : Please put away devices

Meeting your goals Meeting your goals Meeting your goals Meeting your goals We are DNS DNS

APS Strategic Plan Goals and Strategies February 10, 2018 Strategic Plan Goals Goals : Key areas

What is the League Today 1 1/23/2017 What is the League Today What is the League Today 2

Mission, Goals, Metrics: Where Is CSCU Today? CSCU Mission & Goals History BOR adopted the

Using Learning Goals to Inform Instruction Erin Meikle Overview What we mean by learning goals

Social/Network/Analysis mohamed.bouguessa@uqo.ca/ 1 Web/today 2

Lecture 15 Logistics HW4 is due today HW5 posted today HW5 posted today Exam

Do Now What are your team goals for the day? How are you achieving your goals? Goals for the

Presentation Tips Before You Start Understand your goals and the goals of your audience.

of Proposed B u a Financial Financial Goals Policy l i c y Goals P o Goals a l s

Pat Allen | Rachel Barrow | Alex Byard | Melanie Fonner | Brad Frederick | Brian LaChance | Mike

Budget Recalibration Phase 2 Implementation Group Kickoff August 8, 2017 1 Agenda and Goals

One-on-One Interviews Training for SPF SIG coalitions Goals for today After today, you will

WIEMANN LAMPHERE ARCHITECTS MONTPELIER TODAY MONTPELIER TODAY PARKING! VEHICLES ARE

Today. Types of graphs. Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Today.

Welcome back. Today. Welcome back. Today. Continue Sampling combinatorial structures. Welcome

3d Modularity Miranda C. N. Cheng University of Amsterdam Based on joint work with What are the

Martin Schulz Lawrence Livermore National Laboratory VAPLS 2013, Atlanta October 14 th , 2013

2018 FOOD for Lane County Partner Agency Conference Opening Remarks begin in the Worship Center

Supporting Healthy Communities: Planning Tools Miranda Maupin Skeo Program Manager

ZFS The Last Word in Filesystem tzute Computer Center, CS, NCTU What is RAID? 2 Computer

Helical Mirror Concept Exploration: Design and Status A. V. Sudnikov, et al. Budker INP SB RAS

Construct 2 A game engine without the programming. The Code Liberation Foundation Lecture 3:

Mix-Automatic Sequences Jrg Endrullis Clemens Grabmayer Dimitri Hendriks Fields Workshop on

Goals for Today Learning Objective: Learn how to achieve - PowerPoint PPT Presentation

Goals for Today Learning Objective: Learn how to achieve reliability + availability in storage!! Announcements, etc: MP3 is out! Due April 18th . MP3 Walkthrough on Monday, slides up later today Reminder : Please put away devices

Meeting your goals Meeting your goals Meeting your goals Meeting your goals We are DNS DNS

APS Strategic Plan Goals and Strategies February 10, 2018 Strategic Plan Goals Goals : Key areas

What is the League Today 1 1/23/2017 What is the League Today What is the League Today 2

Mission, Goals, Metrics: Where Is CSCU Today? CSCU Mission &amp; Goals History BOR adopted the

Using Learning Goals to Inform Instruction Erin Meikle Overview What we mean by learning goals

Social/Network/Analysis mohamed.bouguessa@uqo.ca/ 1 Web/today 2

Lecture 15 Logistics HW4 is due today HW5 posted today HW5 posted today Exam

Do Now What are your team goals for the day? How are you achieving your goals? Goals for the

Presentation Tips Before You Start Understand your goals and the goals of your audience.

of Proposed B u a Financial Financial Goals Policy l i c y Goals P o Goals a l s

Pat Allen | Rachel Barrow | Alex Byard | Melanie Fonner | Brad Frederick | Brian LaChance | Mike

Budget Recalibration Phase 2 Implementation Group Kickoff August 8, 2017 1 Agenda and Goals

One-on-One Interviews Training for SPF SIG coalitions Goals for today After today, you will

WIEMANN LAMPHERE ARCHITECTS MONTPELIER TODAY MONTPELIER TODAY PARKING! VEHICLES ARE

Today. Types of graphs. Today. Types of graphs. Complete Graphs. Trees. Hypercubes. Today.

Welcome back. Today. Welcome back. Today. Continue Sampling combinatorial structures. Welcome

3d Modularity Miranda C. N. Cheng University of Amsterdam Based on joint work with What are the

Martin Schulz Lawrence Livermore National Laboratory VAPLS 2013, Atlanta October 14 th , 2013

2018 FOOD for Lane County Partner Agency Conference Opening Remarks begin in the Worship Center

Supporting Healthy Communities: Planning Tools Miranda Maupin Skeo Program Manager

ZFS The Last Word in Filesystem tzute Computer Center, CS, NCTU What is RAID? 2 Computer

Helical Mirror Concept Exploration: Design and Status A. V. Sudnikov, et al. Budker INP SB RAS

Construct 2 A game engine without the programming. The Code Liberation Foundation Lecture 3:

Mix-Automatic Sequences Jrg Endrullis Clemens Grabmayer Dimitri Hendriks Fields Workshop on

Mission, Goals, Metrics: Where Is CSCU Today? CSCU Mission & Goals History BOR adopted the