1.264 Lecture 19: System architecture, concluded. Disk performance (RAID)
Why are disks a problem?
Performance of most applications is governed by disk access
- Disk is the slowest "high performance" system element: about 100,000 times slower than main memory
- Disk gets the most attention in architecture and configuration
- Disk is the most complex subsystem; many mistakes are made, and because of disk slowness, mistakes have a very large impact on the system
- Disks are found in greater numbers than any other component
- Disk is the only major subsystem with moving parts: reliability is an issue
- Disk is the only major subsystem with 'state': other failed components can just be replaced
Disks are getting relatively slower
- Processor speeds still double every 18 months
- Disk throughput doubles only every 5 years; access speed improves even less often
- Disk size has grown quickly and cost has dropped, but those aren't the problems!
Redundant Array of Independent Disks (RAID)
Motivated by the relative lack of disk performance improvements
- Large disks put much data at risk if they fail
- Large-disk transfer rates are often inadequate for the amount of data they can store
RAID combines commodity (cheap) disk drives into organizations that improve reliability and performance
- Use lots of little disks instead of one big one
- Prices are high for small configurations but don't increase much as size increases: $3,000 for a 180 GB RAID array, $10,000 for a 2,500 GB RAID array
Figure by MIT OCW.
RAID-0 (Striping)
[Figure: RAID-0 striping. Logical blocks 1-12 (logical order) are distributed round-robin in chunks across the member disks (physical order); chunk size and stripe width are labeled.]
RAID-0 concept and reliability
Physical drives are organized in stripes and used as a single logical drive
- Each drive is split into "chunks", and successive chunks are stored on different drives; treat them as one large 'logical' disk
- Chunks are often 32KB. If you write a 128KB image across 32KB chunks, the data spans 4 disks, so your write time is about 1/4 of one disk's time
- High performance but risky
Failure of any member drive results in loss of some data
- Hot sparing can't be used (can't plug in a fresh disk for the failed one)
- Arrays of 100 disks with 500,000 hr MTBF will have a failure every 5,000 hours, or about every 7 months
- Unacceptable for most organizations; disrupts the system until restored from backup
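The failure-interval arithmetic above can be sketched in a few lines (illustrative only; real disk failures are not perfectly independent, and the helper name is ours, not from the slides):

```python
# Expected failure interval for an array of independent disks:
# with N disks each having the given MTBF, *some* disk fails
# roughly every MTBF/N hours.

HOURS_PER_MONTH = 730  # ~ 365 * 24 / 12

def array_failure_interval_hours(n_disks: int, disk_mtbf_hours: float) -> float:
    """Mean time between failures of some disk in an N-disk array."""
    return disk_mtbf_hours / n_disks

# The 100-disk RAID-0 example from the slide:
interval = array_failure_interval_hours(100, 500_000)
print(interval)                      # 5000.0 hours
print(interval / HOURS_PER_MONTH)    # ~6.8 months
```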
RAID-0 performance
Sequential access approaches the aggregate bandwidth of the member disks
- If 4 disks run at 4MB/sec each, striping can reach 15MB/sec
- May hit the SCSI bus limit or other constraints
Random access also improves substantially
- Striping cuts each disk's utilization to 1/N, making queues shorter
- Hot spots (one chunk frequently accessed) prevent the gain; cache these in memory if possible
RAID-0 requires all disks in the array to be identical
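A minimal sketch of the chunk-to-disk mapping behind striping (the function name and layout are illustrative, not a real RAID controller's API):

```python
# RAID-0 address mapping: logical chunk i lives on disk (i mod N)
# at physical chunk (i // N) on that disk.

CHUNK_SIZE = 32 * 1024  # 32 KB chunks, as in the slides

def raid0_map(logical_byte: int, n_disks: int, chunk_size: int = CHUNK_SIZE):
    """Map a logical byte offset to (disk index, physical byte offset)."""
    chunk, offset = divmod(logical_byte, chunk_size)
    disk = chunk % n_disks
    physical_chunk = chunk // n_disks
    return disk, physical_chunk * chunk_size + offset

# A 128 KB write with 32 KB chunks on 4 disks touches each disk exactly once,
# which is why the write time is ~1/4 of a single disk's.
disks = {raid0_map(i * CHUNK_SIZE, 4)[0] for i in range(4)}
print(disks)  # {0, 1, 2, 3}
```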
Figure by MIT OCW.
RAID-1: Mirroring
[Figure: RAID-1 mirroring. Logical blocks 1-6 are written identically to Mirror A and Mirror B; chunk size and stripe width are labeled.]
RAID-1: mirroring
Large disk farms have reliability problems
- 2,000 disks with 500,000 hr MTBF will have a failure every 250 hrs
RAID-1 reserves 1 or more extra disks for each original disk
- Every member is identical; writes update every member
- Reads can go to any member, which gives a performance improvement
Mirroring improves reliability
- If two disks each have 250,000 hr MTBF, the mirror has about 6*10^9 hr MTBF
- The only real risk is physical destruction of both disks in a common event
RAID-1 supports hot-swapping and hot-sparing
- Hot-swapping: replace a failed disk with a new disk
- Hot-sparing: an extra disk stays in sync with the mirror and comes on-line if a failure is detected in a mirror disk
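The mirror-MTBF figure comes from the standard pair-failure approximation, MTBF_pair ≈ MTBF² / (2 × MTTR): both disks must fail within one repair window. A quick sketch (the ~5-hour repair time is an assumed value chosen for illustration; it is not stated in the slides):

```python
# Approximate MTBF of a two-disk mirror. The array is lost only if the
# second disk fails while the first is still being repaired.

def mirror_mtbf(disk_mtbf_hours: float, mttr_hours: float) -> float:
    """MTBF_pair ~= MTBF**2 / (2 * MTTR), assuming independent failures."""
    return disk_mtbf_hours ** 2 / (2 * mttr_hours)

# 250,000 hr disks with an assumed ~5 hr repair time:
print(mirror_mtbf(250_000, 5))  # ~6.25e9 hours, on the order of 6*10^9
```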
RAID-1 performance
Write performance is about 25% slower than a regular disk
- Most writes occur in parallel; lack of 'spindle sync' causes the degradation
Read performance
- Sequential reads are the same as a single disk: each read is served by a single RAID disk
- Random reads are faster, due to the 1/N decrease in utilization
Mirror resynchronization after a failure is done at slow speed to allow the 'good' disk to continue serving its applications
RAID mirrors are often taken offline for backup
Mirrored disks with FibreChannel can be miles away from the server and act as off-site storage for disaster recovery
Figure by MIT OCW.
RAID-1+0: Mirrors with stripes
[Figure: RAID-1+0. Logical blocks 1-6 are striped in chunks across the disks of Submirror A (1A-6A), and identical copies (1B-6B) are kept on Submirror B.]
RAID-1+0
Reliability comparable to RAID-1 (mirror)
Performance in between RAID-0 and RAID-1
- Reads improve, but not as much, because of less striping
- Writes are about 30% slower than a single disk (vs 25% for RAID-1)
Figure by MIT OCW.
RAID-5: distributed parity stripe
[Figure: RAID-5 distributed parity stripe. Logical blocks 1-12 are striped in chunks across the disks, with parity chunks P0, P1, P2 rotated among the drives so no single disk holds all the parity.]
RAID-5 reliability
Parity stripe is distributed among the disks
- Parity is just the XOR (the sum, modulo 2) of the 0s and 1s from the other disks
- We can reconstruct one failed disk from the other disks and the parity stripe
Reliability:
- Cannot withstand the loss of 2 disks
- Can insert hot spares
- RAID-5 uses two-phase commits to ensure parity and data blocks are written together (or rolled back on failure)
- Two-phase commit: prepare (move data to disk), then commit (do it); roll back via logs if any failure occurs during the commit
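The parity reconstruction above is plain XOR, which can be shown in a few lines (a minimal sketch with toy 3-byte chunks; a real array works on whole disk blocks):

```python
# RAID-5 parity is the XOR of the data chunks. Because XOR is its own
# inverse, any ONE lost chunk equals parity XOR the surviving chunks.
from functools import reduce

def parity(chunks):
    """XOR equal-length byte chunks column by column."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*chunks))

data = [b"\x0112", b"\x02ab", b"\x04xy"]   # three toy data chunks
p = parity(data)                           # the parity chunk

# Disk 1 fails: rebuild its chunk from the parity plus the survivors.
rebuilt = parity([p, data[0], data[2]])
print(rebuilt == data[1])  # True
```

This is also why RAID-5 cannot survive two failures: with two chunks missing, the single parity equation has two unknowns.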
RAID-5 performance
Read performance is the same as a stripe with the same number of data disks
- RAID-5 with 6 disks performs like RAID-0 with 5 disks
Write performance is poor
- At least 50% degradation from a single disk, because data and parity must be written to two separate disks
- Actual performance is worse, possibly by another factor of 2: the two-phase commit and its logs further degrade performance, and writes to logs and data must be synchronized to ensure consistency
In degraded mode (1 disk failed)
- Read performance is awful: must read all disks and use parity to compute the data on the failed member; this increases utilization of all disks so much that the system crawls
- Write performance is unchanged: impossible to get worse than the base case
Disk configuration
Some storage on all mission-critical systems should be protected, preferably by a mirror (RAID-1 or -1+0):
- Operating system (to reboot from the mirror)
- Database executable program
- DBMS logs, rollback segments, system tables
Hot spares should be available for protected volumes
Disks are the component most sensitive to environment (heat especially)
Disks are key to system performance in most applications
- Network and CPU are 'stateless' and more easily expanded
- Much misconfiguration: disks running at 99% utilization are common!
Reliability and restoral are major issues for real systems: use RAID, even for relatively small systems
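Why 99% utilization is so damaging can be sketched with a simple queueing estimate (the M/M/1 model and 10 ms service time are our illustrative assumptions, not from the slides):

```python
# In an M/M/1 queue, mean response time = service_time / (1 - utilization),
# so response time explodes as a disk's utilization approaches 100%.

def response_time_ms(service_ms: float, utilization: float) -> float:
    """Mean response time of an M/M/1 queue at the given utilization."""
    return service_ms / (1.0 - utilization)

for u in (0.50, 0.90, 0.99):
    print(u, response_time_ms(10.0, u))
# ~20 ms at 50% utilization, ~100 ms at 90%, ~1000 ms at 99%:
# the same disk, 50x slower to respond.
```

This is also the intuition behind the earlier striping slide: cutting each disk's utilization to 1/N shortens its queue disproportionately.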