Drive-Thru: Drive-Thru: Fast, Accurate Evaluation of Fast, - - PowerPoint PPT Presentation

drive thru drive thru fast accurate evaluation of fast
SMART_READER_LITE
LIVE PREVIEW

Drive-Thru: Drive-Thru: Fast, Accurate Evaluation of Fast, - - PowerPoint PPT Presentation

Drive-Thru: Drive-Thru: Fast, Accurate Evaluation of Fast, Accurate Evaluation of Storage Power Management Storage Power Management Daniel Peek Jason Flinn University of Michigan 1 Power Management Needed Power Management Needed


slide-1
SLIDE 1

1

Drive-Thru: Drive-Thru: Fast, Accurate Evaluation of Fast, Accurate Evaluation of Storage Power Management Storage Power Management

Daniel Peek Jason Flinn

University of Michigan

slide-2
SLIDE 2

2

Power Management Needed Power Management Needed

  • Battery lifetime is limited
  • Storage is an energy hog
  • Power management effective
  • Many competing techniques

[Helmbold 96], [Weissel 02], [Douglis 94], [Papthanasiou 03]

How can we evaluate possible power management strategies?

slide-3
SLIDE 3

3

Design Goals Design Goals

Storage power management evaluation should be: Fast

– Want to explore many policies/parameters/workloads

Accurate

– Both time and energy predictions are important

Portable

– Easy to apply technique to other file systems/OSs

slide-4
SLIDE 4

4

Trace Replay Trace Replay

Prerecorded Trace Trace Replay Tool Write (file1) Stat (file2) Delay 5s Fast Accurate Portable

slide-5
SLIDE 5

5

Write (file1) Stat (file2) Delay 5s

Trace Replay Without Idle Time Trace Replay Without Idle Time

Prerecorded Trace Fast Accurate Portable Trace Replay Tool

slide-6
SLIDE 6

6

Simulation Simulation

Simulator Prerecorded Trace Write (file1) Stat (file2) Delay 5s Fast Accurate Portable

?

Estimated Time & Energy Trace Replay Tool

slide-7
SLIDE 7

7

Outline Outline

  • Motivation
  • Design
  • Validation
  • Case Study
  • Related Work
  • Conclusion
slide-8
SLIDE 8

8

The Big Idea The Big Idea

  • Time Dependent

– Disk spindown – Writeback of dirty blocks in the buffer cache

  • Time Independent

– Mapping reads to disk blocks – Satisfying accesses from the buffer cache

slide-9
SLIDE 9

9

Layers of Power Management Layers of Power Management

Time-Dependent Activity Time-Independent Activity Satisfying Accesses File System Buffer Cache Disk Writeback of dirty blocks Spindown Mapping reads to disk blocks Send modifications to network server Accesses

slide-10
SLIDE 10

10

File System Log Device Log Drive-Thru Replay Tool

A Hybrid Methodology A Hybrid Methodology

The accuracy of trace replay without the cost

  • Replay time independent behavior without idle time
  • Simulate time dependent behavior

Base Trace Prerecorded Trace Write (file1) Stat (file2) Delay 5s Simulator

Estimated

Time & Energy

slide-11
SLIDE 11

11

Base Traces Base Traces

File System Log Device Log Base Trace

Write (file1) Write sector 8 Write (file1) Write sector 8 Stat (file2) Stat (file2) Stat (file2) Read sector 4 Stat (file2) Read sector 4 Sync Sync

slide-12
SLIDE 12

12

Merging Replay and Simulation Merging Replay and Simulation

Time & Energy Drive-Thru Simulator I/O Simulator

Write File1 Device Write Stat File2 Device Read Delay

  • Fast: All idle time is simulated
  • Portable: Drive-Thru < 1000 lines
slide-13
SLIDE 13

13

Drive-Thru Simulator

Drive-Thru Operations Drive-Thru Operations

  • Delay:
  • Coalesce:
  • Reorder:

Drive-Thru Simulator Drive-Thru Simulator

slide-14
SLIDE 14

14

Outline Outline

  • Motivation
  • Design
  • Validation
  • Case Study
  • Related Work
  • Conclusion
slide-15
SLIDE 15

15

Validation Setup Validation Setup

How accurate is Drive-Thru?

– Compare to most accurate method, replay with idle times

  • Ran six 45-minute trace segments

– iPAQ 3870 – Ext2 file system – Hitachi 1 GB Microdrive

slide-16
SLIDE 16

16

Validating with ext2 Validating with ext2

  • Drive-Thru prediction on average within 0.21%

1000 2000 3000 4000 5000 Purcell Berlioz Messiaen NFS15 Energy (J) Trace Replay with Idle Times Drive-Thru

slide-17
SLIDE 17

17

1000 2000 3000 4000 5000 Purcell Berlioz Messiaen NFS15 Energy (J) Trace Replay with Idle Times Drive-Thru

Validating with ext2 Validating with ext2

  • Drive-Thru prediction on average within 0.21%
slide-18
SLIDE 18

18

Validating with ext2 Validating with ext2

  • Drive-Thru prediction on average within 3%

100 200 300 400 Purcell Berlioz Messiaen NFS15 File System Energy (J) Trace Replay with Idle Times Drive-Thru

slide-19
SLIDE 19

19

Network File System Validation Network File System Validation

How Accurate is Drive-Thru for network?

– Compare to trace replay with idle times – Compare over 3 power management modes

  • Blue file system [Nightingale 04]
  • 802.11b card
slide-20
SLIDE 20

20

802.11b Power Management 802.11b Power Management

802.11b power management modes:

  • Continuously Aware Mode (CAM)

– High performance, high power

  • Power-Saving Mode (PSM)

– Low performance, low power

  • Self-Tuning Power Mangement (STPM) [Anand 03]

– Adaptively toggles between CAM and PSM

slide-21
SLIDE 21

21

  • Drive-Thru prediction on average within 7%

Validating Network Predictions Validating Network Predictions

1000 2000 3000 4000 Purcell (CAM) Purcell (PSM) Purcell (STPM) NFS15 (CAM) NFS15 (PSM) NFS15 (STPM) File System Energy (J) Trace Replay with Idle Times Drive-Thru

slide-22
SLIDE 22

22

Outline Outline

  • Motivation
  • Design
  • Validation
  • Case Study
  • Related Work
  • Conclusion
slide-23
SLIDE 23

23

Case Study: Local Storage Case Study: Local Storage

Proposed power management strategies:

  • Flush on write [Papathanasiou 02, Weissel 02]

– When a dirty block is written, writeback all dirty blocks

  • Flush on spindown [Weissel 02]

– Before the disk spins down, writeback all dirty blocks

  • Increase writeback delay [Papathanasiou 03, Weissel 02]

– Increase opportunities for write aggregation

Ran 6 traces on ext2 with Drive-Thru (8-28 hours each)

slide-24
SLIDE 24

24

Case Study: Local Storage Case Study: Local Storage

50 100 150 200 250 10 20 30 40 50 60 Writeback Delay (s) File System Energy (J)

Default Flush on Write Flush on Spindown Flush on Both

  • Complete evaluation over 40,000x faster than trace replay
slide-25
SLIDE 25

25

50 100 150 200 250 10 20 30 40 50 60 Writeback Delay (s) File System Energy (J)

Default Flush on Write Flush on Spindown Flush on Both

The 2-Second Peak The 2-Second Peak

Buffer Cache Trace Replay Tool Device Time (s) 0 1 2 Spinup Spinup Spindown Read Write

slide-26
SLIDE 26

26

Microbenchmarks Microbenchmarks: Danger! : Danger!

  • Need to evaluate over representative activity

100 200 300 400 500 600 700 800 900 10 20 30 40 50 60 Writeback Delay (s) File System Energy (J)

Default Flush on Write Flush on Spindown Flush on Both

slide-27
SLIDE 27

27

Case Study: Network File System Case Study: Network File System

Proposed power management strategies:

  • Flush on write

– When any data is sent to server, flush all data to server

  • Flush on PSM

– Before network card transitions, flush all data to server

  • Increase writeback delay

– Increase opportunities for bulk transfer

Ran 6 traces on BlueFS with Drive-Thru (8-28 hours)

slide-28
SLIDE 28

28

Case Study: Network Storage Case Study: Network Storage

100 200 300 400 500 600 10 20 30 40 50 60 Writeback Delay (s) File System Energy (J)

Default Flush on Write Flush on PSM Flush on Both

  • Complete evaluation over 13,000x faster than trace replay
slide-29
SLIDE 29

29

User App

Improving Improving BlueFS BlueFS

Local Storage BlueFS BlueFS Server Network

slide-30
SLIDE 30

30

User App

Improving Improving BlueFS BlueFS

Local Storage BlueFS BlueFS Server Network Flush on Spindown

slide-31
SLIDE 31

31

User App

Improving Improving BlueFS BlueFS

Local Storage BlueFS BlueFS Server Network Flush on Spindown Reduced Writeback Delay From 30 to 2 seconds

slide-32
SLIDE 32

32

Improving Improving BlueFS BlueFS

Modified BlueFS and ran 45-minute Purcell trace

  • Implemented flush on spindown for disk

– File system energy reduced 12.4% – Interactive delay reduced 11.0%

  • Reduced network writeback delay from 30 to 2 secs

– Safety improved for 1.8% file system energy cost

slide-33
SLIDE 33

33

Related Work Related Work

  • Disksim [Bucy 03] / Dempsey [Zedlewski 03]

– Detailed disk performance and power model

  • QualNet [http://www.scalablenetworks.com]

NS2 [http://www.isi.edu/nsnam/ns]

– Detailed network performance model

  • File-Cache-Content Detector [Arpaci-Dusseau 01]
slide-34
SLIDE 34

34

Conclusion Conclusion

Drive-Thru is:

  • Fast: Ran ext2 evaluation 40,000x faster
  • Accurate: 3% error for disk file system energy
  • Portable: < 1,000 lines of code

Case Study Insights:

  • Increasing writeback delay a meager improvement
  • Avoid writeback delay = disk spindown delay
  • Flush on spindown effective for disk