On Selecting the Right Optimizations for Virtual Machine Migration - - PowerPoint PPT Presentation

on selecting the right optimizations for virtual machine
SMART_READER_LITE
LIVE PREVIEW

On Selecting the Right Optimizations for Virtual Machine Migration - - PowerPoint PPT Presentation

On Selecting the Right Optimizations for Virtual Machine Migration Senthil Nathan N , Umesh Bellur, Purushottam Kulkarni Systems and Networks Group (SynerG) Department of Computer Science and Engineering Indian Institute of Technology Bombay


slide-1
SLIDE 1

On Selecting the Right Optimizations for Virtual Machine Migration

Senthil Nathan N, Umesh Bellur, Purushottam Kulkarni Systems and Networks Group (SynerG) Department of Computer Science and Engineering Indian Institute of Technology Bombay April 2, 2016 Virtual Execution Environments (VEE) 2016

slide-2
SLIDE 2

Pre-Copy Live Migration

IIT Bombay 2/22

VM VM memory pages copying VM's main memory Storage Area Network

Stores VM's disk image

source PM destination PM

VM is executing: Memory pages are transferred iteratively

slide-3
SLIDE 3

Pre-Copy Live Migration

IIT Bombay 2/22

VM VM memory pages copying VM's main memory Storage Area Network

Stores VM's disk image

source PM destination PM

VM is executing: Memory pages are transferred iteratively

VM iter #1 migration time iter #2 iter #n downtime source PM destination PM stop VM's execution VM pages dirtied in iter #1 pages dirtied in iter #(n-1) remaining pages & hardware state

slide-4
SLIDE 4

Pre-Copy Live Migration

IIT Bombay 2/22

VM VM memory pages copying VM's main memory Storage Area Network

Stores VM's disk image

source PM destination PM

VM is executing: Memory pages are transferred iteratively

VM iter #1 migration time iter #2 iter #n downtime source PM destination PM stop VM's execution VM pages dirtied in iter #1 pages dirtied in iter #(n-1) remaining pages & hardware state

migration time(VM hosting file server) = 5 minutes

slide-5
SLIDE 5

Existing Optimization Components for Pre-Copy Live Migration

IIT Bombay 3/22

Page Dirty Characteristics

1

page delta (del).

2

frequently/hot dirty pages (hdp).

Page Content Characteristics

1

zero pages (zp) & sub zero pages (szp).

2

duplicate pages (dup) & sub duplicate pages (sdup).

3

word-level duplicates (wld). We identifed a new component: False Dirty Page (fdp)

slide-6
SLIDE 6

False Dirty Pages (fdp)

IIT Bombay 4/22 User Space Migration Process Kernel Space Hypervisor retrieve dirty bitmap reset dirty bitmap mark all pages of VM as read only Before Starting an Iteration & During an Iteration Transfer pages that are marked in the received dirty bitmap

  • 1. write VM’s read only page
  • 2. set the dirty bit
  • 3. mark the page writable

1

An iteration can transfer a page that is already dirted during current iteration.

2

What happens if this page never get dirtied again before the next iteration?

An unmodified page is transferred: False Dirty Page.

slide-7
SLIDE 7

Existing Optimizations

IIT Bombay 5/22

Techniques Account for dirty characteristics content characteristics fdp del hdp zp szp dup sdup wld Delta Compression

  • Page Skip
  • Full Page Deduplication
  • Sub Page Deduplication
  • Data Compression
  • False Dirty Page (fdp) is overlooked in existing study

Question of Interest

1

Which is the best technique in terms of performance & cost?

2

Does optimization’s performance depend on application behaviour?

3

What combinations of optimizations are more appropriate?

4

Can the combinations of optimizations reduce the resource cost? fdp helps in answering these questions

slide-8
SLIDE 8

Page Skip

IIT Bombay 6/22

Send List Collect n pages & transfer ALL PAGES are scheduled to transfer in iter #1

  • ITER. #1
  • locate all pages dirtied during this transfer time
slide-9
SLIDE 9

Page Skip

IIT Bombay 6/22

Send List Collect n pages & transfer ALL PAGES are scheduled to transfer in iter #1

  • ITER. #1
  • locate all pages dirtied during this transfer time

Send List Collect n pages & transfer ALL PAGES are scheduled to transfer in iter #1 Collect n pages & transfer denotes dirtied pages during the transfer of 'n' pages dirtied pages yet to be transferred are skipped

  • ITER. #1
  • Send List

locate all pages dirtied during this transfer time

slide-10
SLIDE 10

Page Skip

IIT Bombay 6/22

Send List Collect n pages & transfer ALL PAGES are scheduled to transfer in iter #1

  • ITER. #1
  • locate all pages dirtied during this transfer time

Send List Collect n pages & transfer ALL PAGES are scheduled to transfer in iter #1 Collect n pages & transfer denotes dirtied pages during the transfer of 'n' pages dirtied pages yet to be transferred are skipped Collect <=n pages & transfer Collect n pages & transfer

  • ITER. #2
  • ITER. #1
  • Dirted pages during the transfer of 'n' pages

ALL dirtied PAGES during iter #1 Eligible pages for skip: Pages dirtied in both iteration 1 and 2 Send List Send List Send List

.....

locate all pages dirtied during this transfer time

..... .....

slide-11
SLIDE 11

Our 3 Goals

IIT Bombay 7/22

1

To perform a comprehensive empirical study to understand and compare between performance and cost of different migration optimization techniques.

2

To understand the relationship between application behavior and the payback of different optimizations.

3

To investigate the feasibility of combining optimizations so as to get the maximum improvement in migration performance for unit resource cost.

slide-12
SLIDE 12

Experimental Setup and Workloads Used for Evaluation

IIT Bombay 8/22

VM PM1 PM2 source destination migration logger logger switch Controller load generator 1 Gbps VM

Issue commands for VM creation, VM migration, Page dirtying rate, Changing resource availability CPU – Xentop Network - /proc/net/dev Page Dirty – shadow page

load generator NFS NTP

Table : Workloads: Web Services Workloads Remark Mediawiki, DVD Store, RUBiS webserver and database server OLTPBench epinions, twitter, seats, voter, tpcc, tatp, ycsb HTTP file server 100s of files each with different sizes Table : Workloads: Multimedia, Biotech, Mining, MPI Workloads Remark gcc kernel compile with multiple threads File compression compress 8 GB data set with tar Linpack linear equation solver Nu-Minebench (data mining) ECLAT, HOP, ScalParC, UtilityMine Mummer gnome sequence alignment Parsec (multimedia tools) bodytrack, ferret, fluidanimate, freqmine, vips, and x264 Dacapo (java benchmarks) avrora, eclipse, fop, h2, jython, luindex, lusearch, pmd, sun- flow, tomcat, tradebeans, tradesoap, and xalan

42 applications, 7 different migration rates. 42×7 = 294 measured values.

slide-13
SLIDE 13

Expectations on Optimization Performance

IIT Bombay 9/22

Performance of Delta Compression & Page Skip ∝ Additional Network Traffic Generated

500 1000 1500 2000 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290

Additional Traffic: MB Migration IDd

additional network traffic

Performance of Deduplication ∝ Additional Network Traffic & the Amount of Zero, Duplicate Pages Performance of Data Compression ∝ Total Network Traffic & the Amount of Zero, Word Level Dup.

slide-14
SLIDE 14

Methodology

IIT Bombay 10/22

1

Apply each optimization individually on 294 migration instances.

2

If expectation fails, identify the reason (i.e., application behavior).

3

Compare performance & cost of each optimization and find appropriate combinations.

slide-15
SLIDE 15

Goal 1: Performance and Cost Trade-Off

IIT Bombay 11/22

Techniques

  • Avg. traffic reduction

Increase in CPU at source PM destination PM Page Skip 20 % 1 x 1 x Data Compression 37 % 5 x 1.3 x Delta Compression 17 % 3.2 x 1.2 x Full Page Dedup. 17 % 11 x 1 x Sub Page Dedup. 20 % 13 x 1.2 x Page skip is the best optimization in terms of performance-cost tradeoff

slide-16
SLIDE 16

Significant Traffic Reduction due to fdp & hdp

IIT Bombay 12/22

0.2 0.4 0.6 0.8 1 300 600 900 1200 CDF

Traffic Reduction (MB)

(a)Total traffic reduction

delta compression page skip full page deduplication sub page deduplication (4) data compression

0.2 0.4 0.6 0.8 1 300 600 900 1200 CDF

Traffic reduction (in MB)

(b)Traffic reduction (fdp+hdp)

delta compression page skip full page deduplication sub page deduplication (4)

Observations: Reduction in network traffic

Reduction due to data compression is significant. On average, 40% reduction due to False Dirty Pages (fdp). Approximately, 53% reduction due to Hot Dirty Pages (hdp).

slide-17
SLIDE 17

Goal 2: Co-relation Between Application Behavior and Optimization’s Performance

Techniques Low performance when Delta Compression high dirty frequency, high disk read, memset(),free() Page Skip page dirty rate << migration rate and large writable set size Deduplication high dirty frequency and low dup+zp Data Compression low word level duplicates & zero content

IIT Bombay 13/22

slide-18
SLIDE 18

Delta Compression: Impact of Application Behaviour

IIT Bombay 14/22

slide-19
SLIDE 19

Delta Compression: Impact of Application Behaviour

IIT Bombay 14/22

fdp ∝ 1

df

high df = ⇒ improvement depedentent of page modification size.

disk read, memset(), free() modified all 4 KB.

slide-20
SLIDE 20

Goal 3: Appropriate Combinations—Page Skip as Base Optimization

IIT Bombay 15/22

Page Skip + Reduction in traffic Total Increase in CPU at (over skip) source PM destination PM Data Compression 21 % 4.7 x (-.3) 1.3 x Delta Compression 1.4 % 2.7 x (-.5) 1 x Full Page Dedup. 7 % 10.8 x (-.2) 1 x Sub Page Dedup. 9 % 13 x 1.2 x Page Skip + Data Compression is the best combination in terms of performance-cost tradeoff

slide-21
SLIDE 21

Reduction in Total CPU Utilization due to Combination of Techniques

IIT Bombay 16/22

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total source CPU utilization (%)

(a) Individual Technique

delta compression full page dup sub page dup (4) compression

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total destination CPU utilization (%)

delta compression full page dup sub page dup (4) compression

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total source CPU utilization (%)

(b)Page skip as the base technique

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total destination CPU utilization (%)

slide-22
SLIDE 22

Reduction in Total CPU Utilization due to Combination of Techniques

IIT Bombay 16/22

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total source CPU utilization (%)

(a) Individual Technique

delta compression full page dup sub page dup (4) compression

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total destination CPU utilization (%)

delta compression full page dup sub page dup (4) compression

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total source CPU utilization (%)

(b)Page skip as the base technique

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total destination CPU utilization (%)

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total source CPU utilization (%)

(a) Individual Technique

delta compression full page dup sub page dup (4) compression

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total destination CPU utilization (%)

delta compression full page dup sub page dup (4) compression

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total source CPU utilization (%)

(b)Page skip as the base technique

0.2 0.4 0.6 0.8 1 500 1000 1500 2000 2500 CDF

Total destination CPU utilization (%)

slide-23
SLIDE 23

Reduction in Impact of Application Behaviour due to Combination of Techniques

Page Skip + low performance gain when Delta Compression high dirty frequency, unskipped pages are dirtied due to disk read, memset(),free() Deduplication high dirty frequency, low dup+zp Data Compression low word level duplicates and low zero content

IIT Bombay 17/22

slide-24
SLIDE 24

Appropriate Combinations—Page Skip + Data Compression as Base Optimizations

IIT Bombay 18/22

Page Skip + Data Reduction in traffic Total Increase in CPU at Compression +

  • ver page skip +

data compression source PM destination PM Delta Compression 3 % 7.1 x 1.4 x Full Page Dedup. 2 % 16 x 1.5 x Sub Page Dedup.

  • 3 %

18 x 1.5 x No more suitable combinations

slide-25
SLIDE 25

More Results (refer to paper)

IIT Bombay 19/22

1

Reduction in migration time and downtime due to each technique.

2

Detailed CPU utilization with respect to migration rate.

3

Impact of optimization parameters on performance improvement.

cache size for delta compression. build frequency for page skip. sub page size for deduplication.

4

Proposed a new cache replacement policy for delta compression that improves the performance over 30%.

slide-26
SLIDE 26

Summary of Key Results

IIT Bombay 20/22

1

Page Skip is the best optimization (must apply).

2

Page Skip + Data Compression is the best combination.

3

Deduplication technique need to applied with utmost care (can use KSM).

4

Combination of techniques reduce the

impact of application behaviour on optimization’s performance. total resource utilization (while increasing per second utilization).

slide-27
SLIDE 27

IIT Bombay 21/22

Thank You

We thank Google & Microsoft Research India for travel grant

Questions?