chenyang lu
play

Chenyang Lu Outline OnlineDataMigra.oninStorageServers - PDF document

Chenyang Lu Outline OnlineDataMigra.oninStorageServers Controltheore*cFramework ServicedelaycontrolonWebservers Enterprisestorageservers


  1. Chenyang Lu Outline
 Online
Data
Migra.on
in
Storage
Servers
  Control‐theore*c
Framework 
  Service
delay
control
on
Web
servers
  Enterprise
storage
servers
  On‐line
data
migra*on
in
storage
servers
 E-mail server; DB … need
to
move
data 
  ControlWare:
adap*ve
QoS
control
middleware
 I/Os  System
expansion
  Applica*on
changes
 data 
  Always‐on :
e‐business,
 migration global
data
centers
 New device 
  
 Online 
data
migra*on
 Storage system 39 
 40 
 State
of
Prac.ce
 The
Problem
 Need to bound impact E-mail server; DB…  Execute
a
given
migra*on
plan
on‐line

 on applications!  Challenges
 Slow I/O ʼ s!!! Keep
data
consistent
  SAN Bound
impact
on
applica*on
performance
  New Complete
migra*on
quickly
  device Migration Script data 
 plan migration storage 
 Submover devices Storage system HP-UX LVM 41 
 42 
 Aqueduct
 Adap.ve
solu.on
 Aqueduct E-mail server; DB… migration executor  Feedback
control
loop:
adapts
migra*on
speed
based
 on
applica*on
I/O
latency
 Monitor  Enforce
latency
contract:
Bounded
average
I/O
latency
 I/Os { L i ( k )} SAN Application { LC i } Latency  Complete
migra*on
in
shortest
*me
allowed
by
contract
 Controller Contract  Standard
control‐theore*c
design
 R m ( k ) Migration Actuator  Systema*c
methodology
 data 
 plan migration  Robust,
analy*cally
proven
performance
  Handle
different
workloads
and
devices
 storage 
 Submover devices Storage system HP-UX LVM 43 
 44 
 Quality of Service in Unpredictable 1 Computing Environments

  2. Chenyang Lu Monitor
 Actuator
 Monitor Monitor Controller Controller  Problem:
fine‐grained
control
of
 
migra*on
speed
using
HP‐UX
LVM
 Actuator Actuator  Divide
store
into
small

(32
MB)
substores
(LVs)
  Submover
moves
substore
using
LVM
silvering
  Measure
applica*ons’
average
I/O
latency
of
each
store
in
the
 last
sampling
window 

 Mirror Split Current
implementa*on:
trace
replayer
directly
monitors
I/O
latencies
 Silvering  Can
interface
with
performance
monitoring
tools
(HP
Openview)
   Actuator
enforces
a
submove
rate
by
sleeping
 submv submv sleep sleep 1 submv/sw sleep sleep sleep sleep 2 submv/sw Sampling Window Sampling Window 45 
 46 
 Controller
 Tuning
controller
parameters
 Monitor Controller Actuator Victim latency VL(k) : highest Approximate linear model  Compute
error
for
each
store
i
 average latency among all stores VL(k+1)–VL(k)= G (R m (k)-R m (k-1)) 
 
 E i (k)
=
P*LC i 
‐
L i (k)
 in the k th sampling window 
0<P<1:
safety
margin,
related
to
burs*ness
 Process gain G : Impact of 
k:
represents
the
k th 
sampling
window
 System profiling: Estimate G submove rate changes on  Compute
worst
error
 victim latency. 
 
 E min (k)
=
min{E i (k)}
 Construct transfer function  Integral
controller
computes
new
submove
rate:
 Stability
  
 
 R m (k)
=
R m (k‐1)
+
K*E min (k)
 Tracking:
VL(k)
=
P*LC
in
 Control Analysis  
Control
gain
K:
aggressiveness
of
rate
change
 steady
state
 Compute K SeOling
.me
  Satisfy 47 
 48 
 Experimental
setup
 Experiments
  Baselines:
no
sleeping
between
(sub)moves

 Whole‐store:
move
one
store
at
a
*me
  Aqueduct FC-60 disk array Sub‐store:
move
one
substore
at
a
*me
  (1.05 TB, 5 RAID5 Logical Units) HP-UX 11 & LVM LU 0 HP 9000-N4000 Server  Constant:
steady
Poisson
streams
 emails metadata 8 440MHz processors Replace
Logical
Unit;
migrate
three
640‐MB
stores.
  emails metadata Openmail  Openmail:
trace
of
an
enterprise
e‐mail
server
running
HP
 I/O Trace Fibre Channel Openmail
 Add
Logical
Unit;
migrate
a
1854
MB
store
and
a
96
MB
store

  LU new  Enterprise‐scale
storage
server
 49 
 50 
 Quality of Service in Unpredictable 2 Computing Environments

  3. Chenyang Lu Openmail:
vic.m
latency
 Measure
G
  
Tune
K
 Process gain G: the slope Constant of the curves Average Victim Latency (ms) Control gain K Constant: K = 1.09 Openmail: K = 0.36 Openmail LC 0.8*LC Aqueduct Sub-store Whole-store 51 
 52 
 Openmail:
latency
 Openmail:
latency
&
submove
rate
 LC  Load
highest
on
new
LU
towards
end
of
migra*on
 Aqueduct uniformly better than baselines, but …  By
design,
submove
rate
must
be
1
or
higher
  
 controller
is
working
correctly
 53 
 54 
 Openmail:
average
latency
 Openmail:
latency
CDF
 91% 
 Aqueduct 76% 
 Sub-store Whole-store LC 55 
 56 
 Quality of Service in Unpredictable 3 Computing Environments

  4. Chenyang Lu Related
work
 Summary
  Migra*on
must
be
executed
adap*vely
  Aqueduct
is
neither
overly
aggressive
  Simpler
versions
of
the
problem
 Average
I/O
latency
reduced
by
76%
  Take
(parts
of)
system
offline
  Contract
viola*on
ra*o
reduced
by
78%
  Migrate
data
in
“quiet
periods”

   Nor
overly
conserva*ve
  Silvering
in
Logical
Volume
Manager
[HP‐UX
LVM,
VxVM]:
 Average
vic*m
latency
15%
lower
than
latency
contract
  maintain
data
consistency,
no
QoS
guarantees
  Propor*onal
I/O
scheduling:
hard
to
handle
unpredictability
  Future
  MS
Manners:
no
guarantees
to
important
tasks
 More
detailed
sensi*vity
analysis
   Control‐theory‐based
systems:
distributed
visual
tracking,
Web
 Self‐tuning
controller
  servers,
e‐mail
server,
database
real‐*me
processor
 Mul*‐dimensional
QoS
contracts
  scheduling
...

 57 
 58 
 References
  C. Lu, G. A. Alvarez, J. Wilkes, Aqueduct: Online Data Migration with Performance Guarantees, USENIX Conference on File and Storage Technologies (FAST), 2002.  G.A. Alvarez, C. Lu and J. Wilkes, Method and System for Online Data Migration on Storage Systems with Performance Guarantees, U. S. Patent 7,167,965, January 2007. 59 
 Quality of Service in Unpredictable 4 Computing Environments

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend