Repeatable, Reproducible, or Useful? Amer Diwan and Robert Hundt - - PowerPoint PPT Presentation

repeatable reproducible or useful
SMART_READER_LITE
LIVE PREVIEW

Repeatable, Reproducible, or Useful? Amer Diwan and Robert Hundt - - PowerPoint PPT Presentation

Repeatable, Reproducible, or Useful? Amer Diwan and Robert Hundt Google Repeatable I conduct the experiment twice using the same setup and get the same results Why should we care? If even I don't get consistent results from my


slide-1
SLIDE 1

Repeatable, Reproducible, or Useful?

Amer Diwan and Robert Hundt Google

slide-2
SLIDE 2

Repeatable

  • I conduct the experiment twice using the same

setup and get the same results

  • Why should we care?

– If even I don't get consistent results from my

experiment, then my experiment is doomed!

  • Challenge: inter-run variation

– Page mappings, interference with other

jobs, ...

slide-3
SLIDE 3

What can we do?

  • Repeat experiments as many times as needed

to obtain tight confidence intervals

– T-test, …

  • Report/record results with confidence intervals
slide-4
SLIDE 4

Reproducible

  • My friend and I conduct the same experiment

using the “same” setup and get the same results

  • Why should we care?

– If others cannot reproduce our experiments

then are they actually correct?

  • Challenge: bias
slide-5
SLIDE 5

Biases hiding under every rock...

The setting of irrelevant environment variables can lead to contradictory conclusions

slide-6
SLIDE 6

What can we do

  • Account and control for all sources of bias

– … yeah, right!

  • Account and control for all known sources of

bias

– Try to interactively discover sources of bias by

repeatedly submitting to the archive

slide-7
SLIDE 7

Sources of bias

  • Anything that affects memory layout

– Environment variables, link order, heap size

(Java), …

  • Benchmarks

– What exactly does the benchmark test?

  • Software and hardware components (e.g.,

microprocessors)

  • etc.
  • If we control for all sources of bias, we should

get reproducible results

slide-8
SLIDE 8

Useful

  • Real users should get results consistent with
  • ur experiments
  • Why should we care?

– If our results only apply to lab settings, then

they are irrelevant!

  • Challenge: “Controlling” bias is not a solution
slide-9
SLIDE 9

The problem with controlling bias

  • Repeating an experiment with the “same” bias

gives reproducible but not useful results

– e.g., Every time anyone ask my wife she

predicts the same winner for the election— this is repeatable but always has the same bias!

  • Need randomized trials
slide-10
SLIDE 10

Randomized trials

  • Randomly pick values for variables that cause

bias

  • Run an experiment
  • Repeat

Use statistical methods to summarize the trials

slide-11
SLIDE 11

The vision for an archival system

Repeat every experiment multiple times and use t-test Control for known sources of bias (benchmarks, environment variables...) Randomized trials for known sources of bias Self-contained script for running experiment Repeatable Reproducible Useful Sources of bias