repeatable reproducible or useful
play

Repeatable, Reproducible, or Useful? Amer Diwan and Robert Hundt - PowerPoint PPT Presentation

Repeatable, Reproducible, or Useful? Amer Diwan and Robert Hundt Google Repeatable I conduct the experiment twice using the same setup and get the same results Why should we care? If even I don't get consistent results from my


  1. Repeatable, Reproducible, or Useful? Amer Diwan and Robert Hundt Google

  2. Repeatable ● I conduct the experiment twice using the same setup and get the same results ● Why should we care? – If even I don't get consistent results from my experiment, then my experiment is doomed! ● Challenge: inter-run variation – Page mappings, interference with other jobs, ...

  3. What can we do? ● Repeat experiments as many times as needed to obtain tight confidence intervals – T-test, … ● Report/record results with confidence intervals

  4. Reproducible ● My friend and I conduct the same experiment using the “same” setup and get the same results ● Why should we care? – If others cannot reproduce our experiments then are they actually correct? ● Challenge: bias

  5. Biases hiding under every rock... The setting of irrelevant environment variables can lead to contradictory conclusions

  6. What can we do ● Account and control for all sources of bias – … yeah, right! ● Account and control for all known sources of bias – Try to interactively discover sources of bias by repeatedly submitting to the archive

  7. Sources of bias ● Anything that affects memory layout – Environment variables, link order, heap size (Java), … ● Benchmarks – What exactly does the benchmark test? ● Software and hardware components (e.g., microprocessors) ● etc. ● If we control for all sources of bias, we should get reproducible results

  8. Useful ● Real users should get results consistent with our experiments ● Why should we care? – If our results only apply to lab settings, then they are irrelevant! ● Challenge: “Controlling” bias is not a solution

  9. The problem with controlling bias ● Repeating an experiment with the “same” bias gives reproducible but not useful results – e.g., Every time anyone ask my wife she predicts the same winner for the election— this is repeatable but always has the same bias! ● Need randomized trials

  10. Randomized trials ● Randomly pick values for variables that cause bias ● Run an experiment ● Repeat Use statistical methods to summarize the trials

  11. The vision for an archival system Self-contained script for running experiment Repeat every experiment multiple times and use t-test Repeatable Control for known sources of bias Sources of bias (benchmarks, environment variables...) Reproducible Randomized trials for known sources of bias Useful

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend