Learning Networking by Reproducing Results
Lisa Yan, Lecturer in Computer Science Stanford University June 24, 2020
1
Slides http://stanford.edu/~yanlisa/publications/precs20_yanlisa-slides.pdf
Learning Networking by Reproducing Results Lisa Yan, Lecturer in - - PowerPoint PPT Presentation
Learning Networking by Reproducing Results Lisa Yan, Lecturer in Computer Science Stanford University June 24, 2020 Slides http://stanford.edu/~yanlisa/publications/precs20_yanlisa-slides.pdf 1 Suppose you need to teach 1. Introduction to
Lisa Yan, Lecturer in Computer Science Stanford University June 24, 2020
1
Slides http://stanford.edu/~yanlisa/publications/precs20_yanlisa-slides.pdf
Lisa Yan, 2020
2
Lisa Yan, 2020
3
Application Transport Network Link
Lisa Yan, 2020
4
Lisa Yan, 2020
5
Lisa Yan, 2020
6
(a bit boring) (too risky)
Lisa Yan, 2020
7
circa 2012: the beginning of Mininet, a realistic network emulator
Lisa Yan, 2020
8
Lisa Y Yan and Nick McKeown. Learning Networking by Reproducing Research
CCR a award a at S SIG IGCOMM 2 2017.
circa 2012: the beginning of Mininet, a realistic network emulator
Lisa Yan, 2020
9
Lisa Y Yan and Nick McKeown. Learning Networking by Reproducing Research
CCR a award a at S SIG IGCOMM 2 2017.
Lisa Yan, 2020
10
Lisa Y Yan and Nick McKeown. Learning Networking by Reproducing Research
CCR a award a at S SIG IGCOMM 2 2017.
Lisa Yan, 2020
11
Lisa Y Yan and Nick McKeown. Learning Networking by Reproducing Research
CCR a award a at S SIG IGCOMM 2 2017.
Lisa Yan, 2020
Reproducing research project: Graduate computer networks
Greater impacts
12
Lisa Yan, 2020
Reproducing research project: Graduate computer networks
Greater impacts
13
Lisa Yan, 2020
14
proposal
Day 1 7 14 21 28
Intermediate r report
discussion
In-class presentations
report
reproducingnetworkresearch.wordpress.com
Lisa Yan, 2020
15
top networking conferences
internet standards
systems security
Lisa Yan, 2020
16
B4 Wide Area Network (WAN) Facebook 2000- node cluster NetFPGA, Programmable solutions Theoretical models
ns-2
network traffic simulators and emulators
Mininet - http://mininet.org/ Mahimahi - http://mahimahi.mit.edu/
world wide web
Lisa Yan, 2020
17
Lisa Yan, 2020
18
Lisa Yan, 2020
19
B4 Wide Area Network (WAN) Facebook 2000- node cluster NetFPGA, Programmable solutions Theoretical models
ns-2 Mininet - http://mininet.org/ Mahimahi - http://mahimahi.mit.edu/
world wide web network traffic simulators and emulators
Lisa Yan, 2020
20
Lisa Yan, 2020
21
Lisa Yan, 2020
22
Lisa Yan, 2020
23
System source code Workload generation Open-source 33% Open-source but
18% Part of Linux Kernel 10% Contacted a author 7% Binary available 1% Student-created 19% Not-needed 12% Open-source 19% Sufficient details in paper 40% Student-created 41%
Lisa Yan, 2020
24
Lisa Yan, 2020
25
Spark
video streaming
73 unique published papers, 1993–2018
Lisa Yan, 2020
0% 20% 40% 60% 80% 100% 2019 2018 2017 2016 2015 2014 2013 2012
% of student groups Course offering
Unsuccessful Successful
26
73 unique papers reproduced 300+ students since 2012 (150+ projects)
2018: require new research reproductions 2016: introduced mahimahi emulator
Lisa Yan, 2020
27
2 4 6 8 10 # student reproductions Unique papers (73 total)
Pre-2018 Post-2018
Increasing TC TCP’s Initial Congestion Window (2010)
Data Ce Centers Randomly (2012) 3.
TCP Fast Open (2011)
unstable: picking a vi vide deo streaming r rate is hard (2014)
(require new unique research reproductions)
Lisa Yan, 2020
28
Lisa Yan, 2020
Netw twork c congesti tion: overloading network link, preventing useful communication TCP congestion avoidance:
slowly (additively) with receiver acknowledgments (ACKs)
window quickly (multiplicatively)
29
# packets sent time 1. 2.
data
Sender Receiver
ack
Lisa Yan, 2020
30
Original result from paper
congestion collapse. CCS 2005.
Optimistic acknowledgments (opt-acks) encourage victim senders to send more
Lisa Yan, 2020
31
Original result from paper Students’ reproduced result (2016, blog post)
congestion collapse. CCS 2005.
ns-2 (simulator) Mininet (emulator)
Lisa Yan, 2020
32
Original result from paper (2012)
T.-Y. Huang et al. Confused, Timid, and Unstable: Picking a Video Streaming Rate is Hard. IMC 2012.
Overly conservative video streaming rates lead to dismally low throughput
Lisa Yan, 2020
33
Original result from paper (2012)
T.-Y. Huang et al. Confused, Timid, and Unstable: Picking a Video Streaming Rate is Hard. IMC 2012.
Students’ reproduced result (2013, blog post)
Our experiments use the real backend servers of [Netflix]. We do not use Mininet or any other form of network emulation.
Lisa Yan, 2020
34
Original result from paper (2012)
T.-Y. Huang et al. Confused, Timid, and Unstable: Picking a Video Streaming Rate is Hard. IMC 2012.
Students’ reproduced result (2013, blog post)
We can easily observe that the problem reported in the paper has since been fixed in [Netflix]…We h have contacted t the p paper au authors and they confirm that this is the case.
Lisa Yan, 2020
35
Original result from paper (2012)
T.-Y. Huang et al. Confused, Timid, and Unstable: Picking a Video Streaming Rate is Hard. IMC 2012.
Students’ reproduced result (20 2017, blog post)
We chose to start with examining [Vimeo and YouTube], since they are freely a accessible without a subscription and there exist third party tools … for manipulating video downloads…
Lisa Yan, 2020
36
Original result from paper (2012)
T.-Y. Huang et al. Confused, Timid, and Unstable: Picking a Video Streaming Rate is Hard. IMC 2012.
Students’ reproduced result (2017, blog post)
Experimental results show that Yo YouTube’s player does not exhibit the downward spiral effect…
Lisa Yan, 2020
37
Original result from paper (2012)
T.-Y. Huang et al. Confused, Timid, and Unstable: Picking a Video Streaming Rate is Hard. IMC 2012.
Students’ reproduced result (2017, blog post)
…while Vi Vimeo’s player does.
Lisa Yan, 2020
38
Original result from paper GitHub open-source code Adaptive streaming in wide-area networks (geo-distributed sites, scarce/variable bandwidth)
Lisa Yan, 2020
39
Original result from paper Students’ reproduced result (2019, blog post)
Lisa Yan, 2020
40
Original result from paper
Students’ reproduced result (2019, blog post)
…overall we found [the documentation] to be outdated and at times misleading. We relied mostly on close reading of the code and email c correspondence with t the o
paper’s ’s a authors for guidance.
Lisa Yan, 2020
41
Lisa Yan, 2020
Overambitious engineering Emulator performance restrictions
42
“We spent our last week trying to find a mixed LP optimizer.” (reproduction of FastMPC, SIGCOMM 2015, blog post)
Differences in workloads
“Average QoE measurements were much higher than those reported…our Wifi/International Links more than capable
(reproduction of Pensieve, SIGCOMM 2017, blog post) "We scaled down all load generation parameters, but we still couldn’t achieve target latencies when emulating on a single machine.” (reproduction of QJump, NSDI 2015, blog post)
Lisa Yan, 2020
43
(at least) two examples:
Lisa Yan, 2020
PRR: Proportional Rate Reduction paces out retransmissions across received ACKs
44
The two (additional) discrepancies in /net/ipv4/tcp_input.c do not have a big impact on the experiment results. (2015, blog post)
365 commits to /net/ipv4/tcp_input.c
Ubuntu version Linux kernel 2011 11.10 3.0 No PRR 2014 12.04 3.2 With PRR
12 commits reference PRR
Late 2016 17.04+ 4.10+ Option to turn
Present-day 20.04+ 5.4+
Lisa Yan, 2020
In 2014, HTTPS page load times were slower than HTTP.
45
Original result from paper (Alexa top 500 websites)
noticeably slower
Lisa Yan, 2020
In 2014, HTTPS page load times were slower than HTTP.
46
Original result from paper (Alexa top 500 websites)
Students’ reproduced result (2017, blog post)
(over 4G) (over Fiber)
noticeably slower (vs 40%)
Lisa Yan, 2020
Page Load Time (PLT) much higher in recent years Emulator: Dummynet
47
Pag Page RT RTT (ms ms) PLT: : no non-TF TFO O (s (s) PLT: T : TFO (s (s) Im Improve- me ment: CoNEXT 2011 Amazon 100 2.60 2.34 10% NYTimes 100 4.59 4.30 6% Students 2015 Amazon 100 15.92 12.55 21% NYTimes 100 5.37 4.03 25% (2015, blog post)
Lisa Yan, 2020
48
Lisa Yan, 2020
49
Lisa Yan, 2020
Reproducing research project: Graduate computer networks
Greater impacts
50
Lisa Yan, 2020
51
Student Original researcher New researcher Simulator/ emulator developer
Lisa Yan, 2020
…overall we found [the documentation] to be outdated and at times misleading. We relied mostly on close reading of the code and email c correspondence with t the o
paper’s ’s a authors for guidance.
52
Original result from paper Students’ reproduced result (2019, blog post)
Lisa Yan, 2020
“Their assumption was that [people] would reproduce the results in an actual d datacenter, whereas we did the emulation i in Mi Mininet.” “In the end, we did not use their scripts directly, but it was nice to see that the authors were enthusiastic to have their work reproduced.”
53
(2015, blog post)
Lisa Yan, 2020
54
Student Original researcher New researcher Simulator/ emulator developer I learned how to implement a scheduler for my graduate research! I can confirm and improve my current and past research!
Lisa Yan, 2020
55
Student Original researcher New researcher Simulator/ emulator developer I just started a career in networks, and this prepared me for the real world.
Tool feedback/ development
Mininet
http://mininet.org/
Mahimahi
http://mahimahi.mit.edu/
Lisa Yan, 2020
56
Student Original researcher New researcher Simulator/ emulator developer
A fully reproducible project in the public domain.
build upon it
transfer
We were contacted by both the original authors and a student working on his own research!
Lisa Yan, 2020
57
Lisa Yan, 2020
58
Assignment 1 1 Core topic practice emulation environment
Week 1 mid-quarter 10
Assignment 2 2 Core topic practice emulation environment Final p project Reproduce research
Lisa Yan, 2020
59
Assignment 1 1 Core topic practice emulation environment
Week 1 mid-quarter 10
Assignment 2 2 Reproduce the same project Final p project Reproduce research,
Lisa Yan, 2020
60
Assignment 1 1 Core topic practice emulation environment
Week 1 mid-quarter 10
Assignment 2 2 Core topic practice emulation environment Final p project Reproduce research
to get students started
multitude of applications
Lisa Yan, 2020
61
This talk Reproducible research for all
cs244.stanford.edu/reproducibility
62
https://ccronline.sigcomm.org/2017/learning-networking-by-reproducing-research-results/
Nick McKeown, Keith Winstein, Sachin Katti, Bruce Spang CS 244: Advanced Topics in Networking