Asynchronous Hyperparameter Tuning and Ablation Studies with Apache Spark
sinash@kth.se
Sina Sheikholeslami
Distributed Computing Group, KTH Royal Institute of Technology
October 16 2019 CASTOR Software Days 2019
@cutlash
Asynchronous Hyperparameter Tuning and Ablation Studies with Apache - - PowerPoint PPT Presentation
Asynchronous Hyperparameter Tuning and Ablation Studies with Apache Spark Sina Sheikholeslami Distributed Computing Group, KTH Royal Institute of Technology @cutlash CASTOR Software Days 2019 October 16 2019 sinash@kth.se The Machine
sinash@kth.se
Sina Sheikholeslami
Distributed Computing Group, KTH Royal Institute of Technology
October 16 2019 CASTOR Software Days 2019
@cutlash
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 2
Dataset Machine Learning Model Optimizer Evaluate Problem Definition Data Preparation Model Selection
Repeat if needed Model Training
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 3
Input Layer Hidden Layer Output Layer
ablative brain surgeries by Jean Pierre Flourens
(1794 - 1867)
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 4
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 5
Dataset Machine Learning Model Optimizer Evaluate Problem Definition Data Preparation Model Selection
Repeat if needed Model Training
area rooms floors price
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 6
“Too frequently, authors propose many tweaks absent proper ablation studies … Sometimes just one of the changes is actually responsible for the improved results … this practice misleads readers to believe that all of the proposed changes are necessary.”
(Lipton & Steinhardt, “Troubling Trends in Machine Learning Scholarship”)
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 7
Accuracy: 78%
The Base Model
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 8
Accuracy: 73%
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 9
The Base Model
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 10
Accuracy: 67%
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 11
The Base Model
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 12
Accuracy: 63%
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 13
Machine Learning System
Ablation
New Dataset / Model Configuration Evaluate
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 14
Machine Learning System
Hyperparameter Tuner
New Hyperparameter Values Evaluate
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 15
Machine Learning System
Global Experiment Controller
New Trial Evaluate
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 18
(“Parallel Training” - by Maxim Melnikov)
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 19
(TensorFlow, the TensorFlow logo and any related marks are trademarks of Google Inc.)
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 20
Open-source Platform for Data-intensive AI
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 21
Open-source Platform for Data-intensive AI
What is Hopsworks? https://tinyurl.com/y4ze79d4
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 22
Data Pipelines Ingest & Prep Feature Store Machine Learning Experiments Data Parallel Training Model Serving
Ablation Studies Hyperparameter Tuning
Bottleneck, due to
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 23
Task11
Driver
Task12 Task13 Task1N
…
HDFS
Task21 Task22 Task23 Task2N
…
Barrier Barrier
Task31 Task32 Task33 Task3N
…
Barrier
Metrics1 Metrics2 Metrics3
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 24
Task11
Driver
Task12 Task13 Task1N
…
HDFS
Task21 Task22 Task23 Task2N
…
Barrier Barrier
Task31 Task32 Task33 Task3N
…
Barrier
Metrics1 Metrics2 Metrics3 Wasted Compute Wasted Compute Wasted Compute
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 25
An Open-source Framework for Asynchronous Computation on top of Apache Spark
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 26
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 27 Task11
Driver
Task12 Task13 Task1N
…
Barrier
Metrics New Trial
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 28
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 29
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 30
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 31
area rooms floors price rooms floors price
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 32
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 33
(Example Notebook Available!)
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 34
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 35
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 36
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 37
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 38
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 39
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 40
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 41
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 42
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 43
October 16 2019 Sina Sheikholeslami - KTH Royal Institute of Technology 44
@logicalclocks @hopsworks GitHub
https://github.com/hopshadoop/maggy https://maggy.readthedocs.io/en/latest/ https://logicalclocks.com/whitepapers/ Thanks to the entire Logical Clocks Team J Specially: Moritz Meister @morimeister Jim Dowling @jim_dowling Robin Andersson @robzor92 Kim Hammar @KimHammar1 Alex Ormenisan @alex_ormenisan
(Example Notebook Available!)
sinash@kth.se October 16 2019 CASTOR Software Days 2019
@cutlash