NSML : A Machine Learning Platform That Enables You Focus on Your - PowerPoint PPT Presentation

NSML : A Machine Learning Platform That Enables You Focus on Your Models. ML-Sys WS 2017 @ NIPS Nako Sung , Minkyu Kim, Hyunwoo Jo, Youngil Yang, Jinwoong Kim, Leonard Lausen, Youngkwan Kim,   Gayoung Lee, Donghyun Kwak, Jung-Woo Ha, and Sunghun Kim CLOVA AI Research (CLAIR), NAVER | LINE, Search Solution, NAVER Webtoon, HKUST

What is NSML? • A machine learning platform that enables you focus on your models • Two options: on-premise / PaaS

https://xkcd.com/303/

https://www.youtube.com/watch?v=lxZyxxHOw3Y

https://www.youtube.com/watch?v=lxZyxxHOw3Y Wasted Time

https://www.formula1.com/en/latest/features/2017/2/F1-cars-of-2017.html

https://www.formula1.com/en/latest/features/2017/2/F1-cars-of-2017.html Importance of Fast Machines (Multiple Servers and GPUs)

https://www.sportskeeda.com/f1/what-happens-during-f1-pit-stop

https://www.sportskeeda.com/f1/what-happens-during-f1-pit-stop ML Research Challenges: Incidental Tasks

GPU GPU GPU GPU (busy) (idle) (busy) (idle) GPU GPU GPU GPU (idle) (idle) (idle) (idle) Heavy Model GPU GPU GPU GPU (idle) (idle) (idle) (idle) Model Heavy GPU GPU GPU GPU (idle) (idle) (idle) (idle) Model Heavy Heavy Model

ML Research Challenges: Resource Scheduling and Utilization 14 GPUs available but only 7 GPUs can be used in a single machine. GPU GPU GPU GPU (busy) (idle) (busy) (idle) GPU GPU GPU GPU (idle) (idle) (idle) (idle) Heavy Model GPU GPU GPU GPU (idle) (idle) (idle) (idle) Model Heavy GPU GPU GPU GPU (idle) (idle) (idle) (idle) Model Heavy Heavy Model

https://livingthing.danmackinlay.name/automl.html

https://livingthing.danmackinlay.name/automl.html ML Research Challenges: Hyperparameter Tuning

Tensor board Visdom TRAINING TRAINING DONE DONE γ =1e-2 γ =0.3, K=1 γ =0.1 γ =0.2

Visdom Tensor board ML Research Challenges: Multiple Experiments TRAINING TRAINING DONE DONE γ =1e-2 γ =0.3, K=1 γ =0.1 γ =0.2

https://www.linkedin.com/pulse/protecting-workers-who-work-alone-sandie-baillargeon

https://www.linkedin.com/pulse/protecting-workers-who-work-alone-sandie-baillargeon ML Research Challenges: Isolated Researchers

Challenges • Slack • Incidental Tasks • Ine ffi cient resource utilization • Naive hyperparameter tuning • Painful keeping track of multiple sessions • Isolated researchers

Requirements of ML Platforms • Resource Management • Better computational resource management • Data Management • Post datasets once and reuse them for multiple models • Share datasets with others • Serverless Configuration • No framework / library lock-in • Easy and lightweight task submission

Requirements of ML Platforms • Experiment Management and Visualization • Parallel runs with di ff erent jobs priorities • Automatic visualization and summarization of learning progress • Leaderboard • Leaderboard for each dataset to compare models and hyper parameters • AutoML • Experiment performance prediction based on previously run experiments. • Automatic hyper parameter optimization based on the performance predictions.

Limitations of Previous Solutions • Vendor lock-in (Cloud service) • Ine ffi cient model experiments • Inconsistent research environments • Still hard to keep track of experiments

This work was done for NCSoft and was presented at Nvidia GTC Korea 2015. MINI

This work was done for NCSoft and was presented at Nvidia GTC Korea 2015. My Previous Work in Early 2015 MINI

  URI {Dataset} / {User id} / {Session id} / {Model id} • Every dataset, session and model have uniform resource identifier.   CIFAR_10 CIFAR 10 dataset CIFAR_10/researcher_A/24 research_A’s 24th session for CIFAR_10 CIFAR_10/researcher_A/24/322 Snapshot from epoch 322

Easy One-Liner CLI

Easy One-Liner CLI Dataset registration

Easy One-Liner CLI Dataset registration Train

Easy One-Liner CLI Dataset registration Train Serve

Parallel Experiments to Kill Slack Distributed responses Exp. #1 Exp #2. vari. 1 Exp #2. vari. 2 Exp #3 Time

https://www.interaction-design.org/literature/book/the-encyclopedia-of-human-computer-interaction-2nd-ed/data-visualization-for-human-perception Need to Visualize • Balance your brain to understand without e ff ort

Flexible Analysis DONE Your code @1 TRAINING NSML Visualization tool Your code @2 TRAINING Your code @3

Dynamic Control Flow Typical training loop NSML Forward pass Backward pass Communicate to NSML Command queue model 1 Watch a variable change_lr(0.2) 2 Change a hyper parameter on the fly nsml.save(‘quick’) 3 Save current snapshot nsml.load(424) 4 Load saved snapshot 5 vis.image(model.generate(2)) Generate an image to visdom … …. …

CLI • Base of advanced features like save, load, infer, …

Bring Your Own Workspace • (Almost) Nothing to learn • Cached (Fast)

No Framework Lock-in

GPU server 10.0.0.1 python your_model.py stdout Interactive Mode

Pragmatic Research

Collaboration and Competition Leaderboard, CI-ML

New Workflow for ML Research Collaboration and Competition Leaderboard, CI-ML

Collaborative Research • Easy to reproduce and extend other’s research.

Cohesive and Competitive Dataset-centric environment Models are ranked automatically Standardized and Quantified Easy to compete Towards AutoML

AutoML • Quantitive model analysis makes ML workflow as a gym of AutoML

Dataset ASR Bob’s model 12 98.2% Bob’s model 13 94.2% Alice’s model 4 92.1% REST API   Seamless Connection to Services SOTA server https://service.nsml.navercorp.com/ASR

Dataset ASR Bob’s model 12 98.2% Bob’s model 13 94.2% Alice’s model 4 92.1% Alice’s model 5 98.3% REST API   Seamless Connection to Services SOTA server https://service.nsml.navercorp.com/ASR

Q1. 2018

https://research.clova.ai/nsml-alpha Thank you Several Hundreds of GPUs for this alpha (free)

NSML : A Machine Learning Platform That Enables You Focus on Your - PowerPoint PPT Presentation

NSML : A Machine Learning Platform That Enables You Focus on Your Models. ML-Sys WS 2017 @ NIPS Nako Sung , Minkyu Kim, Hyunwoo Jo, Youngil Yang, Jinwoong Kim, Leonard Lausen, Youngkwan Kim, Gayoung Lee, Donghyun Kwak, Jung-Woo Ha, and Sunghun

Overview Focus Projection Focus Projection Focus to Accent Focus to Accent Restricted View of

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

Machine Learning - Intro Aarti Singh Machine Learning 10-701/15-781 Sept 8, 2010 You tell me

Loop is a global platform that enables consumer product companies and retailers to shift from a

INSIDE THE PLATFORM Who are we Classic platforms Classic platform Modern platform Modern

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Machine Learning Algorithms for Classification Machine Learning Algorithms for Classification

MACHINE LEARNING Kernel Canonical Correlation Analysis 1 ADVANCED MACHINE LEARNING ADVANCED

5 th Annual Conference November 9, 2018 Thank you to our Sponsors 5 th Annual Conference Thank

2014 CLAIR Forum Alderman James Walker Clarence City Council Tasmania Akkeshi Japan - sister

Alma Integrations Plugin Kevin Clair 2020 April 8 Background Developed at University of Denver

Time For a New North American Rail Strategy November 28, 2017 Transp sportation a and Logist

Everyone A Curator: Evaluating the Impact of Social Metadata on Libraries, Archives and

Making The Most of Helm 3 DAN GARFIELD | ANNA BAKER Dan Garfield Chief Technology

Agenda Opening Session I. Report from Chair, Debra Dawson S Reports from Executive II. III.

21/11/2016 2 Data and Information: opportunities and challenges DR CLAIRE GRIFFITHS 3 4 5 6