ONE DOES NOT T SIMPLY DEPLOY ML INTO PRODUCTI TION Henrik Brink - - PowerPoint PPT Presentation

▶

Jan 16, 2024 199 likes •654 views

ONE DOES NOT T SIMPLY DEPLOY ML INTO PRODUCTI TION Henrik Brink Machine Learning Engineering @ Wise.io / GE Digital brinkar Agenda From space to industrial machine learning Challenges Optimization dimensions Infrastructure

SLIDE 1

brinkar

ONE DOES NOT T SIMPLY DEPLOY ML INTO PRODUCTI TION

Henrik Brink

Machine Learning Engineering @ Wise.io / GE Digital

SLIDE 2

brinkar

Agenda

From space to industrial machine learning
Challenges
Optimization dimensions
Infrastructure
Development and deployment
Solutions
The ML meta-algorithm
Containerization
Best engineering practices
Wrap-up and questions

SLIDE 3

brinkar

From astronomy to industrial machine learning...

SLIDE 4

brinkar

SLIDE 5

brinkar

SLIDE 6

brinkar

SLIDE 7

brinkar

SLIDE 8

brinkar

SLIDE 9

brinkar

SLIDE 10

brinkar

SLIDE 11

brinkar

Everyone ready...?

SLIDE 12

brinkar

ONE DOES NOT SIMPLY DEPLOY ML INTO PRODUCTION

SLIDE 13

brinkar

Wh What t to opti timi mize for when bui buildi ding ng ML ML systems?

SLIDE 14

brinkar

Accu Accuracy? cy?

SLIDE 15

brinkar

Implemen Implementatio ion c n cost?

SLIDE 16

brinkar

1920 CPUs 280 GPUs

Ru Runti time co cost?

SLIDE 17

brinkar

In Inter erpr pretabilit ability?

SLIDE 18

brinkar

arxiv.org/abs/1602.04938

LOCAL INTERPRETABLE MODEL- AGNOSTIC EXPLANATIONS (LIME

SLIDE 19

brinkar

Au Autom

mati

tion

vs vs aug augmen mentatio ion? n?

SLIDE 20

brinkar

Wh What t to opti timi mize for when bui buildi ding ng ML ML systems?

Accuracy?
Implementation cost?
Runtime cost?
Interpretability?
Automation vs augmentation?

SLIDE 21

brinkar

1. Identify and define the problem 2. Collect and understand the data 3. Build and deploy a simple model that works end-to-end 4. Iterate to optimize an deploy improved models (inner loop) 5. Monitor and back-propagate changes to problem parameters (outer loop)

A A 5-st step ML meta-alg algorit ithm hm

SLIDE 22

brinkar

1. Identify and define the problem
5. Monitor and back-propagate changes to problem parameters

SLIDE 23

brinkar

2. Collect and understand the data

SLIDE 24

brinkar

2. Collect and understand the data

SLIDE 25

brinkar

2. Collect and understand the data

SLIDE 26

brinkar

3. Deploy simple model end-to-end
4. Iterate to optimize an deploy improved models
Simplest possible model that solves the

problem

End-to-end production deployment:

automated deployment, testing, logging, monitoring, feedback

SLIDE 27

brinkar

Getting cl closer...

SLIDE 28

brinkar

Mach chine learning infrastruct cture

SLIDE 29

brinkar

Co Continuous integration

SLIDE 30

brinkar

SLIDE 31

brinkar

3 r 3 reaso eason f n for using using c container ainers in s in mac machine lear hine learning ning...

SLIDE 32

brinkar

1 P 1 Pac ackag aging ing

SLIDE 33

brinkar

2 Inher 2 Inherit itanc ance

FROM tensorflow/tensorflow:latest-gpu # Your specialized modeling pipeline FROM my-special-pipeline # Your even more specialized pipeline

tensorflow images segmentation classification text sentiment

SLIDE 34

brinkar

Immut Immutabilit ability

SLIDE 35

brinkar

“Data should go through the exact same pipeline when making predictions, as when the model was built.”

- Good ML practitioner

SLIDE 36

brinkar

Us Use gr great t framewor

rks and service

ces... ...

SLIDE 37

brinkar

Op Open sou

urce

ce ML framewor

SLIDE 38

brinkar

RISELab Clipper

Deploy models trained in your choice of framework to Clipper

with a few lines of code by using an existing model container or writing your own

Easily update or add models to running applications
Use adversarial bandit algorithms to dynamically select best

model for prediction at serving time

Set latency service level objectives for reliable query latencies
Run each model in a separate Docker container for simple

cluster management and resource allocation

Deploy models running on CPUs, GPUs, or both in the same

application

SLIDE 39

brinkar

Ho Hosted M ed ML ser servic ices es

SLIDE 40

brinkar

Mach chine learning infrastruct cture

Continuous integration
Containerization
Utilize ML platforms

SLIDE 41

brinkar

Wr Wrap up...

SLIDE 42

brinkar

Real-World Machine Learning

manning.com/brink

SLIDE 43

brinkar

meetup.com/datacph nordic.ai

SLIDE 44

brinkar

ONE DOES NOT T SIMPLY DEPLOY ML INTO PRODUCTI TION

Agenda

From astronomy to industrial machine learning...

Everyone ready...?

ONE DOES NOT SIMPLY DEPLOY ML INTO PRODUCTION

Wh What t to opti timi mize for when bui buildi ding ng ML ML systems?

Accu Accuracy? cy?

Implemen Implementatio ion c n cost?

Ru Runti time co cost?

In Inter erpr pretabilit ability?

LOCAL INTERPRETABLE MODEL- AGNOSTIC EXPLANATIONS (LIME

Au Autom

tion

vs vs aug augmen mentatio ion? n?

Wh What t to opti timi mize for when bui buildi ding ng ML ML systems?

1. Identify and define the problem 2. Collect and understand the data 3. Build and deploy a simple model that works end-to-end 4. Iterate to optimize an deploy improved models (inner loop) 5. Monitor and back-propagate changes to problem parameters (outer loop)

A A 5-st step ML meta-alg algorit ithm hm

problem

automated deployment, testing, logging, monitoring, feedback

Getting cl closer...

Mach chine learning infrastruct cture

Co Continuous integration

3 r 3 reaso eason f n for using using c container ainers in s in mac machine lear hine learning ning...

1 P 1 Pac ackag aging ing

2 Inher 2 Inherit itanc ance

Immut Immutabilit ability

Us Use gr great t framewor

ces... ...

Op Open sou

ce ML framewor

RISELab Clipper

Ho Hosted M ed ML ser servic ices es

Mach chine learning infrastruct cture

Wr Wrap up...

Real-World Machine Learning

Qu Questi tions? s?