1 https://trallard.github.io/Talks/RSE-shefeld The state of machine - - PowerPoint PPT Presentation

▶

Aug 23, 2023 422 likes •1.01k views

1 https://trallard.github.io/Talks/RSE-shefeld The state of machine learning The state of machine learning RSE seminar, University of Shefeld Tania Allard, PhD 2 . 1 Tania Allard Tania Allard Developer advocate Research Software

SLIDE 1

SLIDE 2

The state of machine learning The state of machine learning

RSE seminar, University of Shefeld Tania Allard, PhD

https://trallard.github.io/Talks/RSE-shefeld

2 . 1

SLIDE 3

Tania Allard Tania Allard

Developer advocate Research Software Engineer Data expert  trallard  ixek

2 . 2

SLIDE 4

Machine learning Machine learning everywhere everywhere

 ixek

SLIDE 5

Machine learning Machine learning everywhere everywhere

So much that it is starting to not make sense anymore... like when you say a word 50 times in a row

 ixek

SLIDE 6

For good or for bad it is everywhere:

 ixek

SLIDE 7

For good or for bad it is everywhere:  Deployed in healthcare and warfare

 ixek

SLIDE 8

For good or for bad it is everywhere:   Deployed in healthcare and warfare In the creative industry (from music to books)

 ixek

SLIDE 9

For good or for bad it is everywhere:    Deployed in healthcare and warfare In the creative industry (from music to books) Reading CVs and judging your creditworthiness

 ixek

SLIDE 10

For good or for bad it is everywhere:     Deployed in healthcare and warfare In the creative industry (from music to books) Reading CVs and judging your creditworthiness Making us more Instagram worthy

 ixek

SLIDE 11

The big players:  Apple  Facebook  Google IBM Intel  Microsoft Nvidia Open AI  Twitter

 ixek

SLIDE 12

Machine learning generalised in two workflows Machine learning generalised in two workflows

Model development (R&D) Model serving (production for customers consumption)

 ixek

SLIDE 13

 ixek

SLIDE 14

What are these giants' issues? What are these giants' issues?

 ixek

SLIDE 15

What are these giants' issues? What are these giants' issues?

Mainly scale...in multiple areas

 ixek

SLIDE 16

If we have a small team we have a smaller number of issues... right?

 ixek

SLIDE 17

If we have a small team we have a smaller number of issues... right?  Small number of models to maintain

 ixek

SLIDE 18

If we have a small team we have a smaller number of issues... right?   Small number of models to maintain People have the knowledge in their heads

 ixek

SLIDE 19

If we have a small team we have a smaller number of issues... right?    Small number of models to maintain People have the knowledge in their heads They have their own methods to track progress

 ixek

SLIDE 20

That is the small team performance fallacy That is the small team performance fallacy

We still need processes and best practices in place... so let me get back at this later

 ixek

SLIDE 21

As the team As the team demand demand grows the problems grow grows the problems grow

    Increased complexity of data ow Larger number of workows Managing complexity of ows and scheduling becomes a nightmare Resource allocation has to be on point

 ixek

SLIDE 22

Serving models becomes harder Serving models becomes harder

 ixek

SLIDE 23

 ixek

SLIDE 24

SLIDE 25

How do they serve How do they serve millions of millions of

 ixek

SLIDE 26

customers across customers across the globe? the globe?

SLIDE 27

Three main players:    Infrastructure / resources Processes People

 ixek

SLIDE 28

 ixek

SLIDE 29

SLIDE 30

 ixek

SLIDE 31

Infrastructure as a code Infrastructure as a code

 ixek

SLIDE 32

 ixek

SLIDE 33

Everything as a code Everything as a code

Version control Less ambiguity on the congurations Shorter turnarounds Deterministic environments

 ixek

SLIDE 34

Processes Processes

 ixek

SLIDE 35

 ixek

SLIDE 36

SLIDE 37

Data and code as first class citizens Data and code as first class citizens

 ixek

SLIDE 38

SLIDE 39

 ixek

SLIDE 40

SLIDE 41

People People

Data scientist Data engineer ML Engineer

 ixek

SLIDE 42

What does academia have to What does academia have to

ffer?
ffer?

 Much more than you think

 ixek

SLIDE 43

People People

Researchers Research software engineers Librarians

 ixek

SLIDE 44

Resources and Infrastructure Resources and Infrastructure

We still need to gure this out... it is pretty much an ad-hoc case

 ixek

SLIDE 45

Processes Processes

Scientic rigour Peer review Data management

 ixek

SLIDE 46

Which areas could benefit from academic Which areas could benefit from academic collaborations? collaborations?

 ixek

SLIDE 47

Meta-learning Meta-learning Humans learn across tasks (learn from experience)

 ixek

SLIDE 48

SLIDE 49

If prior tasks are similar then we can carry prior knowledge

 ixek

SLIDE 50

AlphaGo uses some sort of meta-learning

 ixek

SLIDE 51

Algorithmic fairness Algorithmic fairness

It has become increasingly important to ensure that models are making justied calls that are free from unintended bias.

 ixek

SLIDE 52

Algorithmic fairness Algorithmic fairness

It has become increasingly important to ensure that models are making justied calls that are free from unintended bias. The one way to make progress is through interdisciplinary collaboration

 ixek

SLIDE 53

Towards model explainability Towards model explainability

Address the trade-off between performance and interpretability

 ixek

SLIDE 54

Reinforcement learning deadly triad Reinforcement learning deadly triad

Following nature's paradigms RL agents receive awards and then learn to maximise success by performing optimal actions.

 ixek

SLIDE 55

How to keep an algorithm learning if there are far too many potential variables or outcomes to be evaluated without being fed ridiculous amounts of data.

 ixek

SLIDE 56

In brief In brief

Focus on the 3 pillars:    People Infrastructure Processes

 ixek

SLIDE 57

Thank you Thank you

 ixek  tania.allard@microsoft.com