1 https://trallard.github.io/Talks/RSE-shefeld The state of machine - - PowerPoint PPT Presentation

1
SMART_READER_LITE
LIVE PREVIEW

1 https://trallard.github.io/Talks/RSE-shefeld The state of machine - - PowerPoint PPT Presentation

1 https://trallard.github.io/Talks/RSE-shefeld The state of machine learning The state of machine learning RSE seminar, University of Shefeld Tania Allard, PhD 2 . 1 Tania Allard Tania Allard Developer advocate Research Software


slide-1
SLIDE 1

1

slide-2
SLIDE 2

The state of machine learning The state of machine learning

RSE seminar, University of Shefeld Tania Allard, PhD

https://trallard.github.io/Talks/RSE-shefeld

2 . 1

slide-3
SLIDE 3

Tania Allard Tania Allard

Developer advocate Research Software Engineer Data expert  trallard  ixek

2 . 2

slide-4
SLIDE 4

Machine learning Machine learning everywhere everywhere

 ixek

3

slide-5
SLIDE 5

Machine learning Machine learning everywhere everywhere

So much that it is starting to not make sense anymore... like when you say a word 50 times in a row

 ixek

3

slide-6
SLIDE 6

For good or for bad it is everywhere:

 ixek

4

slide-7
SLIDE 7

For good or for bad it is everywhere:  Deployed in healthcare and warfare

 ixek

4

slide-8
SLIDE 8

For good or for bad it is everywhere:   Deployed in healthcare and warfare In the creative industry (from music to books)

 ixek

4

slide-9
SLIDE 9

For good or for bad it is everywhere:    Deployed in healthcare and warfare In the creative industry (from music to books) Reading CVs and judging your creditworthiness

 ixek

4

slide-10
SLIDE 10

For good or for bad it is everywhere:     Deployed in healthcare and warfare In the creative industry (from music to books) Reading CVs and judging your creditworthiness Making us more Instagram worthy

 ixek

4

slide-11
SLIDE 11

The big players:  Apple  Facebook  Google IBM Intel  Microsoft Nvidia Open AI  Twitter

 ixek

5

slide-12
SLIDE 12

Machine learning generalised in two workflows Machine learning generalised in two workflows

Model development (R&D) Model serving (production for customers consumption)

 ixek

6

slide-13
SLIDE 13

 ixek

7

slide-14
SLIDE 14

What are these giants' issues? What are these giants' issues?

 ixek

8

slide-15
SLIDE 15

What are these giants' issues? What are these giants' issues?

Mainly scale...in multiple areas

 ixek

8

slide-16
SLIDE 16

If we have a small team we have a smaller number of issues... right?

 ixek

9

slide-17
SLIDE 17

If we have a small team we have a smaller number of issues... right?  Small number of models to maintain

 ixek

9

slide-18
SLIDE 18

If we have a small team we have a smaller number of issues... right?   Small number of models to maintain People have the knowledge in their heads

 ixek

9

slide-19
SLIDE 19

If we have a small team we have a smaller number of issues... right?    Small number of models to maintain People have the knowledge in their heads They have their own methods to track progress

 ixek

9

slide-20
SLIDE 20

That is the small team performance fallacy That is the small team performance fallacy

We still need processes and best practices in place... so let me get back at this later

 ixek

10

slide-21
SLIDE 21

As the team As the team demand demand grows the problems grow grows the problems grow

    Increased complexity of data ow Larger number of workows Managing complexity of ows and scheduling becomes a nightmare Resource allocation has to be on point

 ixek

11

slide-22
SLIDE 22

Serving models becomes harder Serving models becomes harder

 ixek

12

slide-23
SLIDE 23

 ixek

slide-24
SLIDE 24

13

slide-25
SLIDE 25

How do they serve How do they serve millions of millions of

 ixek

slide-26
SLIDE 26

customers across customers across the globe? the globe?

14

slide-27
SLIDE 27

Three main players:    Infrastructure / resources Processes People

 ixek

15

slide-28
SLIDE 28

 ixek

slide-29
SLIDE 29

16

slide-30
SLIDE 30

 ixek

17

slide-31
SLIDE 31

Infrastructure as a code Infrastructure as a code

 ixek

18

slide-32
SLIDE 32

 ixek

19

slide-33
SLIDE 33

Everything as a code Everything as a code

Version control Less ambiguity on the congurations Shorter turnarounds Deterministic environments

 ixek

20

slide-34
SLIDE 34

Processes Processes

 ixek

21

slide-35
SLIDE 35

 ixek

slide-36
SLIDE 36

22

slide-37
SLIDE 37

Data and code as first class citizens Data and code as first class citizens

 ixek

slide-38
SLIDE 38

23

slide-39
SLIDE 39

 ixek

slide-40
SLIDE 40

24

slide-41
SLIDE 41

People People

Data scientist Data engineer ML Engineer

 ixek

25

slide-42
SLIDE 42

What does academia have to What does academia have to

  • ffer?
  • ffer?

 Much more than you think

 ixek

26

slide-43
SLIDE 43

People People

Researchers Research software engineers Librarians

 ixek

27

slide-44
SLIDE 44

Resources and Infrastructure Resources and Infrastructure

We still need to gure this out... it is pretty much an ad-hoc case

 ixek

28

slide-45
SLIDE 45

Processes Processes

Scientic rigour Peer review Data management

 ixek

29

slide-46
SLIDE 46

Which areas could benefit from academic Which areas could benefit from academic collaborations? collaborations?

 ixek

30

slide-47
SLIDE 47

Meta-learning Meta-learning Humans learn across tasks (learn from experience)

 ixek

slide-48
SLIDE 48

31

slide-49
SLIDE 49

If prior tasks are similar then we can carry prior knowledge

 ixek

32

slide-50
SLIDE 50

AlphaGo uses some sort of meta-learning

 ixek

33

slide-51
SLIDE 51

Algorithmic fairness Algorithmic fairness

It has become increasingly important to ensure that models are making justied calls that are free from unintended bias.

 ixek

34

slide-52
SLIDE 52

Algorithmic fairness Algorithmic fairness

It has become increasingly important to ensure that models are making justied calls that are free from unintended bias. The one way to make progress is through interdisciplinary collaboration

 ixek

34

slide-53
SLIDE 53

Towards model explainability Towards model explainability

Address the trade-off between performance and interpretability

 ixek

35

slide-54
SLIDE 54

Reinforcement learning deadly triad Reinforcement learning deadly triad

Following nature's paradigms RL agents receive awards and then learn to maximise success by performing optimal actions.

 ixek

36

slide-55
SLIDE 55

How to keep an algorithm learning if there are far too many potential variables or outcomes to be evaluated without being fed ridiculous amounts of data.

 ixek

37

slide-56
SLIDE 56

In brief In brief

Focus on the 3 pillars:    People Infrastructure Processes

 ixek

38

slide-57
SLIDE 57

Thank you Thank you

 ixek  tania.allard@microsoft.com

39