1
play

1 https://trallard.github.io/Talks/RSE-shefeld The state of machine - PowerPoint PPT Presentation

1 https://trallard.github.io/Talks/RSE-shefeld The state of machine learning The state of machine learning RSE seminar, University of Shefeld Tania Allard, PhD 2 . 1 Tania Allard Tania Allard Developer advocate Research Software


  1. 1

  2. https://trallard.github.io/Talks/RSE-shef�eld The state of machine learning The state of machine learning RSE seminar, University of Shef�eld Tania Allard, PhD 2 . 1

  3. Tania Allard Tania Allard Developer advocate Research Software Engineer Data expert  trallard  ixek 2 . 2

  4.  ixek Machine learning Machine learning everywhere everywhere 3

  5.  ixek Machine learning Machine learning everywhere everywhere So much that it is starting to not make sense anymore... like when you say a word 50 times in a row 3

  6.  ixek For good or for bad it is everywhere: 4

  7.  ixek For good or for bad it is everywhere:  Deployed in healthcare and warfare 4

  8.  ixek For good or for bad it is everywhere: Deployed in healthcare and warfare  In the creative industry (from music to books)  4

  9.  ixek For good or for bad it is everywhere: Deployed in healthcare and warfare  In the creative industry (from music to books)  Reading CVs and judging your creditworthiness  4

  10.  ixek For good or for bad it is everywhere: Deployed in healthcare and warfare  In the creative industry (from music to books)  Reading CVs and judging your creditworthiness  Making us more Instagram worthy  4

  11.  ixek The big players:  Apple  Facebook  Google IBM Intel  Microsoft Nvidia Open AI  Twitter 5

  12.  ixek Machine learning generalised in two workflows Machine learning generalised in two workflows Model development (R&D) Model serving (production for customers consumption) 6

  13.  ixek 7

  14.  ixek What are these giants' issues? What are these giants' issues? 8

  15.  ixek What are these giants' issues? What are these giants' issues? Mainly scale...in multiple areas 8

  16.  ixek If we have a small team we have a smaller number of issues... right? 9

  17.  ixek If we have a small team we have a smaller number of issues... right?  Small number of models to maintain 9

  18.  ixek If we have a small team we have a smaller number of issues... right? Small number of models to maintain  People have the knowledge in their heads  9

  19.  ixek If we have a small team we have a smaller number of issues... right? Small number of models to maintain  People have the knowledge in their heads  They have their own methods to track progress  9

  20.  ixek That is the small team performance fallacy That is the small team performance fallacy We still need processes and best practices in place... so let me get back at this later 10

  21.  ixek As the team As the team demand demand grows the problems grow grows the problems grow Increased complexity of data �ow  Larger number of work�ows  Managing complexity of �ows and scheduling becomes a nightmare  Resource allocation has to be on point  11

  22.  ixek Serving models becomes harder Serving models becomes harder 12

  23.  ixek

  24. 13

  25.  ixek How do they serve How do they serve millions of millions of

  26. customers across customers across the globe? the globe? 14

  27.  ixek Three main players: Infrastructure / resources  Processes  People  15

  28.  ixek

  29. 16

  30.  ixek 17

  31.  ixek Infrastructure as a code Infrastructure as a code 18

  32.  ixek 19

  33.  ixek Everything as a code Everything as a code Version control Less ambiguity on the con�gurations Shorter turnarounds Deterministic environments 20

  34.  ixek Processes Processes 21

  35.  ixek

  36. 22

  37.  ixek Data and code as first class citizens Data and code as first class citizens

  38. 23

  39.  ixek

  40. 24

  41.  ixek People People Data scientist Data engineer ML Engineer 25

  42.  ixek What does academia have to What does academia have to offer? offer?  Much more than you think 26

  43.  ixek People People Researchers Research software engineers Librarians 27

  44.  ixek Resources and Infrastructure Resources and Infrastructure We still need to �gure this out... it is pretty much an ad-hoc case 28

  45.  ixek Processes Processes Scienti�c rigour Peer review Data management 29

  46.  ixek Which areas could benefit from academic Which areas could benefit from academic collaborations? collaborations? 30

  47.  ixek Meta-learning Meta-learning Humans learn across tasks (learn from experience)

  48. 31

  49.  ixek If prior tasks are similar then we can carry prior knowledge 32

  50.  ixek AlphaGo uses some sort of meta-learning 33

  51.  ixek Algorithmic fairness Algorithmic fairness It has become increasingly important to ensure that models are making justi�ed calls that are free from unintended bias. 34

  52.  ixek Algorithmic fairness Algorithmic fairness It has become increasingly important to ensure that models are making justi�ed calls that are free from unintended bias. The one way to make progress is through interdisciplinary collaboration 34

  53.  ixek Towards model explainability Towards model explainability Address the trade-off between performance and interpretability 35

  54.  ixek Reinforcement learning deadly triad Reinforcement learning deadly triad Following nature's paradigms RL agents receive awards and then learn to maximise success by performing optimal actions. 36

  55.  ixek How to keep an algorithm learning if there are far too many potential variables or outcomes to be evaluated without being fed ridiculous amounts of data. 37

  56.  ixek In brief In brief Focus on the 3 pillars: People  Infrastructure  Processes  38

  57. Thank you Thank you  ixek  tania.allard@microsoft.com 39

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend