RSE 2.0 RSE 2.0 Mark Woodbridge, Imperial College London deRSE19 - - PowerPoint PPT Presentation

rse 2 0 rse 2 0
SMART_READER_LITE
LIVE PREVIEW

RSE 2.0 RSE 2.0 Mark Woodbridge, Imperial College London deRSE19 - - PowerPoint PPT Presentation

RSE 2.0 RSE 2.0 Mark Woodbridge, Imperial College London deRSE19 Potsdam 6 June 2019 INTRODUCTION INTRODUCTION I lead the RSE team at Imperial College London I have previously been a Computer Scientist, soware engineer and


slide-1
SLIDE 1

RSE 2.0 RSE 2.0

Mark Woodbridge, Imperial College London deRSE19 – Potsdam – 6 June 2019

slide-2
SLIDE 2

INTRODUCTION INTRODUCTION

I lead the RSE team at Imperial College London I have previously been a Computer Scientist, soware engineer and bioinformatician I starting working as an RSE ~17 years ago

slide-3
SLIDE 3

PURPOSE OF THIS TALK PURPOSE OF THIS TALK

slide-4
SLIDE 4

PURPOSE OF THIS TALK PURPOSE OF THIS TALK

RSE remains an emerging practice/role/profession

slide-5
SLIDE 5

PURPOSE OF THIS TALK PURPOSE OF THIS TALK

RSE remains an emerging practice/role/profession Much effort (rightly) focused on bringing soware engineering best practices into research

slide-6
SLIDE 6

PURPOSE OF THIS TALK PURPOSE OF THIS TALK

RSE remains an emerging practice/role/profession Much effort (rightly) focused on bringing soware engineering best practices into research Can we now look to the future, identify prevailing trends and prepare accordingly?

slide-7
SLIDE 7

PURPOSE OF THIS TALK PURPOSE OF THIS TALK

RSE remains an emerging practice/role/profession Much effort (rightly) focused on bringing soware engineering best practices into research Can we now look to the future, identify prevailing trends and prepare accordingly? These are subjective, speculative opinions intended (only!) to foster reflection & discussion

slide-8
SLIDE 8

AGENDA AGENDA

Trends Technology development Soware engineering Research practices Wider issues Implications RSE Groups Individual RSEs Researchers, institutions and funders Conclusions

slide-9
SLIDE 9

TRENDS: TECHNOLOGY TRENDS: TECHNOLOGY

slide-10
SLIDE 10

TRENDS: TECHNOLOGY TRENDS: TECHNOLOGY

Disciplines, communities, languages and codes

slide-11
SLIDE 11

TRENDS: TECHNOLOGY TRENDS: TECHNOLOGY

Disciplines, communities, languages and codes Established vs emerging

slide-12
SLIDE 12

TRENDS: TECHNOLOGY TRENDS: TECHNOLOGY

Disciplines, communities, languages and codes Established vs emerging Infrastructure/services, use cases, funding models

slide-13
SLIDE 13

TRENDS: TECHNOLOGY TRENDS: TECHNOLOGY

Disciplines, communities, languages and codes Established vs emerging Infrastructure/services, use cases, funding models Legacy vs novel

slide-14
SLIDE 14

TRENDS: TECHNOLOGY TRENDS: TECHNOLOGY

Disciplines, communities, languages and codes Established vs emerging Infrastructure/services, use cases, funding models Legacy vs novel Pace of change

slide-15
SLIDE 15

TRENDS: TECHNOLOGY TRENDS: TECHNOLOGY

Disciplines, communities, languages and codes Established vs emerging Infrastructure/services, use cases, funding models Legacy vs novel Pace of change Compute capability/accessibility, tools

slide-16
SLIDE 16

PYTHON: ONWARDS AND PYTHON: ONWARDS AND UPWARDS UPWARDS

Python, the fastest-growing major programming language, has risen in the ranks of programming languages in our survey yet again Stack Overflow Developer Survey 2019

slide-17
SLIDE 17

TRENDS: SOFTWARE TRENDS: SOFTWARE ENGINEERING ENGINEERING

slide-18
SLIDE 18

TRENDS: SOFTWARE TRENDS: SOFTWARE ENGINEERING ENGINEERING

Past: Version control

slide-19
SLIDE 19

TRENDS: SOFTWARE TRENDS: SOFTWARE ENGINEERING ENGINEERING

Past: Version control Present: Build scripts, tests, CI

slide-20
SLIDE 20

TRENDS: SOFTWARE TRENDS: SOFTWARE ENGINEERING ENGINEERING

Past: Version control Present: Build scripts, tests, CI Future: Soware quality assurance

slide-21
SLIDE 21

TRENDS: SOFTWARE TRENDS: SOFTWARE ENGINEERING ENGINEERING

Past: Version control Present: Build scripts, tests, CI Future: Automate linting, testing, vuln scanning Soware quality assurance

slide-22
SLIDE 22

TRENDS: SOFTWARE TRENDS: SOFTWARE ENGINEERING ENGINEERING

Past: Version control Present: Build scripts, tests, CI Future: Automate linting, testing, vuln scanning Measure (and track) code quality, test coverage, performance, documentation… Soware quality assurance

slide-23
SLIDE 23

TRENDS: SOFTWARE TRENDS: SOFTWARE ENGINEERING ENGINEERING

Past: Version control Present: Build scripts, tests, CI Future: Automate linting, testing, vuln scanning Measure (and track) code quality, test coverage, performance, documentation… Code quality (type hints, code suggestions…) Soware quality assurance

slide-24
SLIDE 24

TRENDS: SOFTWARE TRENDS: SOFTWARE ENGINEERING ENGINEERING

Past: Version control Present: Build scripts, tests, CI Future: Automate linting, testing, vuln scanning Measure (and track) code quality, test coverage, performance, documentation… Code quality (type hints, code suggestions…) e.g. Facebook: (IDE), (CI) Soware quality assurance Aroma Getafix

slide-25
SLIDE 25

SOFTWARE IN RESEARCH SOFTWARE IN RESEARCH

Johanson and Hasselbring: While the importance of in silico experiments for the scientific discovery process increases, state-of- the-art soware engineering practices are rarely adopted in computational science Soware Engineering for Computational Science: Past, Present, Future

slide-26
SLIDE 26

LEARNING-BASED DEVELOPMENT LEARNING-BASED DEVELOPMENT

Erik Meijer: This new paradigm of soware creation will require a radical rethinking of the ancestral soware engineering and imperative programming practices that have been developed in the second half of the last century. Machine Learning: Alchemy for the Modern Computer Scientist

slide-27
SLIDE 27

DATA-DRIVEN PROGRAMMING DATA-DRIVEN PROGRAMMING

Andrej Karpathy: … our approach is to specify some goal on the behavior of a desirable program, write a rough skeleton of the code that identifies a subset of program space to search, and use the computational resources at our disposal to search this space for a program that works Soware 2.0

slide-28
SLIDE 28

COMPUTATIONAL INTELLIGENCE COMPUTATIONAL INTELLIGENCE

Stephan Wolfram: It’s the pattern of technology today, and it’s going to increasingly be the pattern of technology in the future: we humans define what we want to do—we set up goals—and then technology, as efficiently as possible, tries to do what we want. A World Run with Code

slide-29
SLIDE 29

TRENDS: RESEARCH TRENDS: RESEARCH

slide-30
SLIDE 30

TRENDS: RESEARCH TRENDS: RESEARCH

Data-driven: plan, perform and analyse

slide-31
SLIDE 31

TRENDS: RESEARCH TRENDS: RESEARCH

Data-driven: plan, perform and analyse Daphne Ezer and Kirstie Whitaker: Data science for the scientific life cycle

slide-32
SLIDE 32

TRENDS: RESEARCH TRENDS: RESEARCH

Data-driven: plan, perform and analyse Daphne Ezer and Kirstie Whitaker: Interdisciplinary: common infrastructure, workspace, framework Data science for the scientific life cycle

slide-33
SLIDE 33

TRENDS: RESEARCH TRENDS: RESEARCH

Data-driven: plan, perform and analyse Daphne Ezer and Kirstie Whitaker: Interdisciplinary: common infrastructure, workspace, framework Collaborative: distributed research, data gathering and soware development Data science for the scientific life cycle

slide-34
SLIDE 34

TRENDS: RESEARCH TRENDS: RESEARCH

Data-driven: plan, perform and analyse Daphne Ezer and Kirstie Whitaker: Interdisciplinary: common infrastructure, workspace, framework Collaborative: distributed research, data gathering and soware development Integrity: repeatability and reproducibility Data science for the scientific life cycle

slide-35
SLIDE 35

TRENDS: GENERAL TRENDS: GENERAL

slide-36
SLIDE 36

TRENDS: GENERAL TRENDS: GENERAL

Quantified impact

slide-37
SLIDE 37

TRENDS: GENERAL TRENDS: GENERAL

Quantified impact Skills gap (acquired vs required)

slide-38
SLIDE 38

TRENDS: GENERAL TRENDS: GENERAL

Quantified impact Skills gap (acquired vs required) Expectations of usability/a11y/security/privacy

slide-39
SLIDE 39

TRENDS: GENERAL TRENDS: GENERAL

Quantified impact Skills gap (acquired vs required) Expectations of usability/a11y/security/privacy Growth in industrial research

slide-40
SLIDE 40

TRENDS: GENERAL TRENDS: GENERAL

Quantified impact Skills gap (acquired vs required) Expectations of usability/a11y/security/privacy Growth in industrial research Recognition of role, influence beyond research

slide-41
SLIDE 41

TRENDS: GENERAL TRENDS: GENERAL

Quantified impact Skills gap (acquired vs required) Expectations of usability/a11y/security/privacy Growth in industrial research Recognition of role, influence beyond research Appreciation that diversity can improve outcomes

slide-42
SLIDE 42

IMPLICATIONS: RSE GROUPS (1) IMPLICATIONS: RSE GROUPS (1)

slide-43
SLIDE 43

IMPLICATIONS: RSE GROUPS (1) IMPLICATIONS: RSE GROUPS (1)

Broader services

slide-44
SLIDE 44

IMPLICATIONS: RSE GROUPS (1) IMPLICATIONS: RSE GROUPS (1)

Broader services UCL-RITS : “consultancy service in artificial intelligence (AI) and data science” AI Studio

slide-45
SLIDE 45

IMPLICATIONS: RSE GROUPS (1) IMPLICATIONS: RSE GROUPS (1)

Broader services UCL-RITS : “consultancy service in artificial intelligence (AI) and data science” Infrastructure: CI, GPUs, notebooks, storage AI Studio

slide-46
SLIDE 46

IMPLICATIONS: RSE GROUPS (1) IMPLICATIONS: RSE GROUPS (1)

Broader services UCL-RITS : “consultancy service in artificial intelligence (AI) and data science” Infrastructure: CI, GPUs, notebooks, storage Scalable activities AI Studio

slide-47
SLIDE 47

IMPLICATIONS: RSE GROUPS (1) IMPLICATIONS: RSE GROUPS (1)

Broader services UCL-RITS : “consultancy service in artificial intelligence (AI) and data science” Infrastructure: CI, GPUs, notebooks, storage Scalable activities Less pairing and “product development” AI Studio

slide-48
SLIDE 48

IMPLICATIONS: RSE GROUPS (1) IMPLICATIONS: RSE GROUPS (1)

Broader services UCL-RITS : “consultancy service in artificial intelligence (AI) and data science” Infrastructure: CI, GPUs, notebooks, storage Scalable activities Less pairing and “product development” More resources, exemplars, training, community building, self-service… AI Studio

slide-49
SLIDE 49

IMPLICATIONS: RSE GROUPS (2) IMPLICATIONS: RSE GROUPS (2)

slide-50
SLIDE 50

IMPLICATIONS: RSE GROUPS (2) IMPLICATIONS: RSE GROUPS (2)

Quantify impact/benefits

slide-51
SLIDE 51

IMPLICATIONS: RSE GROUPS (2) IMPLICATIONS: RSE GROUPS (2)

Quantify impact/benefits HPC utilisation, source control adoption, reproducibility, code citations…

slide-52
SLIDE 52

IMPLICATIONS: RSE GROUPS (2) IMPLICATIONS: RSE GROUPS (2)

Quantify impact/benefits HPC utilisation, source control adoption, reproducibility, code citations… Allocate (more) staff time for L&D, prototyping

slide-53
SLIDE 53

IMPLICATIONS: RSE GROUPS (2) IMPLICATIONS: RSE GROUPS (2)

Quantify impact/benefits HPC utilisation, source control adoption, reproducibility, code citations… Allocate (more) staff time for L&D, prototyping (Re)structure groups appropriately

slide-54
SLIDE 54

IMPLICATIONS: RSE GROUPS (2) IMPLICATIONS: RSE GROUPS (2)

Quantify impact/benefits HPC utilisation, source control adoption, reproducibility, code citations… Allocate (more) staff time for L&D, prototyping (Re)structure groups appropriately Daniel Katz et al: Research Soware Development & Management in Universities

slide-55
SLIDE 55

IMPLICATIONS: RSE GROUPS (2) IMPLICATIONS: RSE GROUPS (2)

Quantify impact/benefits HPC utilisation, source control adoption, reproducibility, code citations… Allocate (more) staff time for L&D, prototyping (Re)structure groups appropriately Daniel Katz et al: Produce less code, do more Research Soware Development & Management in Universities code reviews

slide-56
SLIDE 56

SLDC AND TECHNICAL DEBT SLDC AND TECHNICAL DEBT

Eric Lee: However, the code itself is not intrinsically valuable except as tool to accomplish some goal. Meanwhile, code has ongoing costs. You have to understand it, you have to maintain it, you have to adapt it to new goals over time. The more code you have, the larger those ongoing costs will be. Source Code Is A Liability, Not An Asset

slide-57
SLIDE 57

IMPLICATIONS: RSES (1) IMPLICATIONS: RSES (1)

slide-58
SLIDE 58

IMPLICATIONS: RSES (1) IMPLICATIONS: RSES (1)

Be prepared for continuous learning

slide-59
SLIDE 59

IMPLICATIONS: RSES (1) IMPLICATIONS: RSES (1)

Be prepared for continuous learning Consider specialisation

slide-60
SLIDE 60

IMPLICATIONS: RSES (1) IMPLICATIONS: RSES (1)

Be prepared for continuous learning Consider specialisation Role, discipline, domain and/or technology

slide-61
SLIDE 61

IMPLICATIONS: RSES (1) IMPLICATIONS: RSES (1)

Be prepared for continuous learning Consider specialisation Role, discipline, domain and/or technology Seek a mentor

slide-62
SLIDE 62

IMPLICATIONS: RSES (1) IMPLICATIONS: RSES (1)

Be prepared for continuous learning Consider specialisation Role, discipline, domain and/or technology Seek a mentor There are more candidates than ever before!

slide-63
SLIDE 63

IMPLICATIONS: RSES (1) IMPLICATIONS: RSES (1)

Be prepared for continuous learning Consider specialisation Role, discipline, domain and/or technology Seek a mentor There are more candidates than ever before! UKRSE and deRSE can enable this

slide-64
SLIDE 64

IMPLICATIONS: RSES (2) IMPLICATIONS: RSES (2)

slide-65
SLIDE 65

IMPLICATIONS: RSES (2) IMPLICATIONS: RSES (2)

Data science and/or ML will play some role in most projects

slide-66
SLIDE 66

IMPLICATIONS: RSES (2) IMPLICATIONS: RSES (2)

Data science and/or ML will play some role in most projects Kirstie Whitaker at al: The Turing Way - A handbook for reproducible data science

slide-67
SLIDE 67

IMPLICATIONS: RSES (2) IMPLICATIONS: RSES (2)

Data science and/or ML will play some role in most projects Kirstie Whitaker at al: Imperial College London/Coursera: The Turing Way - A handbook for reproducible data science Mathematics for Machine Learning

slide-68
SLIDE 68

IMPLICATIONS: RSES (2) IMPLICATIONS: RSES (2)

Data science and/or ML will play some role in most projects Kirstie Whitaker at al: Imperial College London/Coursera: Microso Research: The Turing Way - A handbook for reproducible data science Mathematics for Machine Learning Soware Engineering for Machine Learning

slide-69
SLIDE 69

IMPLICATIONS: RSES (2) IMPLICATIONS: RSES (2)

Data science and/or ML will play some role in most projects Kirstie Whitaker at al: Imperial College London/Coursera: Microso Research: CPU/GPU/TPU, serverless, cloud The Turing Way - A handbook for reproducible data science Mathematics for Machine Learning Soware Engineering for Machine Learning

slide-70
SLIDE 70

RESEARCH ENGINEERING RESEARCH ENGINEERING

James Hetherington, we have unified our Research Data Scientist and Research Soware Engineer roles to a common JD … it’s all a spectrum. 22 February 2019

slide-71
SLIDE 71

IMPLICATIONS: RSES (3) IMPLICATIONS: RSES (3)

slide-72
SLIDE 72

IMPLICATIONS: RSES (3) IMPLICATIONS: RSES (3)

Notebooks, executable articles/code, UI frameworks

slide-73
SLIDE 73

IMPLICATIONS: RSES (3) IMPLICATIONS: RSES (3)

Notebooks, executable articles/code, UI frameworks Containers (Docker, Singularity?)

slide-74
SLIDE 74

IMPLICATIONS: RSES (3) IMPLICATIONS: RSES (3)

Notebooks, executable articles/code, UI frameworks Containers (Docker, Singularity?) Automated QA, CI

slide-75
SLIDE 75

IMPLICATIONS: RSES (3) IMPLICATIONS: RSES (3)

Notebooks, executable articles/code, UI frameworks Containers (Docker, Singularity?) Automated QA, CI Mozilla Iodide

slide-76
SLIDE 76

IMPLICATIONS: RSES (3) IMPLICATIONS: RSES (3)

Notebooks, executable articles/code, UI frameworks Containers (Docker, Singularity?) Automated QA, CI Mozilla eLife Iodide reproducible documents

slide-77
SLIDE 77

IMPLICATIONS: RSES (3) IMPLICATIONS: RSES (3)

Notebooks, executable articles/code, UI frameworks Containers (Docker, Singularity?) Automated QA, CI Mozilla eLife Diego Alonso Álvarez: GUIs for Python (UKRSE19) Iodide reproducible documents

slide-78
SLIDE 78

WEBASSEMBLY WEBASSEMBLY

Solomon Hykes, If WASM+WASI existed in 2008, we wouldn’t have needed to created Docker. That’s how important it

  • is. Webassembly on the server is the future of

computing. 27 March 2019

slide-79
SLIDE 79

IMPLICATIONS: INSTITUTIONS IMPLICATIONS: INSTITUTIONS

slide-80
SLIDE 80

IMPLICATIONS: INSTITUTIONS IMPLICATIONS: INSTITUTIONS

Foster networks

slide-81
SLIDE 81

IMPLICATIONS: INSTITUTIONS IMPLICATIONS: INSTITUTIONS

Foster networks Jeremy Cohen: Building Research Soware Communities (deRSE19)

slide-82
SLIDE 82

IMPLICATIONS: INSTITUTIONS IMPLICATIONS: INSTITUTIONS

Foster networks Jeremy Cohen: Building Research Soware Communities (deRSE19) Provide career paths (and benefits!)

slide-83
SLIDE 83

IMPLICATIONS: INSTITUTIONS IMPLICATIONS: INSTITUTIONS

Foster networks Jeremy Cohen: Building Research Soware Communities (deRSE19) Provide career paths (and benefits!) James Smithies: King’s Digital Lab Career Development

slide-84
SLIDE 84

IMPLICATIONS: INSTITUTIONS IMPLICATIONS: INSTITUTIONS

Foster networks Jeremy Cohen: Building Research Soware Communities (deRSE19) Provide career paths (and benefits!) James Smithies: Recruitment challenges likely to limit growth King’s Digital Lab Career Development

slide-85
SLIDE 85

IMPLICATIONS: INSTITUTIONS IMPLICATIONS: INSTITUTIONS

Foster networks Jeremy Cohen: Building Research Soware Communities (deRSE19) Provide career paths (and benefits!) James Smithies: Recruitment challenges likely to limit growth Provide training (early-career, knowledge gaps) King’s Digital Lab Career Development

slide-86
SLIDE 86

POLICY CONCLUSION (1) POLICY CONCLUSION (1)

European Commission Open Science Monitor: Universities should also be encouraged to create more research soware groups. Recognising the Importance of Soware in Research – Research Soware Engineers (RSEs), a UK Example

slide-87
SLIDE 87

IMPLICATIONS: FUNDERS IMPLICATIONS: FUNDERS

slide-88
SLIDE 88

IMPLICATIONS: FUNDERS IMPLICATIONS: FUNDERS

Expect RSE involvement (and diversity)

slide-89
SLIDE 89

IMPLICATIONS: FUNDERS IMPLICATIONS: FUNDERS

Expect RSE involvement (and diversity) Demand soware management plans

slide-90
SLIDE 90

IMPLICATIONS: FUNDERS IMPLICATIONS: FUNDERS

Expect RSE involvement (and diversity) Demand soware management plans Acknowledge challenges of sustainability

slide-91
SLIDE 91

IMPLICATIONS: FUNDERS IMPLICATIONS: FUNDERS

Expect RSE involvement (and diversity) Demand soware management plans Acknowledge challenges of sustainability Mandate reproducible results

slide-92
SLIDE 92

IMPLICATIONS: FUNDERS IMPLICATIONS: FUNDERS

Expect RSE involvement (and diversity) Demand soware management plans Acknowledge challenges of sustainability Mandate reproducible results Provide more fellowships, infrastructure…

slide-93
SLIDE 93

IMPLICATIONS: FUNDERS IMPLICATIONS: FUNDERS

Expect RSE involvement (and diversity) Demand soware management plans Acknowledge challenges of sustainability Mandate reproducible results Provide more fellowships, infrastructure… SSI: Aspiring RSE Leaders Workshop 2019

slide-94
SLIDE 94

POLICY CONCLUSION (2) POLICY CONCLUSION (2)

European Commission Open Science Monitor: Funding bodies should include RSEs in the preparation and execution of funding calls Recognising the Importance of Soware in Research – Research Soware Engineers (RSEs), a UK Example

slide-95
SLIDE 95

Developers… …with the lowest job satisfaction include academic researchers, educators, scientists …who work with data … are high earners for their level of experience, while academic researchers and educators are paid less …working in academia and data scientists are looking for work at higher proportions

STACK OVERFLOW DEVELOPER STACK OVERFLOW DEVELOPER SURVEY 2019 SURVEY 2019

slide-96
SLIDE 96

POLICY CONCLUSION (3) POLICY CONCLUSION (3)

European Commission Open Science Monitor: a drastic change in the way researchers are incentivised needs to be implemented Recognising the Importance of Soware in Research – Research Soware Engineers (RSEs), a UK Example

slide-97
SLIDE 97

CONCLUSIONS CONCLUSIONS

Optimistic opinion: we are approaching the end of the beginning for RSE Next: Embrace emerging demands and

  • pportunities to truly accelerate research

Suitably equipped RSEs will play an essential role in digital (i.e. soware- and data-driven) science

slide-98
SLIDE 98

QUESTIONS? QUESTIONS?

(CC BY 4.0) Many thanks to the RSE Team and Jeremy Cohen at Imperial College for their help with preparing this talk m.woodbridge@imperial.ac.uk mwoodbri.github.io/deRSE19/RSE2.0