Building Data Engineering Teams Wouter de Bie Engineering Director - - PowerPoint PPT Presentation

building data engineering teams
SMART_READER_LITE
LIVE PREVIEW

Building Data Engineering Teams Wouter de Bie Engineering Director - - PowerPoint PPT Presentation

Building Data Engineering Teams Wouter de Bie Engineering Director - Data Engineering Hi! So Wouter ... what are you going to talk about in the next 40 minutes? I just got coffee and breakfast and Im dying to listen to your talk! Tell


slide-1
SLIDE 1

Building Data Engineering Teams

Wouter de Bie Engineering Director - Data Engineering

slide-2
SLIDE 2

... what are you going to talk about in the next 40 minutes? I just got coffee and breakfast and I’m dying to listen to your talk! Tell me more!

Hi! So Wouter…

slide-3
SLIDE 3

– Data Engineering – Common patterns – Horizontal vs vertical – High performing teams

Let’s talk about how I’ve built data engineering teams!

slide-4
SLIDE 4

… but before you start, who are you? Why are *you* talking to us?

Alright Wouter, nice to meet you…

slide-5
SLIDE 5

Who am I?

slide-6
SLIDE 6

Wouter de Bie

  • Engineering Director at Datadog
  • Formerly Engineering Director at Spotify
  • Dutch, living in Brooklyn NY until next week
slide-7
SLIDE 7

… we’re here to talk Data and not about where you’re moving. But I can’t help asking the question.. Where to?

OK Wouter...

slide-8
SLIDE 8

New Orleans, Louisiana!

It’s the internet age! Datadog let’s me work remote! Mardi Gras!

slide-9
SLIDE 9

… let’s get to business. You’ve been at a few companies that have gone through hyper-growth. What did that mean ?

Ok, ok Wouter...

slide-10
SLIDE 10

– Spotify: 250 people when I started, 5500 when I left. – Datadog: 100% growth year

  • ver year.

Hyper-growth numbers!

slide-11
SLIDE 11

… apart from employee growth, did you have any scalability challenges?

But Wouter…

slide-12
SLIDE 12

Scale is a challenge!

slide-13
SLIDE 13

Datadog

  • Observability as a service.
  • Metrics, Traces and Logs.
  • Multiple clouds.
  • Billions of points a day.

Spotify

  • Music streaming service
  • Data about playback
  • 100’s of millions of plays a day
  • 100’s of millions of users

What scale?

slide-14
SLIDE 14

Data Engineering

slide-15
SLIDE 15

… you did a bunch of Data Engineering. Tell me about that. What did you do?

So Wouter...

slide-16
SLIDE 16

Data Engineering definition

Techtarget.com: “A data engineer is a worker whose primary job responsibilities involve preparing data for analytical or operational uses.” O’Reilly: “Build and maintain the organization’s data pipeline systems… Clean and wrangle data into a usable state” Datacamp.com: “Ensure clean, reliable, and performative access to data and databases.”

slide-17
SLIDE 17

Data engineering takes care of storing and processing large volumes of data, coming from different sources and makes it available for downstream applications (like ML, Analytics, Product, etc)

Wouter’s definition

slide-18
SLIDE 18

… these Data Engineers process Data and such, but who runs the systems that they use?

But Wouter...

slide-19
SLIDE 19

Data Engineering != Data Infrastructure Engineering

slide-20
SLIDE 20

Data Infrastructure Engineering build, maintains and operates systems that enable Data Engineers to do their work.

Wouter’s definition

slide-21
SLIDE 21

Data Engineering

  • Write Spark/MR/Beam/etc

pipelines

  • Understand concepts like sorting,

grouping, filtering, etc

  • Understand the domain
  • Understand scalability of the data

Data Infrastructure Engineering

  • Operate data processing systems

like YARN/Mesos/HDFS

  • Write tools that make life easier

for Data Engineers

  • Understand scalability of systems

Skills required

slide-22
SLIDE 22

Of course there is

  • verlap
slide-23
SLIDE 23

… you’ve been at a few companies? Any common patterns you have observed?

So Wouter…

slide-24
SLIDE 24

Common patterns

slide-25
SLIDE 25

– There is some interesting data somewhere – Small team uses the data to do multiple things with it – Small team operates infrastructure besides data engineering and analytics

Big Data Starts Small

slide-26
SLIDE 26

– Small team becomes a bottleneck – Team splits up into Analytics, Data Engineering and Data Infrastructure

Big Data Becomes Valuable

slide-27
SLIDE 27

Data Engineering becomes a competence rather than a team, since every domain needs Data Engineers

Big Data Becomes Part of Day-to-Day

slide-28
SLIDE 28

Single team focussed on Analytics, Data Engineering and Infrastructure Single team split up into Data Engineering, Data Infra Currently:

  • Data Infrastructure team
  • Data Engineering for

Historical Metrics team

  • Internal Analytics
  • Revenue Engineering
  • Data Science

Datadog’s Evolution

slide-29
SLIDE 29

Single team focussed on Analytics, Data Engineering and Infrastructure Single team split up into Analytics and Data Infrastructure. Currently:

  • Data Infrastructure org
  • Data Engineering is a

competence in many teams

  • Analytics is a competence

in many teams

  • Data Science is a

competence in many teams

Spotify’s Evolution

slide-30
SLIDE 30

… when Data Engineering becomes a competence that can be found in many teams, how do we ensure knowledge is shared?

But Wouter...

slide-31
SLIDE 31

Data Eng Guild Organizational structure that involves all data engineers. Focussed on knowledge sharing, but sometimes work on things together. Data University Curriculum focussed on teaching Data Engineering skills. Used for

  • n-boarding or training engineers

in the art of Data Engineering.

Two things I’ve done

slide-32
SLIDE 32

… so now I have a bunch of teams. How did you decide how to split them up? Any tips?

Right Wouter…

slide-33
SLIDE 33

Horizontal vs Vertical teams

slide-34
SLIDE 34

Vertical teams Narrow focus on one group of constituents or a single domain. E.g. Historical Metrics, Revenue Engineering, Recommendations. Horizontal teams Wide focus on providing data to many other (downstream) teams. (Infrastructure is horizontal by definition)

Horizontal vs vertical

slide-35
SLIDE 35

Horizontal / Vertical example

Data Infrastructure Foundational Data Analytics Revenue Metrics ML

slide-36
SLIDE 36

… but now I have all these teams building the same thing over and over again. Any thoughts?

I see Wouter...

slide-37
SLIDE 37

The role of infrastructure

Frontend Backend Data Analytics Data-Sci Client Infrastructure Backend Infrastructure Backend Infrastructure DS Infrastructure

slide-38
SLIDE 38

… so I’ve grown my org, split my teams, hired the right folks. Now how do I make sure they perform?

Allright, alright Wouter...

slide-39
SLIDE 39

High performing teams

slide-40
SLIDE 40

Autonomy Our desire to be self directed. It increases engagement over compliance. Mastery The urge to get better skills Purpose The desire to do something that has meaning and is important.

Autonomy, Mastery, Purpose

slide-41
SLIDE 41

… that makes sense. I see you read “Drive” by Daniel H. Pink. But how do I make sure my teams value Autonomy, Mastery and Purpose?

Yes, yes Wouter...

slide-42
SLIDE 42

– Specialized interview process for both Data Eng and Infra – Focus on skills and culture – Enthusiasm about the product is really important

Hiring is key!

slide-43
SLIDE 43

… I learned something and I’m feeling the

  • coffee. Can we wrap this up?

Alright man...

slide-44
SLIDE 44

Let’s wrap up!

slide-45
SLIDE 45

– Data Engineering != Data Infra – Teams and orgs evolve – Horizontal vs Vertical thinking – Build solid Infrastructure – Hiring is key!

Takeaways

slide-46
SLIDE 46

Lastly, a shameless plug

We’re hiring! Come see us at our booth!

slide-47
SLIDE 47

Thank you!

Come see me at office hours!

slide-48
SLIDE 48
slide-49
SLIDE 49
slide-50
SLIDE 50
slide-51
SLIDE 51
slide-52
SLIDE 52

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Large text can be used to make a single important point.

slide-53
SLIDE 53

Standard slide

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

slide-54
SLIDE 54

Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros. Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros.

Slide with two points

slide-55
SLIDE 55

This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text.

Slide with three points

slide-56
SLIDE 56

Identify and convince great people to join Great = able + driven + grown-up Hiring is what makes/breaks us Expect to spend 10-20% of your time on hiring

Slide with four points

slide-57
SLIDE 57

Big Statement

slide-58
SLIDE 58

Geometric top-line growth

1 2 3

Strong product-market fit Financially efficient go-to-market

slide-59
SLIDE 59

Cras mauris purus, varius in justo in, varius vulputate eros.

Secondary text, if needed

Curabitur pharetra dictum mattis curabitur dictum.

Secondary text, if needed

Proin iaculis nulla turpis, a fermentum

  • rci mattis in.

Secondary text, if needed

Use iconography to support content

slide-60
SLIDE 60

Metrics Traces Logs

The Pillars of Observability

slide-61
SLIDE 61

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

White background slide

slide-62
SLIDE 62

Text+image slide

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Image placeholder Images are to be anchored to the right side of the slide and centered vertically.

slide-63
SLIDE 63

Full-screen images should fill the page. If a border is needed, use matching section

  • color. See next slide

for example.

slide-64
SLIDE 64
slide-65
SLIDE 65
slide-66
SLIDE 66

Section divider 2

slide-67
SLIDE 67

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Large text can be used to make a single important point.

slide-68
SLIDE 68

Standard slide

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

slide-69
SLIDE 69

Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros. Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros.

Slide with two points

slide-70
SLIDE 70

This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text.

Slide with three points

slide-71
SLIDE 71

Identify and convince great people to join Great = able + driven + grown-up Hiring is what makes/breaks us Expect to spend 10-20% of your time on hiring

Slide with four points

slide-72
SLIDE 72

Big Statement

slide-73
SLIDE 73

Geometric top-line growth

1 2 3

Strong product-market fit Financially efficient go-to-market

slide-74
SLIDE 74

Cras mauris purus, varius in justo in, varius vulputate eros.

Secondary text, if needed

Curabitur pharetra dictum mattis curabitur dictum.

Secondary text, if needed

Proin iaculis nulla turpis, a fermentum

  • rci mattis in.

Secondary text, if needed

Use iconography to support content

slide-75
SLIDE 75

Metrics Traces Logs

The Pillars of Observability

slide-76
SLIDE 76

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

White background slide

slide-77
SLIDE 77

Text+image slide

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Image placeholder Images are to be anchored to the right side of the slide and centered vertically.

slide-78
SLIDE 78

Full-screen images should fill the page. If a border is needed, use matching section

  • color. See next slide

for example.

slide-79
SLIDE 79
slide-80
SLIDE 80

Section divider 3

slide-81
SLIDE 81

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Large text can be used to make a single important point.

slide-82
SLIDE 82

Standard slide

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

slide-83
SLIDE 83

Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros. Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros.

Slide with two points

slide-84
SLIDE 84

This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text.

Slide with three points

slide-85
SLIDE 85

Identify and convince great people to join Great = able + driven + grown-up Hiring is what makes/breaks us Expect to spend 10-20% of your time on hiring

Slide with four points

slide-86
SLIDE 86

Big Statement

slide-87
SLIDE 87

Geometric top-line growth

1 2 3

Strong product-market fit Financially efficient go-to-market

slide-88
SLIDE 88

Cras mauris purus, varius in justo in, varius vulputate eros.

Secondary text, if needed

Curabitur pharetra dictum mattis curabitur dictum.

Secondary text, if needed

Proin iaculis nulla turpis, a fermentum

  • rci mattis in.

Secondary text, if needed

Use iconography to support content

slide-89
SLIDE 89

Metrics Traces Logs

The Pillars of Observability

slide-90
SLIDE 90

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

White background slide

slide-91
SLIDE 91

Text+image slide

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Image placeholder Images are to be anchored to the right side of the slide and centered vertically.

slide-92
SLIDE 92

Full-screen images should fill the page. If a border is needed, use matching section

  • color. See next slide

for example.

slide-93
SLIDE 93
slide-94
SLIDE 94

Section divider 4

slide-95
SLIDE 95

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Large text can be used to make a single important point.

slide-96
SLIDE 96

Standard slide

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

slide-97
SLIDE 97

Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros. Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros.

Slide with two points

slide-98
SLIDE 98

This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text.

Slide with three points

slide-99
SLIDE 99

Identify and convince great people to join Great = able + driven + grown-up Hiring is what makes/breaks us Expect to spend 10-20% of your time on hiring

Slide with four points

slide-100
SLIDE 100

Big Statement

slide-101
SLIDE 101

Geometric top-line growth

1 2 3

Strong product-market fit Financially efficient go-to-market

slide-102
SLIDE 102

Cras mauris purus, varius in justo in, varius vulputate eros.

Secondary text, if needed

Curabitur pharetra dictum mattis curabitur dictum.

Secondary text, if needed

Proin iaculis nulla turpis, a fermentum

  • rci mattis in.

Secondary text, if needed

Use iconography to support content

slide-103
SLIDE 103

Metrics Traces Logs

The Pillars of Observability

slide-104
SLIDE 104

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

White background slide

slide-105
SLIDE 105

Text+image slide

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Image placeholder Images are to be anchored to the right side of the slide and centered vertically.

slide-106
SLIDE 106

Full-screen images should fill the page. If a border is needed, use matching section

  • color. See next slide

for example.

slide-107
SLIDE 107
slide-108
SLIDE 108

Section divider 5

slide-109
SLIDE 109

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Large text can be used to make a single important point.

slide-110
SLIDE 110

Standard slide

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

slide-111
SLIDE 111

Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros. Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros.

Slide with two points

slide-112
SLIDE 112

This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text.

Slide with three points

slide-113
SLIDE 113

Identify and convince great people to join Great = able + driven + grown-up Hiring is what makes/breaks us Expect to spend 10-20% of your time on hiring

Slide with four points

slide-114
SLIDE 114

Big Statement

slide-115
SLIDE 115

Geometric top-line growth

1 2 3

Strong product-market fit Financially efficient go-to-market

slide-116
SLIDE 116

Cras mauris purus, varius in justo in, varius vulputate eros.

Secondary text, if needed

Curabitur pharetra dictum mattis curabitur dictum.

Secondary text, if needed

Proin iaculis nulla turpis, a fermentum

  • rci mattis in.

Secondary text, if needed

Use iconography to support content

slide-117
SLIDE 117

Metrics Traces Logs

The Pillars of Observability

slide-118
SLIDE 118

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

White background slide

slide-119
SLIDE 119

Text+image slide

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Image placeholder Images are to be anchored to the right side of the slide and centered vertically.

slide-120
SLIDE 120

Full-screen images should fill the page. If a border is needed, use matching section

  • color. See next slide

for example.

slide-121
SLIDE 121
slide-122
SLIDE 122

Section divider 6

slide-123
SLIDE 123

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Large text can be used to make a single important point.

slide-124
SLIDE 124

Standard slide

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

slide-125
SLIDE 125

Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros. Cras mauris purus, varius in justo, varius vulputate eros. Cras mauris purus, varius in justo in, varius vulputate eros.

Slide with two points

slide-126
SLIDE 126

This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text. This row should be visually centered on the page. Adjust vertical position of the entire row together depending on height of text.

Slide with three points

slide-127
SLIDE 127

Identify and convince great people to join Great = able + driven + grown-up Hiring is what makes/breaks us Expect to spend 10-20% of your time on hiring

Slide with four points

slide-128
SLIDE 128

Big Statement

slide-129
SLIDE 129

Geometric top-line growth

1 2 3

Strong product-market fit Financially efficient go-to-market

slide-130
SLIDE 130

Cras mauris purus, varius in justo in, varius vulputate eros.

Secondary text, if needed

Curabitur pharetra dictum mattis curabitur dictum.

Secondary text, if needed

Proin iaculis nulla turpis, a fermentum

  • rci mattis in.

Secondary text, if needed

Use iconography to support content

slide-131
SLIDE 131

Metrics Traces Logs

The Pillars of Observability

slide-132
SLIDE 132

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec aliquam odio iaculis eros fringilla ultricies. Donec placerat ac lectus nec pellentesque.

White background slide

slide-133
SLIDE 133

Text+image slide

– Supporting text can go here if needed. – Avoid cramming slides full of text. – Consider breaking dense content across several slides.

Image placeholder Images are to be anchored to the right side of the slide and centered vertically.

slide-134
SLIDE 134

Full-screen images should fill the page. If a border is needed, use matching section

  • color. See next slide

for example.

slide-135
SLIDE 135
slide-136
SLIDE 136

Thank you