Building dev tools at the right level of abstraction Ben Davis CTO - - PowerPoint PPT Presentation

SLIDE 1

Building dev tools at the right level of abstraction

Ben Davis CTO

@BenCDavis · ben@gatherdata.co

SLIDE 2

The data engineering industry is very fragmented.

SLIDE 3

Gather is a data integration tool for developers. It makes it really easy to build integration pipelines that push and pull data from various SaaS APIs.

SLIDE 4

Gather is a data integration tool for developers. It makes it really easy to build integration pipelines that push and pull data from various SaaS APIs.

Your App

SLIDE 5

But how did we get here?

SLIDE 6

Initial motivating problem: building data pipelines is hard. We should fix it.

SLIDE 7

Data pipelining is conceptual. It breaks down into many use-cases.

SLIDE 8

Data pipelining is conceptual. It breaks down into many use-cases.

  • Batch
  • Streaming
  • ETL
  • …

SLIDE 9

People will ask “Can I use it for this?” or “Oh, I can use it like this, right?”

SLIDE 10

No framework to answer those questions

SLIDE 11

“Abstraction is amplification of the essential and elimination of the irrelevant.”

– K.K. Aggarwal

SLIDE 12

Building companies and products requires choosing a set of abstractions

SLIDE 13

The question is: which use-cases are you abstracting away in your product, and how many are there?

SLIDE 14

Inspired by Cheng Lou (Facebook)

SLIDE 15

EC2

Inspired by Cheng Lou (Facebook)

SLIDE 16

EC2 HEROKU

Inspired by Cheng Lou (Facebook)

SLIDE 17

[Diagram labels: "Abstraction level" axis between "Concrete use-case" and "Multiple use-cases"; two regions marked "No man's land"]

SLIDE 18

[Tree diagram, node labels:] DATA PIPELINES · DATA COLLECTION AND INTEGRATION · BATCH PROCESSING · SAAS & API INTEGRATION · CUSTOMER SERVICE SYNC · PAYMENT DATA · ETC · RECSYS · REPORTS · DATA PREP · CLICKSTREAM · IOT SENSORS

SLIDE 19

Value prop

  • No deployment required from the user
  • No writing API adapters and glue code
  • Off-the-shelf connectors
  • Pre-built authentication
  • No writing tests or worrying about fragile code

SLIDE 20

The product should abstract away the complexities of those specific use-cases while remaining flexible and expressive.
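One way to picture this trade-off is a small connector interface: the connector hides the API-specific details, while the transform stays an ordinary function the developer controls. The sketch below is only an illustration. `Connector`, `InMemoryConnector` and `run_pipeline` are hypothetical names invented here, not Gather's API, and the in-memory connector stands in for a real SaaS adapter.

```python
from typing import Callable, Iterable, List, Optional, Protocol

Record = dict  # one row of data moving through a pipeline


class Connector(Protocol):
    """Hypothetical interface: what every off-the-shelf connector would expose."""

    def pull(self) -> Iterable[Record]: ...

    def push(self, records: Iterable[Record]) -> int: ...


class InMemoryConnector:
    """Stand-in for a real SaaS adapter; keeps records in a list."""

    def __init__(self, records: Optional[List[Record]] = None) -> None:
        self.records: List[Record] = list(records or [])

    def pull(self) -> Iterable[Record]:
        # Return a copy so callers cannot mutate the stored records.
        return list(self.records)

    def push(self, records: Iterable[Record]) -> int:
        batch = list(records)
        self.records.extend(batch)
        return len(batch)  # number of records written


def run_pipeline(
    source: Connector,
    sink: Connector,
    transform: Callable[[Record], Record] = lambda record: record,
) -> int:
    """Pull from the source, apply the user's transform, push to the sink."""
    return sink.push(transform(record) for record in source.pull())
```

Under these assumptions, the glue code and authentication would live inside each connector, and the only code the user writes is the `transform` function passed to `run_pipeline`.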

SLIDE 21

Options for the product

SLIDE 22

Options for the product

  • UI for specific use-cases

SLIDE 23

Options for the product

  • UI for specific use-cases
  • Python SDK

SLIDE 24

Options for the product

  • UI for specific use-cases
  • Python SDK
  • Kubernetes-like declarative data flow?

SLIDE 25

Kubernetes is the right inspiration because it operates at the same level of abstraction
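To make "Kubernetes-like declarative data flow" concrete, here is a minimal sketch of the idea: the user declares the steps and their dependencies, and the tool works out the execution order. The spec format, the step names and the `plan` function are all assumptions made up for illustration, not Gather's actual format. Ordering uses `graphlib.TopologicalSorter` from the Python standard library (3.9+).

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical declarative spec: each step states what it uses and what it
# depends on, and says nothing about how or when it is executed.
SPEC = {
    "kind": "DataFlow",
    "steps": {
        "pull_payments": {"uses": "payments_api", "after": []},
        "pull_tickets": {"uses": "support_api", "after": []},
        "join": {"uses": "merge", "after": ["pull_payments", "pull_tickets"]},
        "push_to_app": {"uses": "your_app", "after": ["join"]},
    },
}


def plan(spec):
    """Resolve a declarative spec into a valid execution order."""
    steps = spec["steps"]
    # Reject specs that reference steps which were never declared.
    for name, step in steps.items():
        for dependency in step["after"]:
            if dependency not in steps:
                raise ValueError(
                    f"step {name!r} depends on unknown step {dependency!r}"
                )
    # Map each step to the set of steps that must finish before it.
    graph = {name: set(step["after"]) for name, step in steps.items()}
    # static_order() yields every step after all of its dependencies
    # and raises graphlib.CycleError if the spec contains a cycle.
    return list(TopologicalSorter(graph).static_order())
```

Calling `plan(SPEC)` returns the four step names with both pull steps before `join` and `push_to_app` last. The relative order of the two pull steps is left to the sorter, which is the point of the declarative style: the user describes the desired flow and the tool decides how to run it.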

SLIDE 26
SLIDE 27
SLIDE 28

Conclusion

  • Starting at too high a level of abstraction
  • Building the tree is hard
  • Building a product that is misaligned with where you’ve positioned yourself on that tree

SLIDE 29

ben@gatherdata.co

TALK TO ME. PLEASE

THANKS FOR LISTENING