From batch to streaming to both




  1. From batch to streaming to both. Herman Schaaf, Senior Software Engineer

  2. A Story

  3. About me: Herman Schaaf, Senior Software Engineer, Data Platform Tribe

  4. “The Cube”

  5. From batch to streaming

  6. The Single Unified Log

  7. The Single Unified Log

  8. Lesson 1: Conway’s Law is true for data platforms. “Organizations which design data platforms are constrained to produce designs which are copies of their communication structures”

  9. Being self-serve is good …but then metadata is critical

  10. So let’s talk about metadata

  11. A simple convention
      • prod.identity-service.AuditLog.identity.AuditMessage
      • prod.flyingcircus.applog.applog.Message
      • prod.raccoon_bandit.experiment.bandit.Metric
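
A convention like this makes stream names machine-parseable. The sketch below splits a name into its five dot-separated parts. The slide does not label the parts, so the field names used here (`environment`, `service`, `stream`, `schema_package`, `message`) are guesses for illustration only, not definitions from the talk.

```python
from typing import NamedTuple


class TopicName(NamedTuple):
    # Field names are assumptions; the slide shows only the example names.
    environment: str
    service: str
    stream: str
    schema_package: str
    message: str


def parse_topic(name: str) -> TopicName:
    """Split a name into its five dot-separated parts, rejecting anything else."""
    parts = name.split(".")
    if len(parts) != 5 or not all(parts):
        raise ValueError(f"expected 5 non-empty dot-separated parts: {name!r}")
    return TopicName(*parts)


topic = parse_topic("prod.identity-service.AuditLog.identity.AuditMessage")
print(topic.service, topic.message)
```

Rejecting names that do not fit the pattern is a design choice in this sketch: it keeps whatever metadata is encoded in the name reliable for every stream that passes the check.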

  12. Descriptive: we had some of this. Structural: some, from using protobuf schemas. Administrative: nope.

  13. Lesson 2: Metadata is Critical
      • Especially relationships
      • Ideally automated
      • Ideally from the start
      • Tools like Schema Registry are a start, but not the full solution

  14. Lesson 3: Data Engineers Control the Plot Line

  15. business events

  16. From streaming to both

  17. Lesson 4: Repeatability is important
      • Streams have to choose between replays and accepting errors as permanent
      • Batch processing can be done again any time
      • Going straight to the archive in small batches gets the benefits of both.
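
One way to read the last bullet: process the archive in small, keyed batches and make each batch's output depend only on its input, so any batch can be re-run to correct an error. The following is a minimal in-memory sketch, not the pipeline described in the talk. The archive layout (events grouped by a window id), the sample events, `count_by_type`, and the `results` store are all hypothetical.

```python
from collections import Counter
from typing import Dict, List

# Hypothetical archive: immutable events grouped into small time windows.
Archive = Dict[str, List[dict]]


def count_by_type(events: List[dict]) -> Dict[str, int]:
    """Deterministic transform: the same input events give the same output."""
    return dict(Counter(e["type"] for e in events))


def run_batch(archive: Archive, window: str, results: Dict[str, Dict[str, int]]) -> None:
    """Process one window and overwrite (not append to) its previous result."""
    results[window] = count_by_type(archive[window])


# Made-up sample data for illustration.
archive = {
    "2019-01-01T10:00": [{"type": "search"}, {"type": "click"}],
    "2019-01-01T10:05": [{"type": "search"}],
}
results: Dict[str, Dict[str, int]] = {}
for window in archive:
    run_batch(archive, window, results)

# Re-running a window is safe: its result is replaced, not double-counted.
run_batch(archive, "2019-01-01T10:00", results)
```

Because results are keyed by window and overwritten, fixing a bug means re-running only the affected windows, whereas a consumer that appends to a running total would have to replay the stream or live with the error.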

  18. Key Takeaways
      • Conway’s Law is true for data platforms
      • Metadata is Critical
      • Data Engineers Control the Plot Line
      • Repeatability is important

  19. Thanks. Contact: if you have any questions regarding Skyscanner, please contact Herman Schaaf, herman.schaaf@skyscanner.net, @ironzeb
