data at rest data in motion
play

Data At Rest Data In Motion! A Lambda Architecture Overview When - PowerPoint PPT Presentation

Data At Rest Data In Motion! A Lambda Architecture Overview When Things Go Wrong http://xkcd.com/327/ Fault T olerance !!!! Fault T olerance Developer Software Hardware Data Collection Three T ypes Of Data Streams Structured


  1. Data At Rest … Data In Motion! A Lambda Architecture Overview

  2. When Things Go Wrong http://xkcd.com/327/ Fault T olerance !!!!

  3. Fault T olerance Developer Software Hardware

  4. Data Collection Three T ypes Of Data Streams Structured (Databases ...) Semi Structured (JSON, XML, XAML ...) UnStructured (Blogs, E-Mails, Log Files ...)

  5. Lambda Architecture T o The Rescue !!

  6. Lambda Architecture — Requirements Fault-tolerant against both hardware failures and human errors Support variety of use cases that include low latency querying as well as updates Linear scale-out capabilities Extensible, so that the system is manageable and can accommodate newer features easily

  7. Lambda Architecture IMMUTABLE PRECOMPUTE MASTER DATA VIEWS BATCH RECOMPUTE View 1 View 2 View N NEW DATA QUERY MERGE STREAM View 1 View 2 View N PROCESS INCREMENT STREAM VIEWS REAL-TIME INCREMENT

  8. Lambda Architecture IMMUTABLE PRECOMPUTE BATCH LAYER MASTER DATA VIEWS BATCH RECOMPUTE SERVING LAYER View 1 View 2 View N NEW DATA QUERY MERGE STREAM View 1 View 2 View N PROCESS INCREMENT SPEED LAYER STREAM VIEWS REAL-TIME INCREMENT

  9. Lambda Architecture - Layers Batch Layer Managing the master data set, an immutable, append only set of raw data. Pre computing arbitrary query functions, called batch views.

  10. Lambda Architecture - Layers Serving Layer Indexes batch views so that they can be queried in ad hoc with low latency. Merges and reconciles batch and real time views.

  11. Lambda Architecture - Layers Speed Layer Accommodates all requests that are subject to low latency requirements. Using fast and incremental algorithms, deals with recent data only.

  12. Lambda Architecture - Reconciliation Data absorbed into Batch Views Not yet absorbed Time

  13. Lambda Architecture - Reconciliation Now Just a few hours of data Data absorbed into Batch Views Not yet absorbed Time

  14. Lambda Architecture - Immutable Data + Views Times Airpor Flight Action tamp t 2015- DUB EL123 take- 01- off 01T10: 00:0 2015- HEL SA45 take- 01- off 01T10: 05:0 2015- AMS BA99 take- 01- off 01T10: Immutable Master Dataset 07:0 2015- LHR LH17 landin 01- g

  15. Lambda Architecture - Immutable Data + Views Timestamp Airport Flight Action air borne: 2307 Map Reduce 2015-01- DUB EL123 take-off 01T10:00:0 air borne per airline: Map Reduce 2015-01- HEL SA45 take-off 01T10:05:0 Airline 2015-01- AMS BA99 take-off Map Reduce airport load: 01T10:07:0 Airport Planes 2015-01- LHR LH17 landing 01T10:09:0 2015-01- CDG AF03 landing SAS 01T10:10:0 Immutable Master Dataset 2015-01- FCO AZ501 take-off AMS 44 01T10:11:0 BA LHR 69

  16. Lambda Architecture - Implementation IMMUTABLE PRECOMPUTE MASTER DATA VIEWS BATCH RECOMPUTE View 1 View 2 View N NEW DATA QUERY MERGE STREAM View 1 View 2 View N PROCESS INCREMENT STREAM VIEWS REAL-TIME INCREMENT

  17. Lambda Architecture - Implementation Hadoop HDFS Apache Hive BATCH RECOMPUTE HBase HBase HBase NEW DATA ● Spark SQL ● R MERGE STREAM ● Presto Storm Storm Storm Bolt Bolt Bolt Apache Kafka Apache Spark REAL-TIME INCREMENT

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend