an open source streaming machine learning and real time
play

An Open-Source Streaming Machine Learning and Real-Time Analytics - PowerPoint PPT Presentation

An Open-Source Streaming Machine Learning and Real-Time Analytics Architecture Using an IoT example (incubating) (incubating) Fred Melo William Markito @fredmelo_br @william_markito 1 Traditional Data Analytics - Limitations Store


  1. An Open-Source Streaming Machine Learning and Real-Time Analytics Architecture Using an IoT example (incubating) (incubating) Fred Melo William Markito @fredmelo_br @william_markito 1

  2. Traditional Data Analytics - Limitations Store Analyti cs Data Lake HDFS No real-time Hard to change information Labor intensive ETL based Inefficient Data-source specific 2

  3. Stream-based, Real-Time Closed-Loop Analytics In-Memory Real- Data Stream Pipeline Time Data Expert System / Data Lake HDFS Machine Learning Continuous Learning Multiple Data Continuous Sources Improvement Real-Time Processing Continuous Adapting Store Everything 3

  4. A Streaming Machine Learning for IoT Example Predictive Maintenance Scenario Evaluates LIVE DATA “ According to historical Real-Time trends, there’s an 80% chance this equipment would fail in the next 12 Sensor Data hours" Live data becomes historical over time Smart System Learns with HISTORICAL TRENDS Historical " How were the temperature and vibration sensors reading when the latest failures happened? " 4

  5. Streaming Machine Learning Info Look at past trends Machine Learning (for similar input) Evaluate current input Analysis Score / Predict 5

  6. Streaming Machine Learning Info Filter Machine Learning Analysis [ json ] 6

  7. Streaming Machine Learning Info Filter Enrich Machine Learning Analysis 7

  8. Streaming Machine Learning Info Filter Enrich Transform Machine Learning Analysis 8

  9. Streaming Machine Learning Info ML Model Filter Enrich Transform Analysis 9

  10. Streaming Machine Learning Info ML Model Filter Enrich Transform Analysis Transform 10

  11. Streaming Machine Learning ML Model In-Memory Data Grid Update Push Front-end 11

  12. Streaming Machine Learning Supervised Learning Example Neural Network Real-time In-Memory Data Grid scoring Train 12

  13. A Streaming Machine Learning Reference Architecture Other Sources and Destinations Distributed Computing JMS Fast Data Sink Ingest Transform SpringXD Store / Analyze Predict / Machine Learning 13

  14. Indoors Localization - Applied Example 14

  15. Trilateration and its limitations Noisy Data Physical Barriers Large Overlap Areas Moving Targets Innacuracy Large Overlap Areas 15

  16. Particle Filters - Calculating the optimum solution 16

  17. Particle Filters - Calculating the optimum solution 17

  18. The Solution 1. Capture signal strength 2. Calculate distance from antenna 3. Trilaterate different sensors to predict location in real-time 4. Show on a map with live updates 18

  19. Architecture Overview Calculate Device Predict Distance Location Groovy Spring Boot + Distance JSON HTTP Transform Sink Ingest SpringXD GUI Application Platform 19

  20. Geode Basic Concepts • Cache • Configurable through XML, ,Java • Region • Distributed j.u.Map on steroids • Highly available, redundant • Member • Locator, Server, Client • Callbacks • Listener, Writer, AsyncEventListener, Parallel/Serial 20

  21. Introduction to SpringXD Runs as a distributed application or as a single node 21

  22. Spring XD A stream is composed from modules . Each module is deployed to a container and its channels are bound to the transport . 22

  23. Demo

  24. Why have we selected those projects In-memory & Persistent • Iterative & Exploratory Productivity • • Highly Consistent model • Built-in connectors • Extreme transaction Web based REPL • • Cloud Agnostic • processing Multiple Interpreters • Highly Scalable • Thousands of concurrent • Apache Geode • Easy to setup • clients Apache Spark • Streams without coding • Markdown Reliable event model • • Flink • Python… • 24

  25. Source code and detailed instructions available at: https://github.com/Pivotal-Open-Source-Hub/WifiAnalyticsIoT Follow us on GitHub! Fred Melo William Markito @fredmelo_br @william_markito 25 25

  26. Implementing a Highly Scalable In-Memory Stock Prediction System with Apache Geode (incubating), R and Spring XD Room: Tohotom - 14:30, Sep 30 
 Fred Melo, Pivotal, William Markito, Pivotal Fred Melo William Markito @fredmelo_br @william_markito 26 26

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend