Converge Transactional and Predictive Analytics to Effectively - - PowerPoint PPT Presentation

converge transactional and predictive analytics to
SMART_READER_LITE
LIVE PREVIEW

Converge Transactional and Predictive Analytics to Effectively - - PowerPoint PPT Presentation

Converge Transactional and Predictive Analytics to Effectively Scale IoT 2018 11 billion Is the number of connected devices Not including phones and computers How can we leverage this information for context-sensitive insights that


slide-1
SLIDE 1

Converge Transactional and Predictive Analytics to Effectively Scale IoT

2018

slide-2
SLIDE 2

Is the number of connected devices…

11 billion

How can we leverage this information for context-sensitive insights that answer specific questions about specific things at the right point in time?

Not including phones and computers

slide-3
SLIDE 3

We provide the leading in-memory computing platforms for real-time insight to action and extreme transactional processing. With GigaSpaces, enterprises can operationalize machine learning and transactional processing to gain real-time insights on their data and act upon them in the moment.

About GigaSpaces

Direct customers

300+

Fortune / Organizations

50+ / 500+

Large installations in production (OEM)

5,000+

ISVs

25+

InsightEdge is an in-memory real- time analytics platform for instant insights to action; analyzing data as it's born, enriching it with historical context, for smarter, faster decisions In-Memory Computing Platform for microsecond scale transactional processing, data scalability, and powerful event-driven workflows

slide-4
SLIDE 4

Our Customers Span Across Multiple Industries

INSURANCE FINANCIAL SERVICES RETAIL TRANSPORTATION TELCO

slide-5
SLIDE 5

Select Customers OEMs / ISVs / Partners

slide-6
SLIDE 6

74%

want to be data driven

  • nly 23%

are successful,

slide-7
SLIDE 7

Re Real al-ti time me data ta is highly valuable if you act on it on time Ol Old + real al-ti time me data ta is more valuable if you have the means to combine them

Value of Data to Decision

REAL-TIME SECONDS MINUTES HOURS DAYS MONTHS Actionab le Reactive Historical

Ti Time-cri critical cal dec decis isio ion Tra Traditional “batch” bu busin ines ess in intel ellig igen ence

Preventive/ Predictive Actionable Reactive Historical

Time Value

slide-8
SLIDE 8

RE REAL-TI TIME

Real-Time Applications

Sma Smarte test st De Deci cision

  • ns

HI HISTO TORICA CAL L DA DATA VARIOUS SOURCES

slide-9
SLIDE 9

Big Data FROM: TO:

WHY ARE ONLY

23%

SUCCESSFUL?

Insights Real-Time Insights Real-Time Actions Real-Time Insights Insights

slide-10
SLIDE 10

Challenges of Internet of Things Application Enablement

Foundations of an IoT Analytics Platform

CENTER + EDGE LOW-LATENCY PROCESSING BATCH + REAL-TIME DATA CONVERGENCE ON DATA FROM VARIOUS SOURCES CLOSED LOOP ANALYTICS (INSIGHTS TRIGGER WORKFLOWS) MULTI-TENANT, GEO-FEDERATED, SCALE-OUT “Graphs and Topologies”, rather than “Layers and Tiers” From sensors to actuators = from insights to action (at low latency) IoT data ingestion is heterogeneous: streaming, micro- batch, and batch on multiple data types Focus on low latency and event- driven, rather than high-throughput

slide-11
SLIDE 11

InsightEdge: Unifying Real-Time Analytics, AI and Transactional Processing in One Open Source Platform

  • Open Source & Open API
  • Rich ML & DL support
  • Extreme performance
  • Fully Transactional
  • ACID Compliance
  • Enterprise-grade

(Security, High Availability)

  • Co-located Apps and Services
  • Seamless integration with Big Data

ecosystem

  • Data sources (Kafka/Nifi/Talend)
  • Data lakes (S3/Hadoop)
  • BI tools (Tableau/Looker/etc.)

Intelligent Multi-tier Storage Management

ORCHESTRATION

Machine Learning & Deep Learning

GEO SPATIAL COLUMNAR DOCUMENT STREAMING KEY-VALUE TABLE

STORAGE In-Memory Multi Model Store CLOUD/HYBRID/ ON-PREMISE

slide-12
SLIDE 12

ANALYTICS & BIG DATA APPS & MICROSERVICES

MICROSERVICES (REST) EVENT PROCESSING

RPC & MAP/REDUCE

.NET JAVA MICROSERVICES (REST) EVENT PROCESSING

WEB CONTAINERS RPC & MAP/REDUCE DATA MODELS (SPATIAL, POJO, JSON) EVENT PROCESSING

STREAMING

IN-MEMORY DATA GRID

RAM SSD STORAGE STORAGE-CLASS MEMORY DATA REPLICATION & PERSISTENCE

CLUSTER MANAGEMENT & SERVICE DISCOVERY

SEARCH, BI & QUERY

SECURITY AND AUDITING MANAGEMENT AND MONITORING REST ORCHESTRATION

SPARK SQL SQL/JDBC SEARCH

MOBILE WEB IOT

ON-PREMISE CLOUD HYBRID

InsightEdge Architecture Overview

BigDL

MACHINE LEARNING

slide-13
SLIDE 13

ANALYTICS & BIG DATA APPS & MICROSERVICES

MICROSERVICES (REST) EVENT PROCESSING

RPC & MAP/REDUCE

.NET JAVA MICROSERVICES (REST) EVENT PROCESSING

WEB CONTAINERS RPC & MAP/REDUCE DATA MODELS (SPATIAL, POJO, JSON) EVENT PROCESSING

STREAMING

IN-MEMORY DATA GRID

RAM SSD STORAGE STORAGE-CLASS MEMORY DATA REPLICATION & PERSISTENCE

CLUSTER MANAGEMENT & SERVICE DISCOVERY

SEARCH, BI & QUERY

SECURITY AND AUDITING MANAGEMENT AND MONITORING REST ORCHESTRATION

SPARK SQL MACHINE LEARNING SQL/JDBC SEARCH

MOBILE WEB IOT

ON-PREMISE CLOUD HYBRID

InsightEdge Unifying Fast Data Analytics, AI and Transactional Processing

Cloud Native Management, Orchestration, and Monitoring Analytics and AI SQL and BI Real-time Microservices In-Memory Data Grid Multi-Tiered Data Storage and Replication High Availability and Clustering

slide-14
SLIDE 14

MACHINE LEARNING

ANALYTICS & BIG DATA

STREAMING

CLUSTER MANAGEMENT & SERVICE DISCOVERY

SEARCH, BI & QUERY

SECURITY AND AUDITING MANAGENENT AND MONITORING MANAGENENT AND MONITORING

SPARKL SQL SQL/JDBC SEARCH

MOBILE WEB IOT

CLOUD HYBRID

Ultra-low latency and high throughput transactional processing IMDG

RPC & MAP/REDUCE WEB CONTAINERS RPC & MAP/REDUCE DATA MODELS (SPATIAL, POJO, JSON) EVENT PROCESSING

IN-MEMORY DATA GRID

RAM SSD STORAGE STORAGE-CLASS MEMORY DATA REPLICATION & PERSISTENCE

APPS & MICROSERVICES

MICROSERVICES (REST) EVENT PROCESSING .NET JAVA MICROSERVICES (REST) EVENT PROCESSING

Partitioned In-Memory Grid Shared-nothing, linear scalability, elastic capacity Co-Location of Data and Business Logic Co-located ops, event-driven, fast indexing Event-Driven Processing and Map/Reduce No Downtime Auto-healing, multi-data center replication, fault tolerance Fast Indexing Multi-Data Model POJO, .NET, Document/JSON, Geospatial, Time-series Seamless Integration wih Java/Scala ecosystem Cloud Native

ON-PREMISE

slide-15
SLIDE 15

CLUSTER MANAGEMENT & SERVICE DISCOVERY

SEARCH, BI & QUERY

SECURITY AND AUDITING MANAGENENT AND MONITORING MANAGENENT AND MONITORING

SQL/JDBC SEARCH

MOBILE WEB IOT

ON-PREMISE CLOUD HYBRID

Co-located Analytics and AI with Transactional Processing

RPC & MAP/REDUCE WEB CONTAINERS RPC & MAP/REDUCE DATA MODELS (SPATIAL, POJO, JSON) EVENT PROCESSING

IN-MEMORY DATA GRID

RAM SSD STORAGE STORAGE-CLASS MEMORY DATA REPLICATION & PERSISTENCE

APPS & MICROSERVICES

MICROSERVICES (REST) EVENT PROCESSING .NET JAVA MICROSERVICES (REST) EVENT PROCESSING

ANALYTICS & BIG DATA

STREAMING SPARK SQL MACHINE LEARNING

Spark for ML and leading DL frameworks Push-down predicate for ultra-low latency filter (30x faster) Shared RDDs/DataFrames Streaming with 99.999% availability Deep Learning with Intel BigDL Graph processing, text mining, geospatial

SEARCH, BI & QUERY

SQL/JDBC SEARCH

Distributed SQL-99 Real-time integration with Tableau and Business Intelligence tools JDBC driver

MACHINE LEARNING

slide-16
SLIDE 16

ANALYTICS & BIG DATA

STREAMING SPARKL SQL MACHINE LEARNING

SECURITY AND AUDITING MANAGENENT AND MONITORING

MOBILE WEB IOT

ON-PREMISE CLOUD HYBRID

High Availability & Clustering

RPC & MAP/REDUCE WEB CONTAINERS RPC & MAP/REDUCE DATA MODELS (SPATIAL, POJO, JSON) EVENT PROCESSING

IN-MEMORY DATA GRID

APPS & MICROSERVICES

MICROSERVICES (REST) EVENT PROCESSING .NET JAVA MICROSERVICES (REST) EVENT PROCESSING

SEARCH, BI & QUERY

SQL/JDBC SEARCH

RAM SSD STORAGE STORAGE-CLASS MEMORY DATA REPLICATION & PERSISTENCE

CLUSTER MANAGEMENT & SERVICE DISCOVERY

REST ORCHESTRATION ZooKeeper-based clustering for 1000s of nodes Back-up and auto-healing for each grid container N + 1 redundancy Unicast or Multicast discovery

slide-17
SLIDE 17

ANALYTICS & BIG DATA

STREAMING SPARKL SQL MACHINE LEARNING

CLUSTER MANAGEMENT & SERVICE DISCOVERY

SECURITY AND AUDITING MANAGENENT AND MONITORING

MOBILE WEB IOT

ON-PREMISE CLOUD HYBRID

Multi-Tiered Data Storage and Replication for Optimized TCO

RPC & MAP/REDUCE WEB CONTAINERS RPC & MAP/REDUCE DATA MODELS (SPATIAL, POJO, JSON) EVENT PROCESSING

IN-MEMORY DATA GRID

APPS & MICROSERVICES

MICROSERVICES (REST) EVENT PROCESSING .NET JAVA MICROSERVICES (REST) EVENT PROCESSING

SEARCH, BI & QUERY

SQL/JDBC SEARCH

RAM SSD STORAGE STORAGE-CLASS MEMORY DATA REPLICATION & PERSISTENCE

REST ORCHESTRATION In-Memory Data Processing (RAM) Intelligent Data Tiering between RAM, SSD and Storage-Class Memory such as Intel 3DXPoint - Optane SSD/NVMe and Optane DC Persistence

  • memory. Leverages RocksDB

Multi-Data Center Replication Asynchronous Persistence to SQL/NoSQL

* Apache Pass support in Q4 2018

slide-18
SLIDE 18

Effectively Scale IoT: Real-time Analytics for Instant Insights To Action

VARIOUS DATA SOURCES UNIFIED REAL-TIME ANALYTICS, AI & TRANSACTIONAL PROCESSING REAL-TIME LAYER IN-MEMORY MULTI MODEL STORE RAM STORAGE-CLASS MEMORY SSD STORAGE

HOT DATA WARM DATA

APPLICATION

REAL-TIME INSIGHT TO ACTION

DASHBOARDS

  • No ETL, reduced

complexity

  • Built-in integration with

external Hadoop/Data Lakes S3-like

  • Fast access to historical

data

  • Automated

life-cycle management

DEPLOY ANYWHERE CLOUD/ON-PREMISE

BATCH LAYER

COLD DATA
slide-19
SLIDE 19

BATCH LAYER SPEED LAYER

MANAGEMENT LAYER

CONTROL LAYER (Management, Orchestration, and Security)

APPLICATIONS

LAMBDA ARCHITECTURE IS COMPLICATED

STORAGE BATCH ANALYTICS

EMR

EVENT-DRIVEN ANALYTICS Serverless, e.g. AWS Lambda Kafka consumers Kinesis Enabled App

DATA SOURCES

FILES MESSAGE BUS DATABASES SOCIAL SENSOR DATA EVENTS

Capture

Events

CDC, Message Bus

Files

Public Cloud (AWS) Public Cloud (Azure) Private Cloud Public Cloud (GCP)

Azure Cosmos DB

Event Hubs Google Pub/Sub

STORAGE & CACHE

Trigger

DATA CAPTURE/ LAYER

slide-20
SLIDE 20

BATCH LAYER SPEED LAYER

MANAGEMENT LAYER

CONTROL LAYER (Management, Orchestration, and Security)

APPLICATIONS STORAGE BATCH ANALYTICS

EMR

EVENT-DRIVEN ANALYTICS Serverless, e.g. AWS Lambda Kafka consumers Kinesis Enabled App

DATA SOURCES

FILES MESSAGE BUS DATABASES SOCIAL SENSOR DATA EVENTS

Capture

Events

CDC, Message Bus

Public Cloud (AWS) Public Cloud (Azure) Private Cloud Public Cloud (GCP)

Azure Cosmos DB

Event Hubs Google Pub/Sub

STORAGE & CACHE

Trigger

DATA CAPTURE/ LAYER

Files Smart access to historical context

LAMBDA ARCHITECTURE MADE SIMPLE

slide-21
SLIDE 21

Events

DATA CAPTURE/ LAYER

BATCH LAYER SPEED LAYER

MANAGEMENT LAYER

CONTROL LAYER (Management, Orchestration, and Security)

APPLICATIONS

LAMBDA ARCHITECTURE MADE SIMPLE

STORAGE BATCH ANALYTICS

EMR

EVENT-DRIVEN ANALYTICS Serverless, e.g. AWS Lambda Kafka consumers Kinesis Enabled App

DATA SOURCES

FILES MESSAGE BUS DATABASES SOCIAL SENSOR DATA EVENTS

Capture

Events/Messages

CDC, Message Bus

Files

Public Cloud (AWS) Public Cloud (Azure) Private Cloud Public Cloud (GCP)

Azure Cosmos DB

Event Hubs Google Pub/Sub

STORAGE & CACHE

Trigger

Smart access to historical context

  • Extreme Performance
  • Mission Critical Applications
  • Microservices and Event-Driven

Architecture

  • Open-Source ML & DL frameworks
slide-22
SLIDE 22

DATA CAPTURE/ LAYER

BATCH LAYER SPEED LAYER

MANAGEMENT LAYER

CONTROL LAYER (Management, Orchestration, and Security)

APPLICATIONS STORAGE BATCH ANALYTICS

EMR

EVENT-DRIVEN ANALYTICS Serverless, e.g. AWS Lambda Kafka consumers Kinesis Enabled App

DATA SOURCES

FILES MESSAGE BUS DATABASES SOCIAL SENSOR DATA EVENTS

Capture

Events

CDC, Message Bus

Public Cloud (AWS) Public Cloud (Azure) Private Cloud Public Cloud (GCP)

Azure Cosmos DB

Event Hubs Google Pub/Sub

STORAGE & CACHE

Trigger

Files Smart access to historical context

  • No ETL, reduced complexity
  • Built-in integration with external

Hadoop/Data Lakes S3-like

  • Fast access to historical data
  • Automated life-cycle management

LAMBDA ARCHITECTURE MADE SIMPLE

slide-23
SLIDE 23

DATA CAPTURE/ LAYER

BATCH LAYER SPEED LAYER

MANAGEMENT LAYER

CONTROL LAYER (Management, Orchestration, and Security)

APPLICATIONS STORAGE BATCH ANALYTICS

EMR

EVENT-DRIVEN ANALYTICS Serverless, e.g. AWS Lambda Kafka consumers Kinesis Enabled App

DATA SOURCES

FILES MESSAGE BUS DATABASES SOCIAL SENSOR DATA EVENTS

Capture

Events

CDC, Message Bus

Public Cloud (AWS) Public Cloud (Azure) Private Cloud Public Cloud (GCP)

Azure Cosmos DB

Event Hubs Google Pub/Sub

STORAGE & CACHE

Trigger

Files Smart access to historical context

LAMBDA ARCHITECTURE MADE SIMPLE

  • Unifying access to hot and

historical data - faster time to market

  • Agile development
  • Easily deploy ML models in

production

  • Train ML models on continuously

updated production data

slide-24
SLIDE 24

“ ” “ ”

GigaSpaces is now focused on in-memory data processing… The combination of Spark and XAP will enable GigaSpaces to target the new breed of real-time analytics and hybrid operational and analytic workloads. InsightEdge contains all the necessary SQL, Spark, Streaming, and Deep Learning toolkits for scalable data-driven solutions… our preferred solution components: the three-tier Kappa model, including Spark and Kafka, as implemented by GigaSpaces, in combination with its commercial InsightEdge platform.

Everyone Wants “Real-time Analytic Insights” But Which Architecture Will Get You There?

slide-25
SLIDE 25

Predictive Maintenance for Leading Rail-Based Transportation Company CASE STUD UDY:

  • Predictive maintenance of equipment,

field data ingestion and stream processing

  • Ability to redirect trains in a timely manner

BU BUSINESS CHALLENGE:

  • Process streaming data at scale and query

from a live data mart

  • Event driven analytics and business logic
  • Many small low-volume streams that require

correlation and statefulness (the IoT streaming problem)

  • Real-time analytics leveraging GPS, train

sensor data with reference to historical data

TE TECHNICAL L CHALLE LLENGE GE:

  • Simplified big data pipeline
  • High performance stream processing with High

Availability

  • Real-time analytics on relevant data from train

events, fence events and GPS

  • Event-based triggers to direct the output to a
  • perational workflows and live dashboards for

timely maintenance and redirecting of fast moving trains in time

RE RESULTS: TRANSPORTATION

VARIOUS DATA SOURCES UNIFIED REAL-TIME ANALYTICS, AI & TRANSACTIONAL PROCESSING IN-MEMORY MULTI MODEL STORE APPLICATION

REAL-TIME INSIGH TO ACTION

DASHBOARDS

  • No ETL, reduced
complexity
  • Built-in integration
with external Hadoop/Data Lakes S3-like
  • Fast access to
historical data
  • Automated
life-cycle management

BATCH LAYER

DEPLOY ANYWHERE CLOUD/ON-PREMISE
slide-26
SLIDE 26

Magic Software US USE C CASE:

IOT

  • IoT Hub + Predictive Analytics

BU BUSINESS CHALLENGE:

  • Implement predictive analytics and anomaly detection
  • Expand insight context through customer/data-360

integration

  • Trigger transactional workflows based on prediction

criteria

TE TECHNICAL L CHALLE LLENGE GE

  • Simplified HTAP with Streaming data pipeline (3 tiers)
  • IoT streaming analytics with 9s high availability

RE RESULTS:

Yuval Lavi, Vice President of Innovation and Strategy, Magic Software.

“GigaSpaces enables our customers to simplify and accelerate telemetry ingestion, to gain full business value from IoT adoption.”

slide-27
SLIDE 27

MAGIC CLIP

slide-28
SLIDE 28

HI HISTORICA CAL DA DATA ST STREAMING , , RE REAL- TI TIME, BA BATCH RE REAL-TI TIME & EV EVEN ENT-DR DRIVEN AN ANAL ALYTICS Streaming Analytics (+ co-located apps & services) F a s t O p e r a t i

  • n

a l D a t a L a k e s ( u n s t r u c t u r e d + p

  • l

y g l

  • t

d a t a p r

  • c

e s s i n g ) Simplified Lambda Architecture (Real-time + Historical)

Faster, Smarter Insights and Actions

slide-29
SLIDE 29

EXTREME PERFORMANCE INSTANT INSIGHTS TO ACTION TCO OPTIMIZATION MISSION CRITICAL AVAILABILITY

  • f IOPS

sec from data to insight to action

less expensive than

  • nly RAM with

In-memory performance

<1 millions 10X YEARS

No Downtime at leading enterprise customers for And still counting

slide-30
SLIDE 30

THANK YOU

BUILD IT TRY IT