Huawei's story of leveraging GridGain as a distributed caching service
- n its public cloud environment
Paul Chen
Chief Architect,
Cloud Services Research and Development, Huawei Technologies Canada Lab
Huawei's story of leveraging GridGain as a distributed caching - - PowerPoint PPT Presentation
Huawei's story of leveraging GridGain as a distributed caching service on its public cloud environment Paul Chen Chief Architect, Cloud Services Research and Development, Huawei Technologies Canada Lab Agenda Huawei Public Cloud
Paul Chen
Chief Architect,
Cloud Services Research and Development, Huawei Technologies Canada Lab
2
4
Digital manufacturing Internet finance Smart City e-Government
860+ solution partners for business
innovation, and 2900+ service partners for E2E services including consultancy, deployment and O&M High-performance ECSs and BMSs guarantee cloudification of critical businesses. Atlas heterogeneous hardware, HPC, AI, and latest GPU and FPGA improve the computing capability. Customized CPU, NVMe SSD card, smart NIC, RDMA, InfiniBand network and security chipset
14 categories 100+services 60+ Solutions
Heterogeneous computing capacity supports artificial intelligent applications. Enterprise-class storage, DB, and data analysis services deeply dig into values of data. Software & Hardware Services Solutions Co-
Security: Anti-DDoS, WAF, and DBSS guarantee business security.
Generic-specific solution, to adapt to industry business and optimize services
Cloud Office Migration Cloud DR
Dedicated IT hosting
FCS
SAP On Cloud HPC IoT Web& Mobile Cloud Commun ication
…
Chip Server Storage Network Software
Computing Storage Network Security Management and deployment Database Application EI Development and testing Enterprise application Video Cloud communication IoT DevCloud
5
Management & Deployment
7
5 5
3
84NKP
Database Data Analysis IoT
5IKGL
Builder
0MCHA
FIN-3 ICFF 0+0 H
3
CKMIHHM 1
Network
1
Apps Enterprise Cloud Comm.
IBE FIN3CFCH FIN5FL 3KIDM0H IN FINFIP 7LM0H FIN- 0ICF7LM 01
Dev Cloud
FINNCF IFFIKMCIH KOCMANHMCIHMA
Application
I a a S P a a S BigData
System, Network, Storage, …
P a a S DevOps
S a a S
S
u t i
s
P a a S
Distributed Caching Services
categories services categories services categories services categoriesservices
VMs
T enant Resources
Bare-metal (x86/ARM)
Shared Resources
Horizontal Scale On-demand DMZ App developers
Resources are isolated per tenant
Apps
uses
App users
Caching Engines
Manage my caching instance
Caching Service Dashboard Caching Service Broker (manager)
Provision service instances
Resource Scheduling & Deployment PRV Caching service providers
8
, more flexible and more secure
, cache persistency and alert/notification
9
Side Cache HTTP Session Replication Change Data Capturing Write-through/Write- behind/Map-reduced SQL-like Query
10
Side Cache
HTTP Session Replication Change Data Capturing SQL-like Query
Cache engine
Websites
feeds)
Write-through/Write- behind/Map-reduced
11
Side Cache
HTTP Session Replication
Change Data Capturing Write-through/Write- behind/Map-reduced SQL-like Query
Filter
Web App
App Server
Filter
Web App
App Server
Filter
Web App
App Server
HTTP Server
Web App Layer
Instance 2 Instance 1 Instance 3
Database
Cache Client plugin
“HTTP Session objects cached on DCS”
session data (shopping cart, store catalogues, browsing histories ..)
“ App instances were down or restarted”
Cache
DCS Caching Cluster
12
Side Cache HTTP Session Replication
Change Data Capturing
Write- through/Write- behind/Map- reduced SQL-like Query
14
Web Server
ELB
Web Server Web Server Web Server
DCS
RDS
Messaging Service
15
Business Management Platforms Info Sources Info Metrics DCS cluster
instance instance instance instance
huge amount of the redundant data/objects/ messages
Messaging Service
after removing the redundancy
Info. Analytic Engine synchronization
16
GridGain Engine (Enterprise v8.4.1)
Nodes Replica Threads
heap
Requests CPU Usage % MEM Usage Network Mbps Latency msec Performance (Average per node)
Driver Server Driver
Server
Use Case 12 clients 1 replicaincreases # of nodes and # of client connections 9 1 360 8G 9000,000 498 252 1.56G 5.72G 60 2.01 95417
Redis Engine (v4.0.11)
Nodes Replica Threads
heap
Requests Network Mbps Latency msec Performance (Average per node)
Use Case 12 clients 1 replicaincreases # of nodes and # of client connections 8 1 320 64G 1000,000 1.5 91795
Note: the following result is for reference purpose only – not for comparison)
18
20
21
Real-time applications
In-memory data grid
Data Lake Database
Compute
nodes
Streaming data (e.g. Kafka, JMS, Feeds)
Time series database
OLAP
SQL-like query + data-intensive parallel executions
Messages (e.g. Kafka) Challenges
performance impact
and private cloud