Speed up Mission-Critical Analytics in the Cloud
Billy Liu, VP of Kyligence, Apache Kylin PMC Yiming.liu@kyligence.io
Speed up Mission-Critical Analytics in the Cloud Billy Liu, VP of - - PowerPoint PPT Presentation
Speed up Mission-Critical Analytics in the Cloud Billy Liu, VP of Kyligence, Apache Kylin PMC Yiming.liu@kyligence.io + + Formed
Billy Liu, VP of Kyligence, Apache Kylin PMC Yiming.liu@kyligence.io
+
seconds query speed on PB/TB dataset
YARN
SQL on Hadoop
powered by Apache Kylin
BI Visualization OLAP Data Mart Big Data Platform Source Data HDFS YARN MapReduce Spark Kafka
…
Spark SQL
dataset
pattern intelligently
SQL on Hadoop RDBMS Cube
Mission critical analysis Sub-second delay
§ Spark SQL § Hive § Impala § … ... Exploratory analysis Minute delay SQL Intelligent speed up
#226 of Fortune 500
#4 Smart Phone Global
#1 Fintech in China
#252 of Fortune 500
#41 of Fortune 500
#47 of Fortune 500
#83 of Fortune 500
#3 Securities in China
#2 Securities in China 1000+ open source adoptions
CBA.A:D
Cognos 10.2.2 KAP Hadoop 2.7.2 Hive 1.3.0 HBase 1.0.2 :BA.C )D:
B: BD
)
source technology
Access anytime & anywhere Scalability & Elasticity Gap between BI and Big Data Continuous Available Resiliency & Redundancy Global Deployment
Security & Privacy High cost for data-intensive application Mission-critical DW migration Low performance at web scale Cost Efficiency System Optimization Skills shortage Data movement between RDBMS and Big Data
source, storage, and services on cloud managed Hadoop
and Hadoop stack in minutes
computing resource dynamically for on- demand workload Kyligence Analytics Platform
Cluster Mgnt Cloud Native Storage Kyligence Console Cloud Adaptor AWS S3 Azure Blob Storage … Task Node/Work Node AWS EMR Azure HDI …
VPC
Encrypted Key Edge Node Kyligence Analytics Platform Analyst
User Space
Power BI Excel Cognitiv e Service Ingestion OMS AAD Admin Monitor Cube Analysis KAP Service Processing Scheduler ERP Devices Online Logs Machine Learning Data Service
1 2 3
Sub-second Response
Database Blob Storage Event Hub Data Lake Blob Storage/Data Lake
A-C
D A (AA D( AA
AD EE EE
Cubes
YARN workload
temporary resource for cubing job
resource for analytics services
demand
deployment for near access*
and exceptions
best
tasks