Data Inges*on for the Connected World
John Meehan, Cansu Aslantas, Stan Zdonik (Brown University) Nesime Tatbul (Intel Labs & MIT) Jiang Du (University of Toronto)
Data Inges*on for the Connected World John Meehan, Cansu Aslantas, - - PowerPoint PPT Presentation
Data Inges*on for the Connected World John Meehan, Cansu Aslantas, Stan Zdonik (Brown University) Nesime Tatbul (Intel Labs & MIT) Jiang Du (University of Toronto) The IoT Era Tradi*onal Data Inges*on (ETL) E XTRACT T RANSFORM L OAD
John Meehan, Cansu Aslantas, Stan Zdonik (Brown University) Nesime Tatbul (Intel Labs & MIT) Jiang Du (University of Toronto)
DATA WAREHOUSE FLAT FILES STAGING OLAP/STORAGE DATA SOURCES
INTERMEDIATE RESULTS DATA CLEANING
DATA NORMALIZATION INTERMEDIATE RESULTS
3
4
hWp://www.tpc.org/tpcdi/ Poess et al, VLDB 2014
5
hWp://www.tpc.org/tpcdi/ Poess et al, VLDB 2014.
6
hWp://www.tpc.org/tpcdi/ Poess et al, VLDB 2014.
7
hWp://www.tpc.org/tpcdi/ Poess et al, VLDB 2014.
8
9
DISK STORAGE S-STORE
SP1 SP2 SP3
MAIN-MEMORY STORAGE
POSTGRES
BIGDAWG KAFKA
DATA SOURCES
10
DISK STORAGE S-STORE
SP1 SP2 SP3
MAIN-MEMORY STORAGE
POSTGRES
BIGDAWG KAFKA
DATA SOURCES
11
12
DA DATE, TE, TIME, TIME, ST STATUS, TUS, TYPE TYPE SECURITY LOOKUP ACCOUNT LOOKUP UPDATE TRADE DATA (STAGING) Date Time
Status Type
DimSecurity DimAccount DimTrade
13
DA DATE, TE, TIME, TIME, ST STATUS, TUS, TYPE TYPE SECURITY LOOKUP ACCOUNT LOOKUP UPDATE TRADE DATA (STAGING) Date Time
Status Type
DimSecurity DimAccount DimTrade TE1 TE2
Transaction Execution (TE) = An instance of a stored procedure executing on an input batch
14
DA DATE, TE, TIME, TIME, ST STATUS, TUS, TYPE TYPE SECURITY LOOKUP ACCOUNT LOOKUP UPDATE TRADE DATA (STAGING) Date Time
Status Type
DimSecurity DimAccount DimTrade TE1 TE2
Shared state read or written by TEs
15
DISK STORAGE S-STORE
SP1 SP2 SP3
MAIN-MEMORY STORAGE
POSTGRES
BIGDAWG KAFKA
DATA SOURCES
16
17
18
19