What the heck is an In-Memory Data Grid? @addisonhuddy How are we - PowerPoint PPT Presentation

What the heck is an In-Memory Data Grid? @addisonhuddy

How are we going to answer this question? 1. Tell you about my first introduction to IMDGs 2. See some real-world use cases 3. Design an IMDG 4. Implement Use Cases

Definition IMDGs provide a lightweight, distributed, scale-out in-memory object store — the data grid. Multiple applications can concurrently perform transactional and/or analytical operations in the low-latency data grid, thus minimizing access to high-latency, hard-disk-drive-based or solid-state-drive-based data storage. 1 Gartner 1 https://www.gartner.com/reviews/market/in-memory-data-grids

My First Thought

My Second Thought

Two Examples Southwest China Railway Airlines Corporation 5,700 train stations 70+ cities 4.5 million tickets per day 4,000 daily flights 20 million daily users 706 aircraft 1.4 billion page views per day Largest airline website by visitors 40,000 visits per second

When Not To Use An IMDG - Small Amounts of Data - Low-latency isn’t mission critical - Not a total replacement for RDBMS

Let’s Make an IMDG

Design Goals - Extremely Low Latency - High Throughput - Durability - Large Datasets - Consistency?

Design Goals - Memory First - Extremely Low Latency - Horizontal Scalability / - High Throughput Elasticity - Data Aware Routing - Durability - Serialization / - Large Datasets Deserialization - Consistency

https://github.com/apache/geode

Memory First

Latency Comparison Latency Comparison Numbers -------------------------- L1 cache reference 0.5 ns Branch mispredict 5 ns L2 cache reference 7 ns 14x L1 cache Mutex lock/unlock 25 ns Main memory reference 100 ns 20x L2 cache, 200x L1 cache Compress 1K bytes with Zippy 3,000 ns 3 us Send 1K bytes over 1 Gbps network 10,000 ns 10 us SSD Seek 100,000 ns 100 us Read 4K randomly from SSD* 150,000 ns 150 us ~1GB/sec SSD Read 1 MB sequentially from memory 250,000 ns 250 us Round trip within same datacenter 500,000 ns 500 us Read 1 MB sequentially from SSD* 1,000,000 ns 1,000 us 1 ms ~1GB/sec SSD, 4X memory Disk seek 10,000,000 ns 10,000 us 10 ms 20x datacenter roundtrip Read 1 MB sequentially from disk 20,000,000 ns 20,000 us 20 ms 80x memory, 20X SSD Send packet CA->Netherlands->CA 150,000,000 ns 150,000 us 150 ms 1 Credit Jeff Dean, Peter Norvig, and Jonas Bonér

Why Memory? Read 1 MB Comparison Hardware True Time Scaled Time Memory 250,100 ns 2 days SSD 1,100,000 ns 9 days Disk 30,000,000 8 months

Horizontal Scalability / Elasticity

System Architecture Client Server Server Client Client Client Client ... Server Server Client Client Client Client Client Locator Locator

System Architecture Client Server Server Client Client Client Client ... Client Client Client Client Client Locator Locator

System Architecture Client Server Server Client Client Client Client ... Server Client Client Client Client Client Locator Locator

IMDGs & CAP Theorem A vailability C onsistency P artition Tolerance

WAN Replication lient Data Center Data Center (NYC) (Tokyo) S S S S S S S S L L L L

Data Aware Routing

Latency Comparison Latency Comparison Numbers -------------------------- L1 cache reference 0.5 ns Branch mispredict 5 ns L2 cache reference 7 ns 14x L1 cache Mutex lock/unlock 25 ns Main memory reference 100 ns 20x L2 cache, 200x L1 cache Compress 1K bytes with Zippy 3,000 ns 3 us Send 1K bytes over 1 Gbps network 10,000 ns 10 us SSD Seek 100,000 ns 100 us Read 4K randomly from SSD* 150,000 ns 150 us ~1GB/sec SSD Read 1 MB sequentially from memory 250,000 ns 250 us Round trip within same datacenter 500,000 ns 500 us Read 1 MB sequentially from SSD* 1,000,000 ns 1,000 us 1 ms ~1GB/sec SSD, 4X memory Disk seek 10,000,000 ns 10,000 us 10 ms 20x datacenter roundtrip Read 1 MB sequentially from disk 20,000,000 ns 20,000 us 20 ms 80x memory, 20X SSD Send packet CA->Netherlands->CA 150,000,000 ns 150,000 us 150 ms 1 Credit Jeff Dean, Peter Norvig, and Jonas Bonér

Single Hop Client Server Server Client Client Client Client ... Server Server Client Client Client Client Client Locator Locator

Local Cache Client Server Server Client Client Client Client ... Server Server Client Client Client Client Client Locator Locator

Serialization 1. Only (de)serialize when it is necessary 2. Only (de)serialize what is absolutely necessary 3. Distribute (de)serialize cost as much as possible

Basic User Operations

What have we created? - Put/Get - Key/Value Object Store - Queries - Share-nothing - Server-side functions architecture - Registered Interests - Continuous Queries - Memory Oriented - Event Queues - Strongly Consistent

Use Cases

In-line Caching S S Client Client RDBMS Client Client C S S L L

Look-Aside Caching RDBMS Client Client Client Client C S S S S L L

Pub / Sub System 1 Client Server Server Client Client 2 Client Client ... 2 Server Server Client Client Client Client Client Locator Locator

Real-Time Analytics with Functions Client Server Server Client Client Client Client ... Server Server Client Client Client Client Client Locator Locator

Distributed Computation Client Server Server Cient Server Server Client

Real-Time Analytics Client Client Client Server Server Client Rapidly Changing Data Client Client Server Server Client Client Client

O’Reilly Book

Questions @addisonhuddy

What the heck is an In-Memory Data Grid? @addisonhuddy How are we - PowerPoint PPT Presentation

What the heck is an In-Memory Data Grid? @addisonhuddy How are we going to answer this question? 1. Tell you about my first introduction to IMDGs 2. See some real-world use cases 3. Design an IMDG 4. Implement Use Cases Definition IMDGs

Sun and Grid John Barr Grid Business Development 07808 328351 john.barr@sun.com Sun and Grid

ON-GRID VS OFF-GRID SOLAR On-Grid Solar is solar generation that is connected to the utility grid

Massachusetts Creative E Massachusetts Creative E conomy conomy What the Heck THE HECK WHAT ^

Memory II. Memory improvement III. Problems with memory 3 systems/stages of Memory: memory

Migrating from Grid to Cloud: Migrating from Grid to Cloud: Migrating from Grid to Cloud:

1 Memory SoC Persistent Memory-Driven Memory Memory Processor-Centric Memory SoC SoC

SEE-GRID Deploying a Grid-enabled eInfrastructure in SE Europe www.see-grid.org Jorge Sanchez,

Networks Computer-Computer Comm CPU CPU CPU CPU Memory Device Device Memory Memory

Modernizing T&D on the Electric Grid 11/29/2011 Mark Nealon System Meter & Smart Grid

Grid Grid to Grid Grid-to to Ports Clock Routing for to-Ports Clock Routing for Ports Clock

Grid/Clo d Comp ting Grid/Clo d Comp ting Grid/Cloud Computing Grid/Cloud Computing over

SEE-GRID-SCI SEE-GRID Infrastructure for Regional eScience www.see-grid-sci.eu International

Virtual Memory 1 Memory Hierarchy Memory 4GB Cache 1M Registers 1K Question: What if

Personal SE Computer Memory Addresses C Pointers Computer Memory Organization Memory is a

Memory Memory processing is the ability to: Acquire (Short term memory) Manipulate

Memory Management Memory Manager Requirements Minimize primary memory access time

Categories of natural models of type theory CT 2016 (Halifax, NS, Canada) Clive Newstead

A Two-Stage Parsing Method for Text-Level Discourse Analysis Yizhong Wang , Sujian Li, Houfeng

1 2

LIGO-Virgo Searches for Gravita5onal- Waves Associated with GRBs

Transactions and Concurrency Control (Manga Guide to DB, Chapter 5, pg 125-137, 153-160) 1

Architecting HBM as a High Bandwidth, High Capacity, Self-Managed Last-Level Cache Tyler

Epidemics on random graphs with a given degree sequence Malwina Luczak 1 2 School of Mathematical

Extending Scalability of Collective IO Through Nessie and Staging Parallel Data Storage Workshop

What the heck is an In-Memory Data Grid? @addisonhuddy How are we - PowerPoint PPT Presentation

What the heck is an In-Memory Data Grid? @addisonhuddy How are we going to answer this question? 1. Tell you about my first introduction to IMDGs 2. See some real-world use cases 3. Design an IMDG 4. Implement Use Cases Definition IMDGs

Sun and Grid John Barr Grid Business Development 07808 328351 john.barr@sun.com Sun and Grid

ON-GRID VS OFF-GRID SOLAR On-Grid Solar is solar generation that is connected to the utility grid

Massachusetts Creative E Massachusetts Creative E conomy conomy What the Heck THE HECK WHAT ^

Memory II. Memory improvement III. Problems with memory 3 systems/stages of Memory: memory

Migrating from Grid to Cloud: Migrating from Grid to Cloud: Migrating from Grid to Cloud:

1 Memory SoC Persistent Memory-Driven Memory Memory Processor-Centric Memory SoC SoC

SEE-GRID Deploying a Grid-enabled eInfrastructure in SE Europe www.see-grid.org Jorge Sanchez,

Networks Computer-Computer Comm CPU CPU CPU CPU Memory Device Device Memory Memory

Modernizing T&amp;D on the Electric Grid 11/29/2011 Mark Nealon System Meter &amp; Smart Grid

Grid Grid to Grid Grid-to to Ports Clock Routing for to-Ports Clock Routing for Ports Clock

Grid/Clo d Comp ting Grid/Clo d Comp ting Grid/Cloud Computing Grid/Cloud Computing over

SEE-GRID-SCI SEE-GRID Infrastructure for Regional eScience www.see-grid-sci.eu International

Virtual Memory 1 Memory Hierarchy Memory 4GB Cache 1M Registers 1K Question: What if

Personal SE Computer Memory Addresses C Pointers Computer Memory Organization Memory is a

Memory Memory processing is the ability to: Acquire (Short term memory) Manipulate

Memory Management Memory Manager Requirements Minimize primary memory access time

Categories of natural models of type theory CT 2016 (Halifax, NS, Canada) Clive Newstead

A Two-Stage Parsing Method for Text-Level Discourse Analysis Yizhong Wang , Sujian Li, Houfeng

1 2

LIGO-Virgo Searches for Gravita5onal- Waves Associated with GRBs

Transactions and Concurrency Control (Manga Guide to DB, Chapter 5, pg 125-137, 153-160) 1

Architecting HBM as a High Bandwidth, High Capacity, Self-Managed Last-Level Cache Tyler

Epidemics on random graphs with a given degree sequence Malwina Luczak 1 2 School of Mathematical

Extending Scalability of Collective IO Through Nessie and Staging Parallel Data Storage Workshop

Modernizing T&D on the Electric Grid 11/29/2011 Mark Nealon System Meter & Smart Grid