MapD #mapd @datarefined www.map-d.com 180 Sansome St. Todd Mostak - PowerPoint PPT Presentation

MapD #mapd @datarefined www.map-d.com 180 Sansome St. Todd Mostak todd@map-d.com Ι Ι @datarefined San Francisco, CA 94104

super-fast database MapD? built into GPU memory world’s fastest Do? real-time big data analytics interactive visualization twitter analytics platform Demo? 1billion+ tweets millisecond response time

The importance of interactivity People have struggled for a long time to build interactive visualizations of big data that can deliver insight Interactivity means: • Hypothesis testing can occur at “speed of thought” How Interactive is interactive enough? • According to a study by Jeffrey Heer and Zhicheng Liu, “an injected delay of half a second per operation adversely affects user performance in exploratory data analysis.” • Some types of latency are more detrimental than others: • For example, linking and brushing more sensitive than zooming

The Arrival of In-Memory Systems • Traditional RDBMS used to be too slow to serve as a back-end for interactive visualizations. • Queries of over a billion records could take minutes if not hours • But in-memory systems can execute such queries in a fraction of the time. • Both full DBMS and “pseudo” -DBMS solutions • But still often too slow

Enter Map-D

the technology

Core Innovation SQL-enabled column store database built into the memory architecture on GPUs and CPUs Code developed from scratch to take advantage of: • Memory and computational bandwidth of multiple GPUs • Heterogeneous architectures (CPUs and GPUs) • Fast RDMA between GPUs on different nodes • GPU Graphics pipeline Two-level buffer pool across GPU and CPU memory Shared scans – multiple queries of the same data can share memory bandwidth System can scan data at > 2TB/sec per node, with > 10TB/sec per node logical throughput with shared scans

The Hardware Switch IB IB IB IB GPU 0 GPU 1 GPU 2 GPU 3 GPU 0 GPU 1 GPU 2 GPU 3 PCI PCI PCI PCI QPI QPI CPU 0 CPU 1 CPU 0 CPU 1 RAID Controller RAID Controller S1 S2 S3 S4 S1 S2 S3 S4 Node 0 Node 1

The Two-Level Buffer Pool GPU Memory CPU Memory SSD

Shared Nothing Processing Multiple GPUs, with data partitioned between them Filter Filter Filter text ILIKE ‘rain’ text ILIKE ‘rain’ text ILIKE ‘rain’ Node 1 Node 2 Node 3

the product

Product GPU powered end-to-end big data analytics and visualization platform License Image processing Simple Machine learning OpenGL # of GPUs Graph analytics H.264/VP8 streaming Mobile/server versions GPU pipeline Visualization Complex Analytics Scale to cluster of GPU nodes SQL compiler Shared scans User defined functions Hybrid GPU/CPU execution GPU in-memory SQL OpenCL and CUDA database

MapD hardware architecture Big Data Large Data Single GPU Map-D code 12GB memory runs on GPU + Map-D code CPU memory integrated into Map-D code 36U rack: GPU memory 8 cards = 4U box ~400GB GPU ~12TB CPU Single CPU 768GB memory Next Gen Flash Map-D code integrated into 40TB CPU memory 4 sockets = 4U box 100GB/s Small Data Mobile NVIDIA TEGRA Map-D running Mobile chip small datasets 4GB memory Native App Map-D code Web-based integrated into service chip memory

MapD www.map-d.com @datarefined info@map-d.com

MapD #mapd @datarefined www.map-d.com 180 Sansome St. Todd Mostak - PowerPoint PPT Presentation

MapD #mapd @datarefined www.map-d.com 180 Sansome St. Todd Mostak todd@map-d.com @datarefined San Francisco, CA 94104 super-fast database MapD? built into GPU memory worlds fastest Do? real-time big data analytics interactive

2019 | MAPD PLANNING PROCESS June une 20 2018 18 Grant awarded to Neighborhood Association

Big Data, Little Cluster: Using a Small Footprint of GPU Servers to Interactively Query and

Delano Plan Advisory Committee 11/27/17 - Meeting 4 MAPD WELCOME Agenda Visual Preference

Delano Plan Advisory Committee 8/28/17 - Meeting 1 MAPD WELCOME Agenda Nomination and

Buffer Pools Lecture # 05 Database Systems Andy Pavlo AP AP Computer Science 15-445/15-645

Hash Tables Lecture # 06 Database Systems Andy Pavlo AP AP Computer Science 15-445/15-645

Problems in the appearance of silicon photomultipliers: a brief history and perspectives Ziraddin

Combining Data Remapping and Voltage/Frequency Scaling of Second Level Memory for Energy

Key Stage 2 SATs 2016 Information on the changes and expectations for 2016 Key Stage 2 SATs

Lets Go Series Authors: G. Kocienda K. Frazier R. Nakata B. Hoskins Publisher: OXFORD

Year 12 Post-GCSE Options Parent Presentation Tuesday 30 th January 2018 Welcome & Whos

Los Angeles Unified School District Class Size & Facilities Planning Impacts g p FACILITIES

A Trie Merging Approach with Incremental Updates for Virtual Routers Layong Luo* , Gaogang

Project Plan Security Analytics Suite: Dataset Merger Tool The Capstone Experience Team Avata

Broadway - Elmhurst Corridor Safety Improvement Project 2014 Commissioner Polly Trottenberg New

1964 "In the 1660's the English Crown instructed the Lord Proprietors to build a system

Presentation Proposal PRESENTER ONE Name Title Institution E-mail PRESENTER TWO (if

Static Program Analysis Xiangyu Zhang The slides are compiled from Alex Aikens Michael D.

IntelliSense RFID RFID PLATFORM WITH SENSORS INTEGRATION CAPABILITIES One Global Sensing

Coronado Neighborhood Association Fall 2005 MPA Capstone: Emmanuel Garcia, Gary Jackson, Jose

Variable Fonts and the future of typography Jason Pamental | @jpamental Design 4 Drupal |

RHAPSODY & AUTOSAR WALTER VAN DER HEIDEN WILLERT SOFTWARE TOOLS ABOUT WILLERT SOFTWARE TOOLS

Reviewed by: Jie Wei BEMS EMS- Building Energy Management Systems Environmental-friendly

BALLOON PLATFORM FOR HIGH ALTITUDE RESEARCH System Details UPRA-ReHAB Tests and project

Sambuz

Useful Links

Newsletter

Mail Us

MapD #mapd @datarefined www.map-d.com 180 Sansome St. Todd Mostak - PowerPoint PPT Presentation

MapD #mapd @datarefined www.map-d.com 180 Sansome St. Todd Mostak todd@map-d.com @datarefined San Francisco, CA 94104 super-fast database MapD? built into GPU memory worlds fastest Do? real-time big data analytics interactive

2019 | MAPD PLANNING PROCESS June une 20 2018 18 Grant awarded to Neighborhood Association

Big Data, Little Cluster: Using a Small Footprint of GPU Servers to Interactively Query and

Delano Plan Advisory Committee 11/27/17 - Meeting 4 MAPD WELCOME Agenda Visual Preference

Delano Plan Advisory Committee 8/28/17 - Meeting 1 MAPD WELCOME Agenda Nomination and

Buffer Pools Lecture # 05 Database Systems Andy Pavlo AP AP Computer Science 15-445/15-645

Hash Tables Lecture # 06 Database Systems Andy Pavlo AP AP Computer Science 15-445/15-645

Problems in the appearance of silicon photomultipliers: a brief history and perspectives Ziraddin

Combining Data Remapping and Voltage/Frequency Scaling of Second Level Memory for Energy

Key Stage 2 SATs 2016 Information on the changes and expectations for 2016 Key Stage 2 SATs

Lets Go Series Authors: G. Kocienda K. Frazier R. Nakata B. Hoskins Publisher: OXFORD

Year 12 Post-GCSE Options Parent Presentation Tuesday 30 th January 2018 Welcome &amp; Whos

Los Angeles Unified School District Class Size &amp; Facilities Planning Impacts g p FACILITIES

A Trie Merging Approach with Incremental Updates for Virtual Routers Layong Luo* , Gaogang

Project Plan Security Analytics Suite: Dataset Merger Tool The Capstone Experience Team Avata

Broadway - Elmhurst Corridor Safety Improvement Project 2014 Commissioner Polly Trottenberg New

1964 &quot;In the 1660's the English Crown instructed the Lord Proprietors to build a system

Presentation Proposal PRESENTER ONE Name Title Institution E-mail PRESENTER TWO (if

Static Program Analysis Xiangyu Zhang The slides are compiled from Alex Aikens Michael D.

IntelliSense RFID RFID PLATFORM WITH SENSORS INTEGRATION CAPABILITIES One Global Sensing

Coronado Neighborhood Association Fall 2005 MPA Capstone: Emmanuel Garcia, Gary Jackson, Jose

Variable Fonts and the future of typography Jason Pamental | @jpamental Design 4 Drupal |

RHAPSODY &amp; AUTOSAR WALTER VAN DER HEIDEN WILLERT SOFTWARE TOOLS ABOUT WILLERT SOFTWARE TOOLS

Reviewed by: Jie Wei BEMS EMS- Building Energy Management Systems Environmental-friendly

BALLOON PLATFORM FOR HIGH ALTITUDE RESEARCH System Details UPRA-ReHAB Tests and project

Sambuz

Useful Links

Newsletter

Mail Us

Year 12 Post-GCSE Options Parent Presentation Tuesday 30 th January 2018 Welcome & Whos

Los Angeles Unified School District Class Size & Facilities Planning Impacts g p FACILITIES

1964 "In the 1660's the English Crown instructed the Lord Proprietors to build a system

RHAPSODY & AUTOSAR WALTER VAN DER HEIDEN WILLERT SOFTWARE TOOLS ABOUT WILLERT SOFTWARE TOOLS