RAPIDS: PLATFORM INSIDE AND OUT Joshua Patterson 3-19-2019 RAPIDS - PowerPoint PPT Presentation

RAPIDS: PLATFORM INSIDE AND OUT Joshua Patterson 3-19-2019

RAPIDS End to End Accelerate GPU Data Science Data Preparation Model Training Visualization cuDF cuIO cuML cuGraph PyTorch Chainer MxNet cuXfilter <> Kepler.gl Analytics Machine Learning Graph Analytics Deep Learning Visualization GPU Memory 2

DATA MOVEMENT AND TRANSFORMATION The bane of productivity and performance APP B Read Data APP B GPU APP B Copy & Convert Data CPU GPU Copy & Convert GPU APP A Copy & Convert Data APP A APP A Load Data 4

DATA MOVEMENT AND TRANSFORMATION What if we could keep data on the GPU? APP B Read Data APP B GPU APP B Copy & Convert Data CPU GPU Copy & Convert GPU APP A Copy & Convert Data APP A APP A Load Data 5

LEARNING FROM APACHE ARROW From Apache Arrow Home Page - https://arrow.apache.org/ 6

RAPIDS Core of RAPIDS Data Preparation Model Training Visualization cuDF cuIO cuML cuGraph PyTorch Chainer MxNet cuXfilter <> Kepler.gl Analytics Machine Learning Graph Analytics Deep Learning Visualization GPU Memory 8

ROAD TO 1.0 RAPIDS Is Fast… 800.00 700.00 600.00 500.00 milliseconds 400.00 300.00 200.00 100.00 0.00 lower() find(#) slice(1,15) Pandas cudastrings • CPU to GPU 10-25x improvement on average Simple Python Interface • … could be even faster! • • JIT Compression/Decompression Improved caching • • More static compiled kernels 9

ROAD TO 1.0 Focused on Robust Functionality, Deployment, and User Experience Window and Rolling Window Functions • Improved Apply function • Improved String Support • Words to Vec • Geospatial Functions • Improved Integration with Numba • Statistical functions (ANOVA, Covariance, etc…) • Feature Request https://9-volt.github.io/bug-life/?repo=rapidsai/cudf 10

ROAD TO 1.0 Focused on Robust Functionality, Deployment, and User Experience • Better error handling Replace CFFI with Cython for more descriptive • cuDF C++ errors and exceptions Cover more edge cases of functionality • cuDF Python • Push more functionality down from cuDF Python into cuDF C++ for performance and future languages Support a proper C++ API Bugs • https://9-volt.github.io/bug-life/?repo=rapidsai/cudf 11

ROAD TO 1.0 GTC Europe – Launch - RAPIDS 0.1 cuML SG MG MGMN cuGraph SG MG MGMN Gradient Boosted Decision Trees Jaccard (GBDT) GLM Weighted Jaccard Logistic Regression PageRank Random Forest (regression) Louvain K-Means SSSP K-NN BFS DBSCAN SSWP UMAP Triangle Counting ARIMA Subgraph Extraction Kalman Filter Holts-Winters Principal Components Singular Value Decomposition 12

ROAD TO 1.0 GTC San Jose – Today - RAPIDS 0.6 cuML SG MG MGMN cuGraph SG MG MGMN Gradient Boosted Decision Trees Jaccard (GBDT) GLM Weighted Jaccard Logistic Regression PageRank Random Forest (regression) Louvain K-Means SSSP K-NN BFS DBSCAN SSWP UMAP Triangle Counting ARIMA Subgraph Extraction Kalman Filter Holts-Winters Principal Components Singular Value Decomposition 13

ROAD TO 1.0 Q4 – 2019 - RAPIDS 0.12? cuML SG MG MGMN cuGraph SG MG MGMN Gradient Boosted Decision Trees Jaccard (GBDT) GLM Weighted Jaccard Logistic Regression PageRank Random Forest (regression) Louvain K-Means SSSP K-NN BFS DBSCAN SSWP UMAP Triangle Counting ARIMA Subgraph Extraction Kalman Filter Holts-Winters Principal Components Singular Value Decomposition 14

ROAD TO 1.0 Focused on Robust Functionality, Deployment, and User Experience Integration with every major cloud provider Both containers and cloud specific machine instances Support for Enterprise and HPC Orchestration Layers 15

ROAD TO 1.0 Focused on Robust Functionality, Deployment, and User Experience 3/19/19 S9788 Building a GPU-Focused CI Solution Michael Wendt • CI/CD essential to RAPIDS • Airspeed Velocity (ASV) for regression • Nightlies! • https://hub.docker.com/r/rapid sai/rapidsai-nightly • https://anaconda.org/rapidsai- nightly 16

RAPIDS Core of RAPIDS Data Preparation Model Training Visualization cuDF cuIO cuML cuGraph PyTorch Chainer MxNet cuXfilter <> Kepler.gl Analytics Machine Learning Graph Analytics Deep Learning Visualization GPU Memory 17

ETL - THE BACKBONE OF DATA SCIENCE cuDF is… CUDA With Python Bindings • Low level library containing function implementations and • A Python library for manipulating GPU DataFrames C/C++ API • Python interface to CUDA C++ with additional functionality • Importing/exporting Apache Arrow using the CUDA IPC • Creating Apache Arrow from Numpy arrays, Pandas mechanism DataFrames, and PyArrow Tables • CUDA kernels to perform element-wise math operations on GPU • JIT compilation of User-Defined Functions (UDFs) using Numba DataFrame columns • CUDA sort, join, groupby, and reduction operations on GPU DataFrames • Apache Arrow data format • Pandas-like API • Unary and Binary Operations • Joins / Merges • GroupBys • Filters • User-Defined Functions (UDFs) • Accelerated file readers • Etc. 18

ETL - THE BACKBONE OF DATA SCIENCE cuDF is not the end of the story Data Preparation Model Training Visualization Dask cuDF cuIO cuML cuGraph PyTorch Chainer MxNet cuXfilter <> Kepler.gl Analytics Machine Learning Graph Analytics Deep Learning Visualization GPU Memory 3/19/19 S9793 cuDF: RAPIDS GPU-Accelerated Data Frame Library Keith Kraus & Dante Gama Dessavre 3/20/19 RAPIDS CUDA DataFrame Internals for C++ Developers Jake Hemstad 19

ETL - THE BACKBONE OF DATA SCIENCE Why Dask PyData Native • Built on top of NumPy, Pandas Scikit-Learn, • … (easy to migrate) With the same APIs (easy to train) • With the same developer community (well • trusted) Scales • Easy to install and use on a laptop • Scales out to thousand-node clusters • Popular • Most common parallelism framework today • at PyData and SciPy conferences Deployable • HPC: SLURM, PBS, LSF , SGE • Cloud: Kubernetes • Hadoop/Spark: Yarn • 20

ETL - THE BACKBONE OF DATA SCIENCE Why Dask PyData Native • Built on top of NumPy, Pandas Scikit-Learn, • … (easy to migrate) With the same APIs (easy to train) • With the same developer community (well • trusted) Scales • Easy to install and use on a laptop • Scales out to thousand-node clusters • Popular • Most common parallelism framework today • at PyData and SciPy conferences Deployable • HPC: SLURM, PBS, LSF , SGE • Cloud: Kubernetes • Hadoop/Spark: Yarn • 21

ETL - THE BACKBONE OF DATA SCIENCE Dask-cuDF improvements in 0.7 & 0.8 Make cuDF more Pandas like • The more cuDF follows the Pandas API, the fewer changes to Dask • DataFrame Replace Dask communications with Open UCX • Pickling CUDA IPC was a clever hack, but would not scale past a • single node Focus on Dask-cuDF errors • Dask will prevent most out of memory errors users currently • experience with cuDF alone Improvements to Dask-cuDF still improve cuDF • Better memory monitoring in Dask • Improve String Support… • 22

ETL - THE BACKBONE OF DATA SCIENCE String Support in Dask-cuDF Now 0.6 String Support: Element-wise operations • Split, Find, Extract, Cat, Typecasting, etc… • String GroupBys • String Joins • Power Support now possible • Future 0.7 & 0.8 String Support: GPU accelerated to_csv • More Pandas String API compatibility • Element-wise String Comparisons • Improved Categorical column support • 23

EXTRACTION IS THE CORNERSTONE OF ETL cuIO Is Born • CSV Reader • Follows API of pandas.read_csv • Current implementation is >10x speed improvement over pandas • Parquet Reader – v0.7 • Work in progress: Will follow API of pandas.read_parquet • ORC Reader – v0.7 • Work in progress: Will have similar API of Parquet reader • Additionally looking towards GPU-accelerating decompression for common compression schemes Source: Apache Crail blog: SQL Performance: Part 1 - Input File Formats 24

ETL IS NOT JUST DATAFRAMES! 25

RAPIDS Core of RAPIDS Data Preparation Model Training Visualization Dask cuDF cuIO cuML cuGraph PyTorch Chainer MxNet cuXfilter <> Kepler.gl Analytics Machine Learning Graph Analytics Deep Learning Visualization GPU Memory 26

RAPIDS Core of RAPIDS Data Preparation Model Training Visualization Dask cuDF cuIO CuPy Numba cuML cuGraph PyTorch Chainer MxNet cuXfilter <> Kepler.gl Analytics Machine Learning Graph Analytics Deep Learning Visualization GPU Memory 27

INTEROPERABILITY FOR THE WIN DLPack and __cuda_array_interface__ 28

INTEROPERABILITY FOR THE WIN DLPack and __cuda_array_interface__ 29

RAPIDS: PLATFORM INSIDE AND OUT Joshua Patterson 3-19-2019 RAPIDS - PowerPoint PPT Presentation

RAPIDS: PLATFORM INSIDE AND OUT Joshua Patterson 3-19-2019 RAPIDS End to End Accelerate GPU Data Science Data Preparation Model Training Visualization cuDF cuIO cuML cuGraph PyTorch Chainer MxNet cuXfilter <> Kepler.gl Analytics

RAPIDS: Deep Dive Into How the Platform Works Paul Mahler, 3/18/19 Introduction to RAPIDS 2

Webinar Series CITY OF GRAND RAPIDS' CANNABIS LICENSING, SOCIAL EQUITY, AND ZONING REGULATIONS

Welcome Perham to Pelican Rapids Regional Trail Perham to Pelican Rapids Regional Trail Status

INSIDE THE PLATFORM Who are we Classic platforms Classic platform Modern platform Modern

IMC Presentation Recommendation Adopt Inside Out and Back Again by Thanhha Lai Adopt One

Little Rapids Habitat Restoration St. Marys River AOC Engineering and Design Project Update

Age-Friendly Grand Rapids Strategic Priority Alignment Economic Vibrancy & Affordability

MARS RAPIDS GPU

0.07 0.06 0.05 0.04 Unspecialized inside Specialized inside (rot, trans) Specialized inside

REMIT Inside Information Platform Key Principles: Market Participants are responsible

Commission: Out of touch, out of date, out of pocket April 2017 Commission: Out of touch, out of

Marijuana In Grand Rapids Development Center Lunch & Learn September 17, 2019 Agenda 1.

RAPIDS CUDA DataFrame Internals for C++ Developers - S91043 Jake Hemstad - NVIDIA - Developer

MARIJUANA IN GRAND RAPIDS Practitioner Informational Meeting #3 March 1, 2019 Landon Bartley,

Fair Housing Cedar Rapids Civil Rights Commission LaSheila Yates Executive Director Johnny

Housing For All Strong Neighborhoods, Strong City 2016 Grand Rapids Neighborhood Summit

Statistical Methods for Multivariate Spatial and Spatial-Temporal Processes Mikyoung Jun

Blind Chip Rate Estimation in Multirate Blind Chip Rate Estimation in Multirate CDMA

General Relativity without paradigm of space-time covariance: sensible quantum gravity and

The Matrix- F Prior for Estimating and Testing Covariance Matrices Joris Mulder & Luis R.

A geostatistical model for teleconnections Josh Hewitt 12 May 2017 Joint work with Jennifer

Robust and Practical Depth Map Fusion for Time-of-Flight Cameras Markus Ylimki 1 , Juho Kannala

Trajectory Modeling by Shape Nicholas P. Jewell Departments of Statistics & School of Public

Time-variable gravity field determination from GRACE-FO data at AIUB Martin Lasser 1 , Ulrich

RAPIDS: PLATFORM INSIDE AND OUT Joshua Patterson 3-19-2019 RAPIDS - PowerPoint PPT Presentation

RAPIDS: PLATFORM INSIDE AND OUT Joshua Patterson 3-19-2019 RAPIDS End to End Accelerate GPU Data Science Data Preparation Model Training Visualization cuDF cuIO cuML cuGraph PyTorch Chainer MxNet cuXfilter <> Kepler.gl Analytics

RAPIDS: Deep Dive Into How the Platform Works Paul Mahler, 3/18/19 Introduction to RAPIDS 2

Webinar Series CITY OF GRAND RAPIDS' CANNABIS LICENSING, SOCIAL EQUITY, AND ZONING REGULATIONS

Welcome Perham to Pelican Rapids Regional Trail Perham to Pelican Rapids Regional Trail Status

INSIDE THE PLATFORM Who are we Classic platforms Classic platform Modern platform Modern

IMC Presentation Recommendation Adopt Inside Out and Back Again by Thanhha Lai Adopt One

Little Rapids Habitat Restoration St. Marys River AOC Engineering and Design Project Update

Age-Friendly Grand Rapids Strategic Priority Alignment Economic Vibrancy &amp; Affordability

MARS RAPIDS GPU

0.07 0.06 0.05 0.04 Unspecialized inside Specialized inside (rot, trans) Specialized inside

REMIT Inside Information Platform Key Principles: Market Participants are responsible

Commission: Out of touch, out of date, out of pocket April 2017 Commission: Out of touch, out of

Marijuana In Grand Rapids Development Center Lunch &amp; Learn September 17, 2019 Agenda 1.

RAPIDS CUDA DataFrame Internals for C++ Developers - S91043 Jake Hemstad - NVIDIA - Developer

MARIJUANA IN GRAND RAPIDS Practitioner Informational Meeting #3 March 1, 2019 Landon Bartley,

Fair Housing Cedar Rapids Civil Rights Commission LaSheila Yates Executive Director Johnny

Housing For All Strong Neighborhoods, Strong City 2016 Grand Rapids Neighborhood Summit

Statistical Methods for Multivariate Spatial and Spatial-Temporal Processes Mikyoung Jun

Blind Chip Rate Estimation in Multirate Blind Chip Rate Estimation in Multirate CDMA

General Relativity without paradigm of space-time covariance: sensible quantum gravity and

The Matrix- F Prior for Estimating and Testing Covariance Matrices Joris Mulder &amp; Luis R.

A geostatistical model for teleconnections Josh Hewitt 12 May 2017 Joint work with Jennifer

Robust and Practical Depth Map Fusion for Time-of-Flight Cameras Markus Ylimki 1 , Juho Kannala

Trajectory Modeling by Shape Nicholas P. Jewell Departments of Statistics &amp; School of Public

Time-variable gravity field determination from GRACE-FO data at AIUB Martin Lasser 1 , Ulrich

Age-Friendly Grand Rapids Strategic Priority Alignment Economic Vibrancy & Affordability

Marijuana In Grand Rapids Development Center Lunch & Learn September 17, 2019 Agenda 1.

The Matrix- F Prior for Estimating and Testing Covariance Matrices Joris Mulder & Luis R.

Trajectory Modeling by Shape Nicholas P. Jewell Departments of Statistics & School of Public