Using NVIDIA CUDF to Simplify and Accelerate Data Prep for Credit - PowerPoint PPT Presentation

Using NVIDIA CUDF to Simplify and Accelerate Data Prep for Credit Card Algo. Prediction March 19, 2019 Richard Liu Vice President

Agenda • Macro economics trends • Behavioral surplus • Paradigm shift • Deep dive to the data • How RAPIDS/cuDF helps 2

Perspective on the challenges Business case: Credit card business now faces the challenges on risk management and more importantly on payment or transaction behavior. The conventional balance sheet data approaches can hardly afford such new requirement. U.S. Credit Cards 1,000,000 4.00% 900,000 3.50% 800,000 3.00% Amounts in $ Million 700,000 2.50% 600,000 $ % Rate 500,000 2.00% 400,000 1.50% 300,000 1.00% 200,000 0.50% 100,000 0 0.00% 1984Q1 1985Q1 1986Q1 1987Q1 1988Q1 1989Q1 1990Q1 1991Q1 1992Q1 1993Q1 1994Q1 1995Q1 1996Q1 1997Q1 1998Q1 1999Q1 2000Q1 2001Q1 2002Q1 2003Q1 2004Q1 2005Q1 2006Q1 2007Q1 2008Q1 2009Q1 2010Q1 2011Q1 2012Q1 2013Q1 2014Q1 2015Q1 2016Q1 2017Q1 2018Q1 Total outstanding Noncurrent rate Source from FDIC: Loan Performance (as of 2018/Q4)

Trade secret on behavioral surplus Traditional Digital Age Action to behavior to data to prediction Surveillance capitalism Pool level thinking Book keeping

Paradigm Shift But … How to walk the talk?

Now We Ha Have New Way T y To Look At Dat ata Examples (simulated data for illustration purpose) Customer ID: cust_id Merchant category code: mcc Transaction date: trans_date Dollar amount: trans_amt Array objects after pivoting process: [array of mcc], [array of trans_date], [array of trans_amt] Neuroscience observation on customer behavior (Visualization)

Why y RAPID IDS S cu cuDF Prog ogress ress so far has large gely ly been n towar ard d demonstratin onstrating g general neral approac oaches hes for building lding narrow ow systems stems rather er than n general neral approac oaches hes for building lding genera neral l systems. stems. Progre gress ss toward rd the forme mer r does s not entail ail substanti stantial l progre gress ss towar ard d the latter. ter. AlphaGo and AI Progress. Retrieved October 24, 2017, from http://www.milesbrundage.com/blog-posts/alphago-and-ai-progress. Our expectation: The advantage of modern computation: • The efficient way to deal with very sparse Functional language: data against computation • Performance with ease of programming increment :: [int] -> [int] (Python Pandas like) increment = map (1+) • Much better return on GPU solution investment

How RAPIDs Helps On Transaction Over Time Horizon Easier yet efficient way to resolve the chronic “horizon stacking” data Conventional Distributed over GPU cores SELECT COUNT(), PATTERN = HORIZON(); SUM(), Data object STD(), (0 until array.length) .map( I = PATTERN PARTITION BY () # Window function .addData( attributes( i ),array( i ))) FROM … LEFT JOIN … Smart distributed computation by RAPIDS GROUP BY … Time interval = 1…n

Challenges from DL computation With conventional table way, how to find a departure from the prevailing deep learning zeitgeist that prizes learning from scratch, tabula rasa. Table with system records Hebbian learning like representation SDR

CuDF with Better format for Deep Learning Like Computation Inspiration from Recursive Cortical Network, Hierarchical Temporal Memory function feature_map( hierarchical , data [1.. T ], C ) levels [1.. L ] <- hierarchical.levels for l <- 1 to L do regions <- levels [ l ]. regions for all r in regions do Until spatialPooling converged for r for t <-1 to T do spatialPooling ( r , data [ t ]) end for end Until for c <- 1 to C do for t <- 1 to T do spatialPooling ( r , data [ t ]) Sparse_Data_Representation <- pivoted_array Time_Horizon_Pooling ( r , Sparse_Data_Representation ) end for end for * Dileep George et al. Science 2017;358:eaag2612 (published by AAAS) end for end for end function

How Much RAPIDs Helps • Speed, speed, speed! Things you should know by yesterday. • More time to think (smart machine for smart people). – Feature engineering (more and accurate) – Computational significance (less data yet robust to noise) Dileep George et al. Science 2017;358:eaag2612 (published by AAAS) Github scripts: https://github.com/vicariousinc/science_rcn

Thank you

Using NVIDIA CUDF to Simplify and Accelerate Data Prep for Credit - PowerPoint PPT Presentation

Using NVIDIA CUDF to Simplify and Accelerate Data Prep for Credit Card Algo. Prediction March 19, 2019 Richard Liu Vice President Agenda Macro economics trends Behavioral surplus Paradigm shift Deep dive to the data How

Lindab Group We simplify construction 1 lindab | we simplify construction lindab | we

Grant Prep Boot Camp Grant Prep Boot Camp Grant Prep Boot Camp Grant Prep Boot Camp Robyn Gershon,

Partners PrEP Trial Oral PrEP for Heterosexual Couples in Kenya and Uganda Partners PrEP: Study

RAPIDS CUDA DataFrame Internals for C++ Developers - S91043 Jake Hemstad - NVIDIA - Developer

FOR THE BEST VDI USER EXPERIENCE NVIDIA VIRTUAL GPU PRODUCT POSITIONING NVIDIA GRID NVIDIA

NVIDIA NSIGHT ECLIPSE EDITION CHRISTOPH ANGERER, NVIDIA JULIEN DEMOUTH, NVIDIA WHAT YOU WILL

Austin Marathon and Half Prep & Pump 2015 The Prep, Part 1: Your Mind Lennie Waite

The PREP Process An Implementing Partner's Guide to a Quality PREP Melissa Joy,

ACCELERATE AUDIT ACCELERATE ATTAIN ALIGN ACCREDIT THE 4 STAGE PROCESS ACCELERATE ACCREDIT

NVIDIA Quadro and NVS Video Walls NVIDIA Quadro and NVS Video Walls Using NVIDIA technology to

NVIDIA INDEX IMPLEMENTING ADVANCED DATA VISUALIZATION WITH NVIDIA INDEX Alexander Kuhn and Marc

NVIDIA INDEX IMPLEMENTING CLOUD SERVICES FOR MASSIVE DATA VISUALIZATION Marc Nienhaus (NVIDIA),

with OpenACC Directives Michael Wolfe michael.wolfe@pgroup.com http://www.pgroup.com/accelerate

Red Hat and the NVIDIA DGX: Tried, Tested, Trusted NVIDIA GTC 2019 Jeremy Eder, Andre Beausoleil,

Cutting Edge Tools and Techniques for Real-Time Rendering with NVIDIA GameWorks David Coombes,

NVIDIA VIDEO TECHNOLOGIES Abhijit Patait, 3/20/2019 NVIDIA Video Technologies Overview Turing

Third quarter 2018 results Delivering a world-class investment case Royal Dutch Shell plc

CRE Markets 2 Mortgage Bankers Association 1 11/9/2017 Mortgage Bankers Association Snapshot

SN SNX1 X1000 00 Report eport Month nthly R Repor eport t on on th the e Emergi

Construction and Real Estate Developments in Morocco Presentation by MEYS Emerging Markets

W O O L W O R T H S H O L D I N G S L I M I T E D 2018 Annual Results 1

REITweek 2018 Investor Conference June 2018 NYSE: ZAYO @ZayoGroup Safe Harbor Information

Australian Employment Projections Carmel ORegan Director Occupational and Industry Analysis

Greece in the European Semester Policy Framework post-Covid19 Presentation by Declan Costello

Using NVIDIA CUDF to Simplify and Accelerate Data Prep for Credit - PowerPoint PPT Presentation

Using NVIDIA CUDF to Simplify and Accelerate Data Prep for Credit Card Algo. Prediction March 19, 2019 Richard Liu Vice President Agenda Macro economics trends Behavioral surplus Paradigm shift Deep dive to the data How

Lindab Group We simplify construction 1 lindab | we simplify construction lindab | we

Grant Prep Boot Camp Grant Prep Boot Camp Grant Prep Boot Camp Grant Prep Boot Camp Robyn Gershon,

Partners PrEP Trial Oral PrEP for Heterosexual Couples in Kenya and Uganda Partners PrEP: Study

RAPIDS CUDA DataFrame Internals for C++ Developers - S91043 Jake Hemstad - NVIDIA - Developer

FOR THE BEST VDI USER EXPERIENCE NVIDIA VIRTUAL GPU PRODUCT POSITIONING NVIDIA GRID NVIDIA

NVIDIA NSIGHT ECLIPSE EDITION CHRISTOPH ANGERER, NVIDIA JULIEN DEMOUTH, NVIDIA WHAT YOU WILL

Austin Marathon and Half Prep &amp; Pump 2015 The Prep, Part 1: Your Mind Lennie Waite

The PREP Process An Implementing Partner's Guide to a Quality PREP Melissa Joy,

ACCELERATE AUDIT ACCELERATE ATTAIN ALIGN ACCREDIT THE 4 STAGE PROCESS ACCELERATE ACCREDIT

NVIDIA Quadro and NVS Video Walls NVIDIA Quadro and NVS Video Walls Using NVIDIA technology to

NVIDIA INDEX IMPLEMENTING ADVANCED DATA VISUALIZATION WITH NVIDIA INDEX Alexander Kuhn and Marc

NVIDIA INDEX IMPLEMENTING CLOUD SERVICES FOR MASSIVE DATA VISUALIZATION Marc Nienhaus (NVIDIA),

with OpenACC Directives Michael Wolfe michael.wolfe@pgroup.com http://www.pgroup.com/accelerate

Red Hat and the NVIDIA DGX: Tried, Tested, Trusted NVIDIA GTC 2019 Jeremy Eder, Andre Beausoleil,

Cutting Edge Tools and Techniques for Real-Time Rendering with NVIDIA GameWorks David Coombes,

NVIDIA VIDEO TECHNOLOGIES Abhijit Patait, 3/20/2019 NVIDIA Video Technologies Overview Turing

Third quarter 2018 results Delivering a world-class investment case Royal Dutch Shell plc

CRE Markets 2 Mortgage Bankers Association 1 11/9/2017 Mortgage Bankers Association Snapshot

SN SNX1 X1000 00 Report eport Month nthly R Repor eport t on on th the e Emergi

Construction and Real Estate Developments in Morocco Presentation by MEYS Emerging Markets

W O O L W O R T H S H O L D I N G S L I M I T E D 2018 Annual Results 1

REITweek 2018 Investor Conference June 2018 NYSE: ZAYO @ZayoGroup Safe Harbor Information

Australian Employment Projections Carmel ORegan Director Occupational and Industry Analysis

Greece in the European Semester Policy Framework post-Covid19 Presentation by Declan Costello

Austin Marathon and Half Prep & Pump 2015 The Prep, Part 1: Your Mind Lennie Waite