with GPUs and Design Leo Meyerovich (@LMeyerov) CEO 1 is: - - PowerPoint PPT Presentation

with gpus and design
SMART_READER_LITE
LIVE PREVIEW

with GPUs and Design Leo Meyerovich (@LMeyerov) CEO 1 is: - - PowerPoint PPT Presentation

Scaling Data Visualization with GPUs and Design Leo Meyerovich (@LMeyerov) CEO 1 is: Supercharging visual analytics through GPU cloud streaming. (We tricky graphs.) The Future of Visual Analysis 3 Not the Future We Were Promised 4


slide-1
SLIDE 1

Scaling Data Visualization with GPUs and Design

Leo Meyerovich (@LMeyerov) CEO

1

slide-2
SLIDE 2

is: Supercharging visual analytics through GPU cloud streaming. (We tricky graphs.)

slide-3
SLIDE 3

3

The Future of Visual Analysis

slide-4
SLIDE 4

4

Not the Future We Were Promised

slide-5
SLIDE 5

5

Ballot Boxes: 100K rows x 30 col CSV

slide-6
SLIDE 6

Stack Towns by Voter Turnout

6

0% 25% 50% 75% 100%

Voter Turnout # Towns Most towns had ~40% people vote ballot box stuffing?

slide-7
SLIDE 7

7

slide-8
SLIDE 8

Opposition Incumbent Tiny square shows town size (area) and vote (color)

8

slide-9
SLIDE 9

Filter for towns w/ high turnout

9

slide-10
SLIDE 10

Tag suspicious with black

10

slide-11
SLIDE 11

11

Analyze suspicious activity in context What parts of the supply chain were hit?

slide-12
SLIDE 12

12

A slider is worth a hundred queries.

slide-13
SLIDE 13

13

A slider is worth a hundred queries.

Challenge: Tools must keep pace with human ingenuity: interact meaningfully and quickly

slide-14
SLIDE 14

DEMO: The Power of Meaningful Layouts

slide-15
SLIDE 15

On a small graph (77 nodes), meaningful design adds some clarity

slide-16
SLIDE 16

CASE STUDY: TWITTER FRAUD Naïve layout on 1K+ node graphs give impenetrable hairballs.

Gauss-Seidel Force-Directed Graph, O(N^2) n-body, GPU

Node: Twitter account Edge: Friendship

Friends and friend-of-friends of a bot who randomly messaged real people and retweeted them.

slide-17
SLIDE 17

With smart layouts, fake account clusters pop out

ForceAtlas2 Layout, O(n log n) n-body, GPU

The spambot is an entrypoint to more bots…

slide-18
SLIDE 18

A quiet small business who buys virtual game currency from gamers…

slide-19
SLIDE 19

Who somehow got exactly 1 message massively trended & advertised by Twitter

slide-20
SLIDE 20

spammer laundering accounts bot retweet network It’s a “retweet laundering” botnet! Tricks Twitter into targeting gamers to check out a cyberfraud site. They steal gamers’ money and identities.

slide-21
SLIDE 21

DEMO: GPUs Enable Exploration

slide-22
SLIDE 22

22

Uber Trips through SF Start to End

slide-23
SLIDE 23

23

Connecting the Dots: OVERPLOTTED!

slide-24
SLIDE 24

24

Uber Trips through SF Start to End

Edge Bundling Reveals Arteries

slide-25
SLIDE 25

25

Uber Trips through SF Start to End

Edge Bundling Reveals Arteries

… But too slow to filter on time, location, demographics, …

slide-26
SLIDE 26

26

Uber Trips through SF Start to End

Edge Bundling Reveals Arteries

DEMO: GPU Acceleration

slide-27
SLIDE 27

Under the Hood: Architecting for GPU Cloud Streaming & Benchmarks

slide-28
SLIDE 28

Thin/Thick is Dead.

Home: broadband Office: GigE

Build thick/BIG.

multicore + GPU multicore + GPU

slide-29
SLIDE 29

Architecting Visual Analytics around thick/BIG (GPU Cloud Streaming)

layout & analytics

cluster, filter, summarize, …

multicore decoder multicore encoder compressed geometry (VBO) rendering engine

zoom, drag, mouseover, …

Portable & predictable. Scalable.

slide-30
SLIDE 30

0.1 1 10 100 500K 1.0M 1.5M Frames per second Graph Size: # Nodes + # Edges

Explore 15X More Data, 60X Faster

10

interactivity threshold

(multicore)

60x faster

Graphistry streaming from AWS G2 Gephi on 2014 MacBook Pro Multiple SNAP datasets

(thick/big)

15X+ bigger data

slide-31
SLIDE 31

Region by pop. GPUs RAM TFLOPS Cost*

SF 1 4GB 2 $0.06/hr

*Calculated as $0.60/hr AWS G2 instance / 10x timesharing

slide-32
SLIDE 32

Region by pop. GPUs RAM TFLOPS Cost*

SF 1 4GB 2 $0.06/hr Bay Area 10 40GB 20 $0.60/hr

*Calculated as $0.60/hr AWS G2 instance / 10x timesharing

slide-33
SLIDE 33

Region by pop. GPUs RAM TFLOPS Cost*

SF 1 4GB 2 $0.06/hr Bay Area 10 40GB 20 $0.60/hr California 100 400GB 200 $6.00/hr

*Calculated as $0.60/hr AWS G2 instance / 10x timesharing

slide-34
SLIDE 34

Region by pop. GPUs RAM TFLOPS Cost*

SF 1 4GB 2 $0.06/hr Bay Area 10 40GB 20 $0.60/hr California 100 400GB 200 $6.00/hr America 1000 4TB 2 PFLOPS $60.00/hr

*Calculated as $0.60/hr AWS G2 instance / 10x timesharing

… less than even

  • ne consultant …
slide-35
SLIDE 35

35

Code less, explore more. New era is thick/BIG: GPU cloud streaming.

slide-36
SLIDE 36

We’re Hiring! Infoviz & frontend (and contact for info on using/embedding)

36

info@graphistry.com Twitter: @LMeyerov