Prioritizing Attention in Fast Data: Principles and Promise

Prioritizing Attention in Fast Data: Principles and Promise. Peter Bailis, Edward Gan, Kexin Rong, Sahaana Suri. CIDR 2017.


SLIDE 1

Prioritizing Attention in Fast Data

Principles and Promise

Peter Bailis, Edward Gan, Kexin Rong, Sahaana Suri

CIDR 2017

SLIDE 2

Sahaana Kexin Edward

SLIDE 3

Deepak Firas Tony John Matei

Sahaana Kexin Edward

SLIDE 4

Deepak Firas Tony John Matei

Sahaana Kexin Edward

Sam Ihab Lei Xu

SLIDE 5

abundant data, scarce attention

SLIDE 6

abundant data, scarce attention

data is increasingly too big for manual inspection

SLIDE 7

abundant data, scarce attention

Twitter, LinkedIn, Facebook, Google: log 12M+ events/s

projected 40% year-over-year growth (e.g., via IoT)

data is increasingly too big for manual inspection

SLIDE 8

abundant data, scarce attention

Twitter, LinkedIn, Facebook, Google: log 12M+ events/s

projected 40% year-over-year growth (e.g., via IoT)

today’s operators say: < 6% of this data is ever accessed after ingest

data is increasingly too big for manual inspection

SLIDE 9

abundant data, scarce attention

Twitter, LinkedIn, Facebook, Google: log 12M+ events/s

projected 40% year-over-year growth (e.g., via IoT)

today’s operators say: < 6% of this data is ever accessed after ingest

data is increasingly too big for manual inspection

call this trend “fast data”

SLIDE 10

6%

SLIDE 11

abundant data, scarce attention

e.g., telemetry and metrics from 100k-MM devices

is the application behaving as expected?

SLIDE 12

idea: use a classifier to filter data

classifier

SLIDE 13

idea: use a classifier to filter data

classifier

too much data? filter the stream for “interesting”/useful data

SLIDE 14

basic classifier: static rules

classifier

SLIDE 15

basic classifier: static rules

classifier

SLIDE 16

basic classifier: static rules

classifier

example: power drain > 2W?

SLIDE 17

basic classifier: static rules

classifier

example: power drain > 2W?

pros: scalable, simple

cons: highly brittle, may miss events
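
A static rule of this kind can be sketched in a few lines. The record type and field names below are hypothetical and chosen only for illustration; the 2 W limit is the one from the slide's example.

```python
from dataclasses import dataclass


# Hypothetical record type; the field names are illustrative only.
@dataclass
class Reading:
    device_id: str
    power_drain_watts: float


POWER_DRAIN_LIMIT_WATTS = 2.0  # fixed threshold taken from the slide's example


def is_interesting(reading: Reading) -> bool:
    """Static rule: flag a reading whose power drain exceeds the fixed limit."""
    return reading.power_drain_watts > POWER_DRAIN_LIMIT_WATTS


# Made-up readings for demonstration.
readings = [
    Reading("a", 0.4),
    Reading("b", 2.5),
    Reading("c", 1.9),  # just under the limit, so the rule ignores it
]
flagged = [r.device_id for r in readings if is_interesting(r)]
```

The rule is cheap to evaluate per event, but a device that is abnormal for its own population while staying under 2 W is never flagged, which is the brittleness the slide points out.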

SLIDE 18

better classifier: use ML & statistics

classifier

example: compute statistical likelihood of power activity given user population

SLIDE 19

[figure: distribution with mean µ, marking values more than k standard deviations from µ]

better classifier: use ML & statistics

classifier

example: compute statistical likelihood of power activity given user population

SLIDE 20

[figure: distribution with mean µ, marking values more than k standard deviations from µ]

better classifier: use ML & statistics

classifier

example: compute statistical likelihood of power activity given user population

pros: can model dynamic & complex events

cons: often slow!
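
The "more than k standard deviations from µ" test from the figure can be sketched as follows. This is a minimal illustration on made-up numbers, not the classifier used in the talk; the function name and the choice of k are assumptions.

```python
import statistics


def flag_outliers(values, k=3.0):
    """Return indices of values more than k standard deviations from the mean."""
    mu = statistics.mean(values)
    sigma = statistics.pstdev(values)  # population standard deviation
    if sigma == 0:
        return []  # all values identical: nothing deviates
    return [i for i, v in enumerate(values) if abs(v - mu) > k * sigma]


# Made-up power-drain readings (watts) for a population of devices.
power_drain = [1.0, 1.1, 0.9, 1.0, 1.2, 0.8, 1.0, 1.1, 0.9, 9.0]
outlier_indices = flag_outliers(power_drain, k=2.0)
```

Unlike a fixed threshold, the cutoff here moves with the population. Note that the extreme reading itself inflates both the mean and the standard deviation, so on this sample k = 3 would no longer flag it.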

SLIDE 21

models are expensive to run

e.g., state-of-the-art CNN: 30fps requires $1200 GPU

SLIDE 22

anecdote: speed vs. quality

engineers at major online service monitoring per-device QoS:

off-the-shelf stats packages too slow, not scalable

solution: manually tune thresholds per-user, per-device!

models are expensive to run

e.g., state-of-the-art CNN: 30fps requires $1200 GPU

SLIDE 23

anecdote: speed vs. quality

engineers at major online service monitoring per-device QoS:

off-the-shelf stats packages too slow, not scalable

solution: manually tune thresholds per-user, per-device!

result: brittle, reactive, false negatives

wanted: accurate, scalable classifiers

models are expensive to run

e.g., state-of-the-art CNN: 30fps requires $1200 GPU

SLIDE 24

raw data is still too much

classifier

even filtered data is problematic at scale

  • high volume still overwhelms human attention
  • high-dimensional attributes can obscure trends

SLIDE 25
SLIDE 26

android device types by popularity

SLIDE 27

explanations aggregate results

classify explain

highlight commonalities and trends

SLIDE 28

explanations aggregate results

classify

e.g., Android Galaxy S7 devices running app version 2.4.4 are 51x more likely than usual to have extreme power drain

explain

highlight commonalities and trends

return aggregates and representative events instead of returning raw data
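
One common way to quantify a statement like "51x more likely than usual" is a relative risk ratio: the outlier rate among events that have an attribute combination, divided by the outlier rate among events that do not. Whether the slide's figure was computed exactly this way is an assumption; the function name and the counts below are made up for illustration.

```python
def risk_ratio(outliers_with, inliers_with, outliers_without, inliers_without):
    """Relative risk of being an outlier given an attribute vs. without it."""
    exposed_total = outliers_with + inliers_with
    unexposed_total = outliers_without + inliers_without
    if exposed_total == 0 or unexposed_total == 0:
        raise ValueError("need at least one event with and one without the attribute")
    risk_exposed = outliers_with / exposed_total
    risk_unexposed = outliers_without / unexposed_total
    if risk_unexposed == 0:
        return float("inf")  # the attribute accounts for every outlier
    return risk_exposed / risk_unexposed


# Made-up counts: 100 events carry the attribute combination, 1000 do not.
ratio = risk_ratio(
    outliers_with=50, inliers_with=50,
    outliers_without=10, inliers_without=990,
)
```

With these counts, half of the events carrying the attribute are outliers versus 1% of the rest, so the combination is about 50 times more likely to be associated with an outlier. A single number like this is the kind of aggregate the slide suggests returning in place of the raw events.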

SLIDE 29

the key to fast data

combine: classify and explain

classify explain

SLIDE 30

the key to fast data

combine: classify and explain

classify explain

how should we do it?

SLIDE 31

dataflow (alone) is not enough

dataflow: a substrate, not a complete solution

SLIDE 32

dataflow (alone) is not enough

dataflow: a substrate, not a complete solution

SLIDE 33

dataflow (alone) is not enough

dataflow: a substrate, not a complete solution

missing: scalable, modular operators for prioritizing attention via classification and explanation

SLIDE 34

macrobase: a fast data system

classify explain

a system providing fast, reusable, modular operators for classification and explanation

prioritizing attention in fast data

SLIDE 35

MacroBase default workflow

input: data attributes, key performance metrics

output: attributes that explain deviations in metrics
SLIDE 36
SLIDE 37

correlated attributes

SLIDE 38

key metric

correlated attributes

SLIDE 39

“MacroBase discovered a rare issue with the CMT application and a device-specific battery problem. Consultation and investigation with the CMT team confirmed these issues as previously unknown…”

SLIDE 40

classify explain

key: make this combo fast

SLIDE 41

example: end-to-end optimization

SLIDE 42

example: end-to-end optimization

streaming explanation

[figure: stream of items labeled A B A A A B C D A B D C B B E B B]

SLIDE 43

example: end-to-end optimization

standard solution: find correlations w/in each class

streaming explanation

[figure: stream of items labeled A B A A A B C D A B D C B B E B B]

SLIDE 44

example: end-to-end optimization

standard solution: find correlations w/in each class

streaming explanation

[figure: stream of items labeled A B A A A B C D A B D C B B E B B; frequencies by class: (A: 80%, B: 20%)]

SLIDE 45

example: end-to-end optimization

standard solution: find correlations w/in each class

streaming explanation

[figure: stream of items labeled A B A A A B C D A B D C B B E B B; frequencies by class: (A: 80%, B: 20%) and (A: 0.1%, C: 31.9%, B: 46%, D: 22%)]

SLIDE 46

example: end-to-end optimization

standard solution: find correlations w/in each class

streaming explanation

better idea: exploit cardinality imbalance

correlate “outliers”, probe “inliers”

[figure: stream of items labeled A B A A A B C D A B D C B B E B B; frequencies by class: (A: 80%, B: 20%) and (A: 0.1%, C: 31.9%, B: 46%, D: 22%)]

SLIDE 47

example: end-to-end optimization

standard solution: find correlations w/in each class

streaming explanation

better idea: exploit cardinality imbalance

correlate “outliers”, probe “inliers”

[figure: stream of items labeled A B A A A B C D A B D C B B E B B; frequencies by class: (A: 80%, B: 20%) and (A: 0.1%, C: 31.9%, B: 46%, D: 22%); callout: A: 80%]

SLIDE 48

example: end-to-end optimization

standard solution: find correlations w/in each class

streaming explanation

better idea: exploit cardinality imbalance

correlate “outliers”, probe “inliers”

[figure: stream of items labeled A B A A A B C D A B D C B B E B B; frequencies by class: (A: 80%, B: 20%) and (A: 0.1%, C: 31.9%, B: 46%, D: 22%); callouts: A: 80%, A: 0.1%]
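
The "correlate outliers, probe inliers" idea can be sketched as a two-step count. Reading the slide's two frequency groups as the outlier class (A: 80%, B: 20%) and the inlier class (A: 0.1%, C: 31.9%, B: 46%, D: 22%) is an assumption, as are the function name, the thresholds, and the simple share ratio used for ranking. This is an illustration of the idea, not MacroBase's implementation.

```python
from collections import Counter


def explain_stream(outliers, inliers, min_outlier_support=0.2, min_ratio=3.0):
    """Correlate the small outlier class first, then probe inliers for candidates only."""
    if not outliers or not inliers:
        return {}

    # Step 1: count every value in the (small) outlier class and keep only
    # values that are common enough there to be worth explaining.
    outlier_counts = Counter(outliers)
    candidates = {
        value: count / len(outliers)
        for value, count in outlier_counts.items()
        if count / len(outliers) >= min_outlier_support
    }

    # Step 2: one pass over the (large) inlier class, counting candidates only.
    # Values such as C and D never get a counter because they are not candidates.
    inlier_counts = Counter(v for v in inliers if v in candidates)

    explanations = {}
    for value, outlier_share in candidates.items():
        inlier_share = inlier_counts[value] / len(inliers)
        if inlier_share == 0 or outlier_share / inlier_share >= min_ratio:
            explanations[value] = (outlier_share, inlier_share)
    return explanations


# Made-up data sized to reproduce the percentages shown on the slide.
outliers = ["A"] * 8 + ["B"] * 2
inliers = ["A"] * 1 + ["C"] * 319 + ["B"] * 460 + ["D"] * 220
explanations = explain_stream(outliers, inliers)
```

Because outliers are rare, the candidate set is small, and the expensive pass over the inliers only has to maintain counts for those few values. Here B is common among outliers but even more common among inliers, so only A survives as an explanation.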

SLIDE 49

classify explain

key: make this combo fast

SLIDE 50

classify explain

key: make this combo fast

surprise: this combo enables new optimizations
SLIDE 51

one weird trick for 2017 systems research

1.) read a textbook on statistics/ML
2.) implement the thing that should work
3.) observe it’s really slow
4.) make it fast using systems techniques

needed: classic systems techniques

indexing, caching, predicate pushdown, sketching

SLIDE 52

explain

is this system just a bunch of one-off hacks?

classify

SLIDE 53

explain

is this system just a bunch of one-off hacks?

no! only need a small # of core operators, coupled with domain-specific features

featurize classify

SLIDE 54

explain

is this system just a bunch of one-off hacks?

no! only need a small # of core operators, coupled with domain-specific features

featurize classify

  • st

groupby(video) + CV xform

. . .

MAD

. . .

optical flow

mean

optical flow

mean
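
Assuming "MAD" on this slide refers to the median absolute deviation, a classifier built on it can be sketched as below. The function name, the made-up readings, and the cutoff of 3 are illustrative assumptions rather than details from the talk.

```python
import statistics


def mad_scores(values):
    """Score each value by its distance from the median, in units of the MAD."""
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    if mad == 0:
        # More than half the values equal the median; this simple sketch
        # gives up and reports no deviation rather than dividing by zero.
        return [0.0 for _ in values]
    return [abs(v - med) / mad for v in values]


# Made-up readings with one extreme value.
readings = [1.0, 1.1, 0.9, 1.0, 1.2, 0.8, 1.0, 1.1, 0.9, 9.0]
scores = mad_scores(readings)
flagged = [i for i, s in enumerate(scores) if s > 3.0]  # hypothetical cutoff
```

Because the median and the MAD are barely moved by a few extreme values, the extreme reading stands out clearly here, which is one reason median-based scores are a popular choice for outlier classification.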

SLIDE 55

explain featurize classify

a range of interfaces empowers a range of users:

  • domain experts: point and click UI
  • scripters: custom dataflow pipelines
  • ML and systems ninjas: custom operators
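
As a rough illustration of what a scripted pipeline of modular operators might look like, the sketch below composes three small functions. None of these names come from MacroBase's actual API; the operators, field names, and data are hypothetical stand-ins.

```python
from collections import Counter


def pipeline(*stages):
    """Compose operators left to right into a single callable."""
    def run(data):
        for stage in stages:
            data = stage(data)
        return data
    return run


def featurize(rows):
    """Pair each row with the numeric feature the classifier will look at."""
    return [(row, row["latency_ms"]) for row in rows]


def classify(scored, limit_ms=500.0):
    """Keep rows whose feature exceeds a fixed limit (a stand-in classifier)."""
    return [row for row, value in scored if value > limit_ms]


def explain(outliers):
    """Summarize outliers by container instead of returning raw rows."""
    return Counter(row["container"] for row in outliers).most_common()


# Made-up telemetry rows.
rows = [
    {"container": "c1", "latency_ms": 100.0},
    {"container": "c2", "latency_ms": 900.0},
    {"container": "c2", "latency_ms": 950.0},
    {"container": "c3", "latency_ms": 120.0},
    {"container": "c1", "latency_ms": 700.0},
]
find_slow_containers = pipeline(featurize, classify, explain)
summary = find_slow_containers(rows)
```

Each stage only depends on the shape of its input, so a different featurizer or classifier can be swapped in without touching the others.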

SLIDE 56

users inform design

automotive: monitoring fleet QoS

online services & datacenters (DevOps / monitoring): identifying slow containers, exception telemetry

industrial manufacturing: key sources of process variance in product

geophysics: Lunar water ice detection, seismic activity detection

SLIDE 57

macrobase

  • an open source search engine for fast data
  • modular, efficient classification and explanation

fast data

  • overabundant data, scarce human attention
  • a major opportunity for systems, w/ real use cases

explain featurize classify