Todd Mytkowicz Kathryn S. McKinley Wikipedia Sensors Big data - PowerPoint PPT Presentation

Uncertain< T > A First-Order Type for Uncertain Data James Bornholt Supervisor: Steve Blackburn

Todd Mytkowicz Kathryn S. McKinley

Wikipedia Sensors Big data Sampson et al. Bishop hidden units z M w (1) w (2) MD KM x D y K inputs outputs y 1 x 1 w (2) z 1 10 x 0 z 0 Approximate computing Machine learning uncertain data

struct ¡Geocoordinate ¡{ ¡ ¡ ¡ ¡ ¡double ¡Latitude; ¡ ¡ ¡ ¡ ¡double ¡Longitude; ¡ } ¡ ¡ Geocoordinate ¡Loc ¡= ¡GetGPSLocation(); ¡ discrete type

uncertain data + discrete type ??? =

uncertain data + discrete type = who cares?

uncertain data + discrete type = uncertainty bug

uncertain data + discrete type = uncertainty bug errors that occur when applications pretend that uncertain data is certain

treating estimates as facts struct ¡Geocoordinate ¡{ ¡ ¡ ¡double ¡Latitude; ¡ ¡ ¡double ¡Longitude; ¡ ¡ ¡ ¡double ¡HorizontalAccuracy; ¡ } ¡ 95% of apps ignore accuracy!

computation compounds error

computation compounds error Usain Bolt

computation compounds error 100 80 Walking speed (km/h) 60 Usain Bolt 40 20 0 Time

false positives in questions if ¡(Speed ¡> ¡60) ¡ ¡IssueSpeedingTicket(); ¡ 60 km/h s = 75 km/h Speed 0 30 60 90 120 Speed (km/h)

uncertainty bugs Treating estimates as facts Computation compounds error False positives in questions Caused by poor programming language abstractions Uncertainty should not be abstracted away

related work Flexible Simple Developer computations Uncertain data Sensors, measurements, probabilistic models

related work Flexible Simple Developer computations No Current abstraction abstractions Uncertain data Sensors, measurements, probabilistic models

related work Flexible Simple Developer computations No Probabilistic Current abstraction programming abstractions Uncertain data Sensors, measurements, probabilistic models

probabilistic programming Reasoning about probabilistic models earthquake ¡= ¡ Bernoulli (0.0001) ¡ burglary ¡ ¡ ¡= ¡ Bernoulli (0.001) ¡ alarm ¡ ¡ ¡ ¡ ¡ ¡= ¡earthquake ¡ or ¡burglary ¡ ¡ if ¡(earthquake) ¡ ¡ ¡phoneWorking ¡= ¡ Bernoulli (0.7) ¡ else ¡ ¡ ¡phoneWorking ¡= ¡ Bernoulli (0.99) ¡

inference earthquake ¡= ¡ Bernoulli (0.0001) ¡ burglary ¡ ¡ ¡= ¡ Bernoulli (0.001) ¡ alarm ¡ ¡ ¡ ¡ ¡ ¡= ¡earthquake ¡ or ¡burglary ¡ if ¡(earthquake) ¡ ¡ ¡phoneWorking ¡= ¡ Bernoulli (0.7) ¡ else ¡ ¡ ¡phoneWorking ¡= ¡ Bernoulli (0.99) ¡ ¡ observe(alarm=true) ¡ query(phoneWorking) ¡ What is Pr[phoneWorking= v | alarm=True], for each possible value of v (i.e. True and False)?

inference is expensive Some paths of execution are very unlikely 150 Pr[earthquake] ● 0.01 Time to query (sec) 0.0001 100 50 ● ● ● ● ● 0 100 200 300 400 500 Number of samples

related work Flexible Simple Developer computations No Probabilistic Current abstraction programming abstractions Probabilistic data Sensors, measurements, probabilistic models

related work Flexible Simple Developer computations No Probabilistic Current Uncertain<T> abstraction programming abstractions Probabilistic data Sensors, measurements, probabilistic models

Uncertain<T> is an uncertain type abstraction. Encapsulates distributions, like prior work. But focuses on an accessible interface. For everyday programmers, Uncertain<T> enables programs that are more concise, expressive, and correct.

using Uncertain<T> Identify the distribution Compute with the distribution Ask questions using conditionals Improve the quality of estimates

identify compute question improve identifying the distribution Many library programmers already know the distribution they need to return!

identify compute question improve identifying the distribution Many library programmers already know the distribution they need to return! “Get the estimated accuracy of this location, in meters. We define accuracy as the radius of 68% confidence. […] In statistical terms, it is assumed that location errors are random with a normal distribution.” —Android

identify compute question improve representing distributions − ( x − µ ) 2 ⇢ � 1 Norm( x ; µ, σ ) = 2 πσ exp √ 2 σ 2

identify compute question improve representing distributions Store probability density functions? − ( x − µ ) 2 ⇢ � 1 Norm( x ; µ, σ ) = 2 πσ exp √ 2 σ 2 Two problems: 1. Even simple operations are complex: Z ∞ f X + Y ( z ) = f Y ( z − x ) f X ( x ) d x −∞ 2. Many interesting distributions don’t have PDFs

identify compute question improve representing distributions Random sampling: two birds with one stone Simple operations are simple (e.g., +) More distributions can be represented Later: how to implement random sampling

identify compute question improve computing with distributions Propagating uncertainty through calculations automatically with operator overloading A key advantage of random sampling: computation is simply* lifting of the original operators

identify compute question improve computing with distributions Propagating uncertainty through calculations automatically with operator overloading A key advantage of random sampling: computation is simply* lifting of the original operators X If x a sample of X Y X+Y and y a sample of Y then x+y a sample of X+Y

identify compute question improve computing with distributions * The caveat is that this only works if the operands are independent If not, we need to know something about how the variables are related This is an issue for all probabilistic programming

identify compute question improve induced dependencies (X,Y independent) A ¡= ¡X ¡+ ¡Y ¡ B ¡= ¡A ¡+ ¡X ¡

identify compute question improve induced dependencies We can distinguish inherent dependencies from programmer-induced dependencies (X,Y independent) A ¡= ¡X ¡+ ¡Y ¡ B ¡= ¡A ¡+ ¡X ¡ When evaluating B, both operands depend on X, so they are not independent Lazy evaluation to the rescue!

identify compute question improve asking questions if ¡(Speed ¡> ¡60) ¡ ¡IssueSpeedingTicket(); ¡ 60 km/h s = 75 km/h Speed 0 30 60 90 120 Speed (km/h)

identify compute question improve comparing means if ¡( Speed.E() ¡> ¡60) ¡ ¡IssueSpeedingTicket(); ¡ E[Speed] 60 km/h 0 30 60 90 120 Speed (km/h)

identify compute question improve comparing evidence if ¡( (Speed ¡> ¡60).E() ¡> ¡0.95 ) ¡ ¡IssueSpeedingTicket(); ¡ 60 km/h Pr[Speed > 60] 0 30 60 90 120 Speed (km/h)

identify compute question improve comparing evidence > is a lifted operator if ¡( (Speed ¡> ¡60).E() ¡> ¡0.95 ) ¡ ¡IssueSpeedingTicket(); ¡ 60 km/h Pr[Speed > 60] 0 30 60 90 120 Speed (km/h)

identify compute question improve comparing evidence type Uncertain<bool> if ¡( (Speed ¡> ¡60).E() ¡> ¡0.95 ) ¡ ¡IssueSpeedingTicket(); ¡ 60 km/h Pr[Speed > 60] 0 30 60 90 120 Speed (km/h)

identify compute question improve comparing evidence mean of Uncertain<bool> if ¡( (Speed ¡> ¡60).E() ¡> ¡0.95 ) ¡ ¡IssueSpeedingTicket(); ¡ 60 km/h Pr[Speed > 60] 0 30 60 90 120 Speed (km/h)

identify compute question improve comparing evidence = number in [0,1] if ¡( (Speed ¡> ¡60).E() ¡> ¡0.95 ) ¡ ¡IssueSpeedingTicket(); ¡ 60 km/h Pr[Speed > 60] 0 30 60 90 120 Speed (km/h)

identify compute question improve comparing evidence % of True instances if ¡( (Speed ¡> ¡60).E() ¡> ¡0.95 ) ¡ ¡IssueSpeedingTicket(); ¡ 60 km/h Pr[Speed > 60] 0 30 60 90 120 Speed (km/h)

identify compute question improve comparing evidence is there a >95% chance that Speed > 60? if ¡( (Speed ¡> ¡60).E() ¡> ¡0.95 ) ¡ ¡IssueSpeedingTicket(); ¡ 60 km/h Pr[Speed > 60] 0 30 60 90 120 Speed (km/h)

Todd Mytkowicz Kathryn S. McKinley Wikipedia Sensors Big data - PowerPoint PPT Presentation

Uncertain< T > A First-Order Type for Uncertain Data James Bornholt Supervisor: Steve Blackburn Todd Mytkowicz Kathryn S. McKinley Wikipedia Sensors Big data Sampson et al. Bishop hidden units z M w (1) w (2) MD KM x D y K inputs

A Bridge To Your Future Transitioning to Middle School McKinley Classical Academy McKinley

Static Stages for Heterogeneous Programming Adrian Sampson, Cornell Kathryn S McKinley, Google

Measuring and Optimizing Tail Latency Kathryn S McKinley, Google CRA-W Undergraduate Town Hall

Probabilistic Assertions Adrian Sampson University of Washington Pavel Panchekha Todd

Hows the Parallel Computing Revolution Going? Towards Parallel, Scalable VM Services Kathryn

BLPC #8 Meeting BLPC #8 Meeting McKinley Elementary School McKinley Elementary School

McKINLEY ELEMENTARY SCHOOL FINAL DESIGN REPORT | 01.22.2015 RENOVATION AND ADDITION 1 McKinley

Linda Rhyne-McKinley Management Analyst Mecklenburg County Child Support LinkedIn Linda

Programming the Internet of Uncertain <T>hings James Bornholt University of Washington Na

Uncertain< T > A First-Order Type for Uncertain Data James Bornholt Australian National

Sensors Approximate computing approximate edge detection Machine learning hidden

Theres Something About Bayes Effective Probabilistic Programming for the Rest of Us James

Presenters: Kathryn Wright, Associate Kathryn.wright@mc-group.com or

Local Voluntary Committee Annual Gathering 2019 Dr Kathryn Scott Chief Executive Welcome and

Agenda Number 3. Kathryn Marquoit Assistant Ombudsman for Public Access Kathryn Marquoit joined

Welcome River Hill High School Principal Kathryn McKinley Graduation Requirements Service

CS 188: Artificial Intelligence Bayes Nets Instructors: Sergey Levine --- University of

Zipper Alarm Blue A Sketch Model Review PROBLEM Frequent travelers do not have a convenient and

Sound Abstraction and Decomposition of Probabilistic Programs Steven Holtzen and Guy Van den

Features Alarms Albert L. Rossi Fermi Na2onal

Burst Photography ! EE367/CS448I: Computational Imaging and Display ! stanford.edu/class/ee367 !

Module 6.1 Memory Access Performance DRAM Bandwidth Objective To learn that memory

BurScale: Using Burstable Instances for Cost-Effective Autoscaling in the Public Cloud Ata Fatahi

What to use for the default supernova burst model(s)? K. Scholberg DAQ Physics Performance WG

Todd Mytkowicz Kathryn S. McKinley Wikipedia Sensors Big data - PowerPoint PPT Presentation

Uncertain< T > A First-Order Type for Uncertain Data James Bornholt Supervisor: Steve Blackburn Todd Mytkowicz Kathryn S. McKinley Wikipedia Sensors Big data Sampson et al. Bishop hidden units z M w (1) w (2) MD KM x D y K inputs

A Bridge To Your Future Transitioning to Middle School McKinley Classical Academy McKinley

Static Stages for Heterogeneous Programming Adrian Sampson, Cornell Kathryn S McKinley, Google

Measuring and Optimizing Tail Latency Kathryn S McKinley, Google CRA-W Undergraduate Town Hall

Probabilistic Assertions Adrian Sampson University of Washington Pavel Panchekha Todd

Hows the Parallel Computing Revolution Going? Towards Parallel, Scalable VM Services Kathryn

BLPC #8 Meeting BLPC #8 Meeting McKinley Elementary School McKinley Elementary School

McKINLEY ELEMENTARY SCHOOL FINAL DESIGN REPORT | 01.22.2015 RENOVATION AND ADDITION 1 McKinley

Linda Rhyne-McKinley Management Analyst Mecklenburg County Child Support LinkedIn Linda

Programming the Internet of Uncertain &lt;T&gt;hings James Bornholt University of Washington Na

Uncertain&lt; T &gt; A First-Order Type for Uncertain Data James Bornholt Australian National

Sensors Approximate computing approximate edge detection Machine learning hidden

Theres Something About Bayes Effective Probabilistic Programming for the Rest of Us James

Presenters: Kathryn Wright, Associate Kathryn.wright@mc-group.com or

Local Voluntary Committee Annual Gathering 2019 Dr Kathryn Scott Chief Executive Welcome and

Agenda Number 3. Kathryn Marquoit Assistant Ombudsman for Public Access Kathryn Marquoit joined

Welcome River Hill High School Principal Kathryn McKinley Graduation Requirements Service

CS 188: Artificial Intelligence Bayes Nets Instructors: Sergey Levine --- University of

Zipper Alarm Blue A Sketch Model Review PROBLEM Frequent travelers do not have a convenient and

Sound Abstraction and Decomposition of Probabilistic Programs Steven Holtzen and Guy Van den

Features Alarms Albert L. Rossi Fermi Na2onal

Burst Photography ! EE367/CS448I: Computational Imaging and Display ! stanford.edu/class/ee367 !

Module 6.1 Memory Access Performance DRAM Bandwidth Objective To learn that memory

BurScale: Using Burstable Instances for Cost-Effective Autoscaling in the Public Cloud Ata Fatahi

What to use for the default supernova burst model(s)? K. Scholberg DAQ Physics Performance WG

Programming the Internet of Uncertain <T>hings James Bornholt University of Washington Na

Uncertain< T > A First-Order Type for Uncertain Data James Bornholt Australian National