Probability and Statistics for Computer Science The - PowerPoint PPT Presentation

Probability ¡and ¡Statistics ¡ ì ¡ for ¡Computer ¡Science ¡ ¡ “The ¡weak ¡law ¡of ¡large ¡ number ¡gives ¡us ¡a ¡very ¡ valuable ¡way ¡of ¡thinking ¡ about ¡expecta:ons.” ¡-‑-‑-‑Prof. ¡ Forsythe ¡ ¡ ¡ Credit: ¡wikipedia ¡ Hongye ¡Liu, ¡Teaching ¡Assistant ¡Prof, ¡CS361, ¡UIUC, ¡9.24.2019 ¡

Last ¡time ¡ ✺ Random ¡Variable ¡ ¡ ✺ Expected ¡value ¡ ✺ Variance ¡& ¡covariance ¡ ✺ Towards ¡the ¡weak ¡law ¡of ¡large ¡ numbers ¡

Content ¡ ✺ Random ¡Variable ¡ ¡ ✺ Review ¡with ¡ques>ons ¡ ✺ The ¡weak ¡law ¡of ¡large ¡numbers ¡ ✺ Simula>on ¡& ¡example ¡of ¡airline ¡ overbooking ¡

Content ¡ ✺ Random ¡Variable ¡ ¡ ✺ Review ¡with ¡ques,ons ¡ ✺ The ¡weak ¡law ¡of ¡large ¡numbers ¡ ✺ Simula>on ¡& ¡example ¡of ¡airline ¡ overbooking ¡

Expected ¡value ¡ ✺ The ¡ expected ¡value ¡(or ¡ expecta,on ) ¡ of ¡a ¡random ¡variable ¡ X ¡is ¡ � E [ X ] = xP ( x ) x The ¡expected ¡value ¡is ¡a ¡weighted ¡sum ¡ of ¡the ¡values ¡ X ¡can ¡take ¡ ¡

Linearity ¡of ¡Expectation ¡ ✺ For ¡random ¡variables ¡ X ¡and ¡ Y ¡ and ¡constants ¡k,c ¡ ✺ Scaling ¡property ¡ ¡ E [ kX ] = kE [ X ] ✺ Addi:vity ¡ E [ X + Y ] = E [ X ] + E [ Y ] ✺ And ¡ ¡ E [ kX + c ] = kE [ X ] + c

Expected ¡value ¡of ¡a ¡function ¡of ¡ X ✺ If ¡ f ¡is ¡a ¡func:on ¡of ¡a ¡random ¡ variable ¡ X ¡, ¡then ¡ Y ¡= ¡ f ¡( X ) ¡is ¡a ¡ random ¡variable ¡too ¡ ✺ The ¡expected ¡value ¡of ¡ Y ¡= ¡ f ¡( X ) ¡is ¡ � E [ Y ] = E [ f ( X )] = f ( x ) P ( x ) ¡ x ¡

Q: ¡ What ¡is ¡E[E[X]]? ¡ ¡ ¡ ¡ A. E[X] ¡ B. 0 ¡ C. Can’t ¡be ¡sure ¡

Probability ¡distribution ¡ ✺ Given ¡the ¡random ¡variable ¡ X , ¡ what ¡is ¡ ¡ E[2| X | ¡+1]? ¡ A. ¡ ¡0 ¡ p ( x ) P ( X = x ) B. ¡ ¡1 ¡ C. ¡ ¡2 ¡ D. ¡ ¡3 ¡ 1/2 ¡ E. ¡ ¡5 ¡ 0 ¡ -‑1 ¡ 1 ¡ X

Probability ¡distribution ¡and ¡ cumulative ¡distribution ¡ ✺ Given ¡the ¡random ¡variable ¡ X , ¡ what ¡is ¡ ¡ E[2| X | ¡+1]? ¡ E [ | X | ] = 1 × 1 2 + 1 × 1 2 = 1 E [ X 2 ] = 1 × 1 2 + 1 × 1 ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ 2 = 1 E [2 | X | + 1] = 2 E [ | X | ] + 1 = 3

A ¡neater ¡expression ¡for ¡variance ✺ Variance ¡of ¡Random ¡Variable ¡X ¡is ¡ defined ¡as: ¡ ¡ var [ X ] = E [( X − E [ X ]) 2 ] ¡ ¡ ✺ It’s ¡the ¡same ¡as: ¡ var [ X ] = E [ X 2 ] − E [ X ] 2

Probability ¡distribution ¡and ¡ cumulative ¡distribution ¡ ✺ Given ¡the ¡random ¡variable ¡ X , ¡ what ¡is ¡ ¡ var[2| X | ¡+1]? ¡ A. ¡ ¡0 ¡ p ( x ) P ( X = x ) B. ¡ ¡1 ¡ C. ¡ ¡2 ¡ D. ¡ ¡3 ¡ 1/2 ¡ E. ¡ ¡-‑1 ¡ 0 ¡ -‑1 ¡ 1 ¡ X

Probability ¡distribution ¡and ¡ cumulative ¡distribution ¡ ✺ Given ¡the ¡random ¡variable ¡ X , ¡ what ¡is ¡ ¡ var[2| X | ¡+1]? ¡ E [ | X | ] = 1 × 1 2 + 1 × 1 2 = 1 E [ X 2 ] = 1 × 1 2 + 1 × 1 2 = 1 E [2 | X | + 1] = 2 E [ | X | ] + 1 = 3 var [2 | X | + 1] = E [(2 | X | + 1) 2 ] − ( E [2 | X | + 1]) 2 = E [4 X 2 + 4 | X | + 1] − 3 2 = 4 × 1 + 4 × 1 + 1 − 9 = 0

Probability ¡distribution ¡ ✺ Given ¡the ¡random ¡variable ¡ X , ¡ what ¡is ¡ ¡ var[2| X | ¡+1]? ¡ ¡Let ¡ Y ¡= ¡2| X |+1 ¡ p ( y ) P ( Y = y ) 1 ¡ 0 ¡ 3 ¡ X

A ¡neater ¡form ¡for ¡covariance ¡ ✺ A ¡neater ¡expression ¡for ¡ covariance ¡(similar ¡deriva:on ¡as ¡ for ¡variance) ¡ cov ( X, Y ) = E [ XY ] − E [ X ] E [ Y ]

Correlation ¡coefficient ¡is ¡ normalized ¡ ¡covariance ¡ ✺ The ¡correla:on ¡coefficient ¡is ¡ corr ( X, Y ) = cov ( X, Y ) ¡ σ X σ Y ✺ When ¡ X, Y ¡takes ¡on ¡values ¡with ¡equal ¡ probability ¡to ¡generate ¡data ¡sets ¡ {( x,y )}, ¡the ¡correla:on ¡coefficient ¡will ¡ be ¡as ¡seen ¡in ¡Chapter ¡2. ¡

Q: ¡ ✺ Which ¡of ¡the ¡following ¡is ¡NOT ¡generally ¡ true ¡about ¡two ¡independent ¡random ¡ variables ¡X ¡and ¡Y? ¡ ¡A. ¡E[X+Y] ¡= ¡E[X] ¡+E[Y] ¡ ¡B. ¡var[X+Y] ¡= ¡var[X]+V[Y] ¡ ¡C. ¡E[XY] ¡= ¡E[X]E[Y] ¡ ¡D. ¡corr(X,Y) ¡= ¡0 ¡ ¡E. ¡std[X+Y] ¡= ¡std[X]+std[Y] ¡

Content ¡ ✺ Random ¡Variable ¡ ¡ ✺ Review ¡with ¡ques>ons ¡ ✺ The ¡weak ¡law ¡of ¡large ¡numbers ¡ ✺ Simula>on ¡& ¡example ¡of ¡airline ¡ overbooking ¡

Towards ¡the ¡weak ¡law ¡of ¡large ¡ numbers ¡ ✺ The ¡weak ¡law ¡says ¡that ¡if ¡we ¡repeat ¡an ¡experiment ¡ many ¡:mes, ¡the ¡average ¡of ¡the ¡observa:ons ¡will ¡ “converge” ¡to ¡the ¡expected ¡value ¡ ✺ For ¡example, ¡if ¡you ¡repeat ¡the ¡profit ¡example, ¡the ¡ average ¡earning ¡will ¡“converge” ¡to ¡E[ X ]=20p-‑10 ¡ ¡ ✺ The ¡weak ¡law ¡jus:fies ¡using ¡simula:ons ¡(instead ¡of ¡ calcula:on) ¡ ¡to ¡es:mate ¡the ¡expected ¡values ¡of ¡ random ¡variables ¡

Indicator ¡functions ¡ ✺ An ¡indicator ¡func:on ¡for ¡an ¡event ¡ A ¡is ¡a ¡ func:on ¡of ¡ x ¡such ¡that ¡ � 1 event occurs for the value x ( x ) = [ A ] ¡ 0 otherwise ✺ The ¡expected ¡value ¡of ¡the ¡indicator ¡func:on ¡ is ¡the ¡probability ¡of ¡event ¡ A ¡ E[ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ( x ) ] ¡= ¡ 1×P( A )+0×(1-‑P( A )) ¡= ¡P( A ) ¡ [ A ] ¡

Markov’s ¡inequality ¡ ✺ For ¡any ¡random ¡variable ¡ X ¡and ¡constant ¡ a ¡ >0 ¡ P ( | X | ≥ a ) ≤ E [ | X | ] ¡ a ✺ So, ¡a ¡random ¡variable ¡is ¡unlikely ¡to ¡have ¡the ¡ absolute ¡value ¡much ¡larger ¡than ¡the ¡mean ¡of ¡ its ¡absolute ¡value ¡ ¡ ¡ ✺ For ¡example, ¡if ¡ a ¡= ¡10 ¡E[| X |] ¡ ¡ ¡ ¡ P ( | X | ≥ 10 E [ | X | ]) ≤ 0 . 1 ¡ ¡

Proof ¡of ¡Markov’s ¡inequality ¡ � 1 if | X | ≥ a [ |X| ≥ a ] ¡ ( X ) = 0 otherwise

Proof ¡of ¡Markov’s ¡inequality ¡ � 1 if | X | ≥ a [ |X| ≥ a ] ¡ ( X ) = 0 otherwise ≤ | X | a

Proof ¡of ¡Markov’s ¡inequality ¡ � 1 if | X | ≥ a ( X ) = [ |X| ≥ a ] ¡ 0 otherwise ≤ | X | a ≤ E [ | X | ] E[ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ( X ) ] ¡ [ |X| ≥ a ] ¡ a

Proof ¡of ¡Markov’s ¡inequality ¡ � 1 if | X | ≥ a ( X ) = [ |X| ≥ a ] ¡ 0 otherwise ≤ | X | a ≤ E [ | X | ] E[ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ( X ) ] ¡ [ |X| ≥ a ] ¡ a LHS ¡= ¡ ¡

Proof ¡of ¡Markov’s ¡inequality ¡ � 1 if | X | ≥ a ( X ) = [ |X| ≥ a ] ¡ 0 otherwise ≤ | X | a ≤ E [ | X | ] E[ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ( X ) ] ¡ [ |X| ≥ a ] ¡ a LHS ¡= ¡ ¡ P ( | X | ≥ a )

Proof ¡of ¡Markov’s ¡inequality ¡ � 1 if | X | ≥ a ( X ) = [ |X| ≥ a ] ¡ 0 otherwise ≤ | X | a ≤ E [ | X | ] E[ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ( X ) ] ¡ [ |X| ≥ a ] ¡ a P ( | X | ≥ a ) ≤ E [ | X | ] LHS ¡= ¡ ¡ a

Probability and Statistics for Computer Science The - PowerPoint PPT Presentation

Probability and Statistics for Computer Science The weak law of large number gives us a very valuable way of thinking about

Probability Basics Martin Emms October 1, 2020 Probability Basics Outline Probability

Continuing Probability. Wrap up: Total Probability and Conditional Probability. Continuing

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Probability Basics Probability Background Martin Emms October 1, 2020 Probability Basics

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Categorical Probability and Statistics Peter McCullagh Department of Statistics University of

Counting and Probability Whats to come? Counting and Probability Whats to come?

Unit 2: Probability and distributions Lecture 1: Probability and conditional probability

Which probability Which probability Which probability Which probability theory for cosmology?

Recap of Basic Probability Elements of basic probability theory probability theory The

1 2 3 4 Stopping Probability Visiting Probability 5 Stopping

ACMS 20340 Statistics for Life Sciences Chapter 9: Introducing Probability Why Consider

Statistics 1B Statistics 1B 1 (11) 0. Lecture 1. Introduction and probability review

Statistics 370 Probability and Statistics for Engineers Instructor: Peter Bloomfield Course

Chapter II.2: Basic Probability Theory and Statistics 1. What is a probability? 1.1. Probability

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

Transport for the North Smart Ticketing 2 Prepared for: Transport Focus 13 th April 2016 Our

The Italian Regional Helpdesk System Helpdesk System R. Brunetti INFN-Torino www.eu-egee.org

2019 Mik Mike e Salop Salop Senior Vice President, Investor Relations 2 2 2 Sa Safe e

Apics Cscp Learning System with Instructor Led Student Slides 2015 By Curtis Brewer,Bose Greg P.

Processes @ your Service Using Process Mining to Turn Big Data into Real Value prof.dr.ir. Wil

CSE543 - Introduction to Computer and Network Security Module: Authentication Professor Trent

Data Strategy Discussion Friday, December 11, 2015 Hosted by Lesley Curtis, PhD and Schuyler

Issues in Machine-checking the Decidability of Implicational Ticket Entailment Jeremy Dawson,