UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability - PowerPoint PPT Presentation

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 – Probability Distributions. 1

Random Variables 2

A random variable X is a numerical (integer, real, complex, vector etc.) summary of the outcome of the random experiment. The range or support of the random variable is the set of possible values that it may take. Random variables are usually denoted by capital letters. 3

A discrete random variable is an integer/real-valued random variable with a finite (or countably infinite) range. A continuous random variable is a real valued random variable with an interval (either finite or infinite) of real numbers for its range. 4

Experiment ⇒ Outcome , ω, from the sample space . X ( ω ) ≡ Random Variable (function of the outcome) . � � � � P X ∈ U = P { ω | X ( ω ) ∈ U} . 5

Example: Dig a hole searching for gold. Ω ≡ all possible outcomes (many ways to define this) . X ≡ Weight of gold found in grams . � � � P X > 20) = P { ω | X ( ω ) ∈ U} with U = { x : x > 20 } . 6

Probability Distributions 7

The probability distribution of a random variable X is a description of the probabilities associated with the possible values of X . There are several common alternative ways to describe the probability distribution, with some differences between discrete and continuous random variables. 8

While not the most popular in practice, a unified way to describe the distribution of any scalar valued random variable X (real or integer) is the cumulative distribution function , F ( x ) = P ( X ≤ x ) . It holds that (1) 0 ≤ F ( x ) ≤ 1. (2) lim x →−∞ F ( x ) = 0. (3) lim x →∞ F ( x ) = 1. (3) If x ≤ y , then F ( x ) ≤ F ( y ). That is, F ( · ) is non-decreasing. 9

Examples to understand:   0 , x < − 1 , 0 , x < 0 ,     F ( x ) = F ( x ) = 0 . 3 , − 1 ≤ x < 1 , x , 0 ≤ x ≤ 1 ,   1 , 1 ≤ x . 1 , 1 ≤ x .   10

Distributions are often summarised by numbers such as the mean , µ , variance , σ 2 , or moments . These numbers, in general do not identify the distribution, but hint at the general location, spread and shape. √ σ 2 and is particularly useful The standard deviation of X is σ = when working with the Normal distribution. More on these soon. 11

Discrete Random Variables 12

Given a discrete random variable X with possible values x 1 , x 2 , . . . , x n , the probability mass function of X is, p ( x ) = P ( X = x ) . Note: In [MonRun2014] and many other sources, the notation used is f ( x ) (as a pdf of a continuous random variable). 13

A probability mass function, p ( x ) satisfies: (1) p ( x i ) ≥ 0. n � (2) p ( x i ) = 1. i =1 The cumulative distribution function of a discrete random variable X , denoted as F ( x ), is � F ( x ) = p ( x i ) . x i ≤ x 14

P ( X = x i ) can be determined from the jump at the value of x . More specifically p ( x i ) = P ( X = x i ) = F ( x i ) − lim x ↑ x i F ( x i ) . 15

Back to the example:  0 , x < − 1 ,   F ( x ) = 0 . 3 , − 1 ≤ x < 1 ,  1 , 1 ≤ x .  What is the pmf? 16

The mean or expected value of a discrete random variable X , is � µ = E ( X ) = x p ( x ) . x 17

The expected value of h ( X ) for some function h ( · ) is: � � � E h ( X ) = h ( x ) p ( x ) . x 18

The k ’th moment of X is, x k p ( x ) . E ( X k ) = � x 19

The variance of X , is σ 2 = V ( X ) = E ( X − µ ) 2 = ( x − µ ) 2 p ( x ) = x 2 p ( x ) − µ 2 . � � x x 20

The Discrete Uniform Distribution 21

A random variable X has a discrete uniform distribution if each of the n values in its range, x 1 , x 2 , . . . , x n , has equal probability. I.e. p ( x i ) = 1 / n . 22

Suppose that X is a discrete uniform random variable on the consecutive integers a , a + 1 , a + 2 , . . . , b , for a ≤ b . The mean and variance of X are V ( X ) = ( b − a + 1) 2 − 1 E ( X ) = b + a and . 2 12 23

To compute the mean and variance of the discrete uniform, use: n n k = n ( n + 1) k 2 = n ( n + 1)(2 n + 1) � � , 2 6 k =1 k =1 24

E ( X ) = � b 1 k = a k b − a +1 = 25

E ( X 2 ) = � b 1 k = a k 2 b − a +1 = 26

The Binomial Distribution 27

The setting of n independent and identical Bernoulli trials is as follows: (1) There are n trials. (1) The trials are independent. (2) Each trial results in only two possible outcomes, labelled as “success” and “failure”. (3) The probability of a success in each trial denoted as p is the same for all trials. 28

Binomial Example: Number of digs finding gold. n = 5 digs in different spots. p = 0 . 1 chance of finding gold in each spot. 29

The random variable X that equals the number of trials that result in a success is a binomial random variable with parameters 0 ≤ p ≤ 1 and n = 1 , 2 , . . . . The probability mass function of X is � n � p x (1 − p ) n − x , p ( x ) = x = 0 , 1 , . . . , n . x 30

Useful to remember from algebra: the binomial expansion for constants a and b is n � n � ( a + b ) n = � a k b n − k . k k =0 31

If X is a binomial random variable with parameters p and n , then, E ( X ) = n p and V ( X ) = n p (1 − p ) . 32

Example (cont.): Number of digs finding gold ( n = 5 , p = 0 . 1): 33

Continuous Random Variables 34

Given a continuous random variable X , the probability density function (pdf) is a function, f ( x ) such that, (1) f ( x ) ≥ 0. (2) f ( x ) = 0 for x not in the range. ∞ � (3) f ( x ) dx = 1. −∞ (4) For small ∆ x , f ( x ) ∆ x ≈ P ( X ∈ [ x , x + ∆ x )). b � (5) P ( a ≤ X ≤ b ) = f ( x ) dx = area under f ( x ) from a to b . a 35

Given the pdf, f ( x ) we can get the cdf as follows: x � F ( x ) = P ( X ≤ x ) = f ( u ) du for − ∞ < x < ∞ . −∞ 36

Given the cdf, F ( x ) we can get the pdf: f ( x ) = d dx F ( x ) . 37

The mean or expected value of a continous random variable X , is ∞ � µ = E ( X ) = x f ( x ) dx . −∞ The expected value of h ( X ) for some function h ( · ) is: ∞ � � � E h ( X ) = h ( x ) f ( x ) dx . −∞ The k ’th moment of X is, ∞ � x k f ( x ) dx . E ( X k ) = −∞ The variance of X , is ∞ � ∞ � σ 2 = V ( X ) = ( x − µ ) 2 f ( x ) dx = x 2 f ( x ) dx − µ 2 . −∞ −∞ 38

Continuous Uniform Distribution 39

A continuous random variable X with probability density function 1 f ( x ) = b − a , a ≤ x ≤ b . is a continuous uniform random variable or “uniform random variable” for short. 40

If X is a continuous uniform random variable over a ≤ x ≤ b , the mean and variance are: σ 2 = V ( X ) = ( b − a ) 2 µ = E ( X ) = a + b and . 2 12 41

The Normal Distribution 42

A random variable X with probability density function − ( x − µ )2 1 f ( x ) = √ , −∞ < x < ∞ , e 2 σ 2 σ 2 π is a normal random variable with parameters µ where −∞ < µ < ∞ , and σ > 0. For this distribution, the parameters map directly to the mean and variance, V ( X ) = σ 2 . E ( X ) = µ and The notation N ( µ, σ 2 ) is used to denote the distribution. Note that some authors and software packages use σ for the second parameter and not σ 2 . 43

A normal random variable with a mean and variance of: σ 2 = 1 µ = 0 and is called a standard normal random variable and is denoted as Z . The cumulative distribution function of a standard normal random variable is denoted as Φ( z ) = F Z ( z ) = P ( Z ≤ z ) , and is tabulated in a table. 44

It is very common to compute P ( a < X < b ) for X ∼ N ( µ, σ 2 ). This is the typical way: P ( a < X < b ) = P ( a − µ < X − µ < b − µ ) � a − µ < X − µ < b − µ � = P σ σ σ � a − µ < Z < b − µ � = P σ σ � b − µ � a − µ � � = Φ − Φ . σ σ We get: � b − µ � a − µ � � F X ( b ) − F X ( a ) = F Z − F Z . σ σ 45

The Exponential Distribution 46

The exponential distribution with parameter λ > 0 is given by the survival function , F ( x ) = 1 − F ( x ) = P ( X > x ) = e − λ x . The random variable X represents the distance between successive events from a Poisson process with mean number of events per unit interval λ > 0. 47

The probability density function of X is f ( x ) = λ e − λ x for 0 ≤ x < ∞ . Note that sometimes a different parameterisation, θ = 1 /λ is used (e.g. in the Julia Distributions package). 48

The mean and variance are: µ = E ( X ) = 1 σ 2 = V ( X ) = 1 and λ 2 λ 49

The exponential distribution is the only continuous distribution with range [0 , ∞ ) exhibiting the lack of memory property . For an exponential random variable X , P ( X > t + s | X > t ) = P ( X > s ) . 50

Monte Carlo Random Variable Generation 51

Monte Carlo simulation makes use of methods to transform a uniform random variable in a manner where it follows an arbitrary given given distribution. One example of this is if U ∼ Uniform(0 , 1) then X = − 1 λ log( U ) is exponentially distributed with parameter λ . 52

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability - PowerPoint PPT Presentation

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions. 1 Random Variables 2 A random variable X is a numerical (integer, real, complex, vector etc.) summary of the outcome of the random experiment. The range or support of

UQ, STAT2201, 2017, Lecture 5 Unit 4 Joint Distributions and Unit 5 Descriptive

UQ, STAT2201, 2017, Lecture 8 (and part of 9). Unit 8 Two Sample Inference. Unit 9

UQ, STAT2201, 2017, Lecture 2, Unit 2, Probability and Monte Carlo. 1 Im willing to bet that

Probability Basics Martin Emms October 1, 2020 Probability Basics Outline Probability

UQ, STAT2201, 2017, Lecture 9. Unit 10 Further Stats Overview 1 The Strength of Conditional

Continuing Probability. Wrap up: Total Probability and Conditional Probability. Continuing

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Probability Basics Probability Background Martin Emms October 1, 2020 Probability Basics

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Unit 2: Probability and distributions Lecture 1: Probability and conditional probability

STAT2201 Analysis of Engineering & Scientific Data Unit 3 Slava Vaisman The University of

UQ, STAT2201, 2017, Lecture 6 Unit 6 Statistical Inference Ideas. 1 Statistical Inference is

UQ, STAT2201, 2017, Lecture 7. Unit 7 Single Sample Inference. 1 Setup: A sample x 1 , . . .

Counting and Probability Whats to come? Counting and Probability Whats to come?

STAT2201 Analysis of Engineering & Scientific Data Unit 7 Slava Vaisman The University of

STAT2201 Analysis of Engineering & Scientific Data Unit 8 Slava Vaisman The University of

12/1/2019 Department of Veterinary and Animal Sciences Advanced Quantitative Methods in Herd

Continuous Distributions 1.8-1.9: Continuous Random Variables 1.10.1: Uniform Distribution

Lecture 1.1: An Introduction to Ordinary Differential Equations Matthew Macauley Department of

Math 1120 Class 1 Dan Barbasch Aug. 23, 2012 Course Website Math 1120 Class 1 Dan Barbasch

01 Foundations Shravan Vasishth SMLP Shravan Vasishth 01 Foundations SMLP 1 / 29 Preview:

Agenda Course 02402 Introduction to Statistics Continuous random variables and distributions 1

Discrete vs. Continuous Data MDM4U: Mathematics of Data Management Recap Identify the discrete

* Equal Contributors Maryland Virginia Tech Colorado

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability - PowerPoint PPT Presentation

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions. 1 Random Variables 2 A random variable X is a numerical (integer, real, complex, vector etc.) summary of the outcome of the random experiment. The range or support of

UQ, STAT2201, 2017, Lecture 5 Unit 4 Joint Distributions and Unit 5 Descriptive

UQ, STAT2201, 2017, Lecture 8 (and part of 9). Unit 8 Two Sample Inference. Unit 9

UQ, STAT2201, 2017, Lecture 2, Unit 2, Probability and Monte Carlo. 1 Im willing to bet that

Probability Basics Martin Emms October 1, 2020 Probability Basics Outline Probability

UQ, STAT2201, 2017, Lecture 9. Unit 10 Further Stats Overview 1 The Strength of Conditional

Continuing Probability. Wrap up: Total Probability and Conditional Probability. Continuing

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Probability Basics Probability Background Martin Emms October 1, 2020 Probability Basics

Chapter 2 Probability 1. Definition of Probability 2. Probability of disjoint events 3.

Unit 2: Probability and distributions Lecture 1: Probability and conditional probability

STAT2201 Analysis of Engineering &amp; Scientific Data Unit 3 Slava Vaisman The University of

UQ, STAT2201, 2017, Lecture 6 Unit 6 Statistical Inference Ideas. 1 Statistical Inference is

UQ, STAT2201, 2017, Lecture 7. Unit 7 Single Sample Inference. 1 Setup: A sample x 1 , . . .

Counting and Probability Whats to come? Counting and Probability Whats to come?

STAT2201 Analysis of Engineering &amp; Scientific Data Unit 7 Slava Vaisman The University of

STAT2201 Analysis of Engineering &amp; Scientific Data Unit 8 Slava Vaisman The University of

12/1/2019 Department of Veterinary and Animal Sciences Advanced Quantitative Methods in Herd

Continuous Distributions 1.8-1.9: Continuous Random Variables 1.10.1: Uniform Distribution

Lecture 1.1: An Introduction to Ordinary Differential Equations Matthew Macauley Department of

Math 1120 Class 1 Dan Barbasch Aug. 23, 2012 Course Website Math 1120 Class 1 Dan Barbasch

01 Foundations Shravan Vasishth SMLP Shravan Vasishth 01 Foundations SMLP 1 / 29 Preview:

Agenda Course 02402 Introduction to Statistics Continuous random variables and distributions 1

Discrete vs. Continuous Data MDM4U: Mathematics of Data Management Recap Identify the discrete

* Equal Contributors Maryland Virginia Tech Colorado

STAT2201 Analysis of Engineering & Scientific Data Unit 3 Slava Vaisman The University of

STAT2201 Analysis of Engineering & Scientific Data Unit 7 Slava Vaisman The University of

STAT2201 Analysis of Engineering & Scientific Data Unit 8 Slava Vaisman The University of