

SLIDE 1

Chapter 2

Discrete Random Variables

Peng-Hua Wang

Graduate Institute of Communication Engineering, National Taipei University

SLIDE 2

Chapter Contents

2.1 Basic Concepts
2.2 Probability Mass Functions
2.3 Functions of Random Variables
2.4 Expectation, Mean, and Variance
2.5 Joint PMFs of Multiple Random Variables
2.6 Conditioning
2.7 Independence
2.8 Summary and Discussion

SLIDE 3

2.1 Basic Concepts

SLIDE 4

Concepts

■ For an experiment, a random variable is a particular number associated with each outcome.

■ “Mathematically, a random variable is a real-valued function of the experimental outcome.”

■ We can assign probabilities to values of a random variable.

■ When do we use random variables?
◆ Outcomes are numerical: dice rolls, stock prices, ...
◆ Outcomes are not numerical, but are associated with some numerical values: the grade point average of a randomly selected student, ...

SLIDE 5

Example

■ A sequence of 3 tosses of a coin.
■ The outcomes are

{HHH, HHT, HTH, HTT, THH, THT, TTH, TTT}.

These length-3 sequences of heads and tails are not random variables. (Why?)

■ The number of heads in the sequence is a random variable.
◆ Let X be the number of heads. We have

X(HHH) = 3, X(HHT) = 2, X(HTH) = 2, X(HTT) = 1,
X(THH) = 2, X(THT) = 1, X(TTH) = 1, X(TTT) = 0,

P(X = 3) = P(HHH),
P(X = 2) = P(HHT) + P(HTH) + P(THH).
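
A minimal Python sketch (mine, not part of the original slides) that treats X literally as a real-valued function on the sample space and tallies the probabilities P(X = x) for a fair coin:

```python
from itertools import product
from fractions import Fraction
from collections import defaultdict

# Sample space: all length-3 sequences of heads (H) and tails (T).
outcomes = ["".join(seq) for seq in product("HT", repeat=3)]

# The random variable X maps each outcome to a number: the count of heads.
def X(outcome):
    return outcome.count("H")

# For a fair coin, each of the 8 outcomes has probability 1/8.
pmf = defaultdict(Fraction)
for outcome in outcomes:
    pmf[X(outcome)] += Fraction(1, 8)

print(dict(pmf))  # {3: 1/8, 2: 3/8, 1: 3/8, 0: 1/8}
```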

SLIDE 6

Example

■ A deterministic function of a random variable is also a random variable.
◆ Let Y = X^2 be the square of X.
◆ P(Y = 4) = P(X = 2) = P(HHT) + P(HTH) + P(THH).

SLIDE 7

More concepts

■ Random variables are real-valued functions of the experimental outcome.
■ Deterministic functions of a random variable are also random variables.
■ Each random variable can be associated with certain “averages”, such as the mean and the variance.
■ A random variable can be conditioned on an event or on another random variable.
■ We can define independence between random variables.

SLIDE 8

More concepts

■ If the range of a random variable (all values that it can take) is either finite or countably infinite, it is called a discrete random variable.
■ If the range of a random variable is uncountably infinite, it is not discrete.
◆ Select a number a from the interval [0, 1].
◆ The random variable X = a^2 is not discrete.
◆ The random variable

Y = \begin{cases} 1, & a \ge 0.5 \\ 0, & a < 0.5 \end{cases}

is discrete.
■ We focus on discrete random variables in this chapter.

SLIDE 9

2.2 Probability Mass Functions

SLIDE 10

PMFs

■ For a discrete random variable X, the probability mass function (PMF) p_X(x) is the probability of the event {X = x}:

p_X(x) = P({X = x})

■ For example, toss two fair coins and let X be the number of heads obtained. The PMF of X is

p_X(0) = 1/4, p_X(1) = 1/2, p_X(2) = 1/4, p_X(x) = 0 otherwise.

SLIDE 11

Basic Properties

■ Let S be the set of all possible values of a random variable X. Then

\sum_{x \in S} p_X(x) = 1.

■ Let A be a set of some values of a random variable X. Then

P(X \in A) = \sum_{x \in A} p_X(x).

■ For example, toss two fair coins and let X be the number of heads. Then

P(X > 0) = p_X(1) + p_X(2) = 3/4.
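
A quick numerical check of both properties (a sketch of mine, not from the slides), using the two-coin PMF above:

```python
from fractions import Fraction

# PMF of X = number of heads in two fair-coin tosses.
pmf = {0: Fraction(1, 4), 1: Fraction(1, 2), 2: Fraction(1, 4)}

# Normalization: the PMF sums to 1 over the whole range S.
assert sum(pmf.values()) == 1

# P(X in A) is the sum of the PMF over A; here A = {x : x > 0}.
p_positive = sum(p for x, p in pmf.items() if x > 0)
print(p_positive)  # 3/4
```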

SLIDE 12

Bernoulli Random Variables

■ A Bernoulli random variable X takes the values 0 and 1 and is defined by

p_X(0) = 1 − p, p_X(1) = p, 0 ≤ p ≤ 1.

■ We can use a Bernoulli rv to model a coin toss: p is the probability of heads, and X = 1 means a head was obtained.

SLIDE 13

Binomial Random Variables

■ A binomial random variable X takes the values X = 0, 1, ..., n and is defined by

p_X(k) = \binom{n}{k} p^k (1-p)^{n-k}, \quad k = 0, 1, ..., n, \quad 0 \le p \le 1.

■ We can use a binomial rv to model the number of heads in n coin tosses: p is the probability of heads, and X = k means k heads were obtained.

■ \sum_{k=0}^{n} \binom{n}{k} p^k (1-p)^{n-k} = 1
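
A short sketch (assumed code, not from the slides) of the binomial PMF using Python's math.comb, with a check that it sums to 1:

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(X = k) for a binomial random variable with parameters n and p."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

n, p = 10, 0.3  # example parameters, chosen arbitrarily
pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]

print(sum(pmf))               # ~1.0, up to floating-point rounding
print(binomial_pmf(3, n, p))  # probability of exactly 3 heads in 10 tosses
```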

SLIDE 14

Geometric Random Variables

■ Toss a coin repeatedly. Let X be the number of tosses until the first head comes up. The PMF of X is

p_X(k) = p(1-p)^{k-1}, \quad k = 1, 2, ..., \quad 0 \le p \le 1.

■ X is called a geometric rv.

■ \sum_{k=1}^{\infty} p_X(k) = \sum_{k=1}^{\infty} p(1-p)^{k-1} = 1.
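
A small simulation sketch (mine, assuming p = 0.25 for illustration) matching the geometric PMF against empirical frequencies:

```python
import random

p = 0.25  # probability of heads on each toss (assumed for illustration)

def geometric_sample(p):
    """Toss until the first head; return the number of tosses needed."""
    k = 1
    while random.random() >= p:  # tails with probability 1 - p
        k += 1
    return k

# Partial sums of the PMF approach 1 as more terms are included.
print(sum(p * (1 - p) ** (k - 1) for k in range(1, 200)))  # ~1.0

# The empirical frequency of {X = 1} should be close to p_X(1) = p.
samples = [geometric_sample(p) for _ in range(100_000)]
print(samples.count(1) / len(samples))  # ~0.25
```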

SLIDE 15

Poisson Random Variables

■ Let λ be the average number of typos per n words, or the “typo rate.” Then p = λ/n is the probability that any given word contains a typo. Let X be the number of typos in n words. We know that X is a binomial rv:

p_X(k) = \binom{n}{k} p^k (1-p)^{n-k}.

■ If n is large but λ remains fixed (i.e., p is very small), we can prove that

p_X(k) = \binom{n}{k} p^k (1-p)^{n-k} \to e^{-\lambda} \frac{\lambda^k}{k!}, \quad k = 0, 1, ...

■ A random variable with this limiting PMF is called a Poisson random variable.
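
A numerical sketch (mine, not from the slides) of this limit: with λ fixed and p = λ/n, the binomial PMF at a fixed k approaches the Poisson PMF as n grows.

```python
from math import comb, exp, factorial

lam, k = 2.0, 3  # typo rate λ and a fixed count k, chosen for illustration

def binom_pmf(k, n, p):
    return comb(n, k) * p**k * (1 - p)**(n - k)

print("Poisson limit:", exp(-lam) * lam**k / factorial(k))

# With p = λ/n, the binomial PMF converges to the Poisson PMF as n grows.
for n in (10, 100, 1000, 10000):
    print(n, binom_pmf(k, n, lam / n))
```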

SLIDE 16

Poisson Random Variables

■ \sum_{k=0}^{\infty} e^{-\lambda} \frac{\lambda^k}{k!} = 1

■ We can use a Poisson rv for modeling:
◆ The number of misspelled words.
◆ The number of cars involved in accidents in a city on a given day.

SLIDE 17

2.3 Functions of Random Variables

SLIDE 18

Functions of Random Variables

■ Let X be a random variable. We can generate another random variable Y by the transformation Y = g(X).

■ If X is a random variable, then Y = g(X) is also a random variable. We can calculate the PMF of Y from the PMF of X:

p_Y(y) = \sum_{\{x \mid y = g(x)\}} p_X(x)

SLIDE 19

Example 2.1

■ Let X be a uniform random variable on the values −4, −3, ..., 3, 4. Find the PMF of X.
■ Let Y = |X|. Find the PMF of Y.
■ Let Z = X^2. Find the PMF of Z.
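
A sketch of Example 2.1 (the code is mine, not the slides'): compute the PMF of Y = g(X) by summing p_X(x) over all x with g(x) = y.

```python
from fractions import Fraction
from collections import defaultdict

# Uniform PMF on the nine values -4, -3, ..., 3, 4.
p_X = {x: Fraction(1, 9) for x in range(-4, 5)}

def pmf_of_function(p_X, g):
    """PMF of Y = g(X): sum p_X(x) over all x with g(x) = y."""
    p_Y = defaultdict(Fraction)
    for x, p in p_X.items():
        p_Y[g(x)] += p
    return dict(p_Y)

print(pmf_of_function(p_X, abs))             # p_Y(0) = 1/9, p_Y(y) = 2/9 for y = 1..4
print(pmf_of_function(p_X, lambda x: x * x)) # p_Z(0) = 1/9, p_Z(z) = 2/9 for z in {1, 4, 9, 16}
```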

SLIDE 20

2.4 Expectation, Mean, and Variance

SLIDE 21

Mean

■ The PMF of a random variable X provides us with all information about X.

■ If we want a one-number summary of X, we can use the expected value, also called the mean, of X, defined by

E[X] = \sum_{x} x\, p_X(x).

■ The expected value is a weighted average of all possible values of X. The weighting coefficients are the corresponding probabilities.

SLIDE 22

Mean

■ The means of some random variables do not exist or, more precisely, are not well-defined.

■ The mean is well-defined if

\sum_{x} |x|\, p_X(x) < \infty.

■ Example: p_X(2^k) = 2^{-k}, k = 1, 2, ... Here the mean is infinite.
■ Example: p_X(2^k) = p_X(-2^k) = 2^{-k}, k = 2, 3, ... This PMF is symmetric, but the positive and negative parts of the sum both diverge, so the mean is not well-defined.

SLIDE 23

Example 2.2

■ Two independent coin tosses, each with probability 3/4 of a head.
■ Let X be the number of heads obtained.
■ X is a binomial random variable with parameters n = 2 and p = 3/4.
■ Find the PMF of X.
■ Find E[X].

SLIDE 24

Mean of Function of a Random Variable

■ Let Y = g(X), where X and Y are random variables.
■ E[X] = \sum_{x} x\, p_X(x).
■ E[Y] = E[g(X)] = \sum_{x} g(x)\, p_X(x).
■ We do not need to calculate the PMF of Y.
■ Example.
◆ Let X be a uniform random variable on −4, −3, ..., 3, 4. Find E[X].
◆ Let Y = |X|. Find E[Y].
◆ Let Z = X^2. Find E[Z].
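
A sketch of the expected value rule (my code, assuming the uniform PMF above): E[g(X)] is computed directly from p_X, without forming the PMF of g(X).

```python
from fractions import Fraction

# Uniform PMF on -4, -3, ..., 3, 4.
p_X = {x: Fraction(1, 9) for x in range(-4, 5)}

def expectation(p_X, g=lambda x: x):
    """Expected value rule: E[g(X)] = sum over x of g(x) * p_X(x)."""
    return sum(g(x) * p for x, p in p_X.items())

print(expectation(p_X))                   # E[X]   = 0
print(expectation(p_X, abs))              # E[|X|] = 20/9
print(expectation(p_X, lambda x: x * x))  # E[X^2] = 20/3
```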

SLIDE 25

Optimality of Mean

■ Let X be a random variable with PMF p_X(x).
■ We want to find a number c that summarizes X. That is, the error between c and the values of X should be minimized.
■ We can use the squared difference between c and the values of X as a measure of error.
■ That is, we should find a constant c that minimizes E[(X − c)^2].
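
A numerical sketch (mine, reusing the uniform PMF from Example 2.1): E[(X − c)^2] equals var(X) + (c − E[X])^2, so any c other than E[X] strictly increases the error.

```python
from fractions import Fraction

p_X = {x: Fraction(1, 9) for x in range(-4, 5)}
mean = sum(x * p for x, p in p_X.items())  # E[X] = 0

def mse(c):
    """Mean squared error E[(X - c)^2] for the candidate summary c."""
    return sum((x - c) ** 2 * p for x, p in p_X.items())

print(mse(mean))      # 20/3: the minimum, i.e., the variance of X
print(mse(mean + 1))  # 23/3: larger by exactly (c - E[X])^2 = 1
```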

SLIDE 26

Optimality of Mean

■ The answer is c = E[X], since E[(X − c)^2] = var(X) + (c − E[X])^2.
■ The corresponding minimized error E[(X − E[X])^2] is called the variance of X.
■ That is, E[X] is the minimum mean-squared error (MMSE) estimate of X, the estimate that achieves the minimum mean-squared error (MSE).

SLIDE 27

Variance

■ Definition.

var(X) = E[(X − E[X])^2]

■ Standard deviation:

\sigma_X = \sqrt{var(X)}

■ In general, E[X^n] is called the nth moment of X. The mean is the first moment.

SLIDE 28

Example 2.3

■ Let X be a uniform random variable on −4, −3, ..., 3, 4. Find var(X).

SLIDE 29

Properties

■ If Y = aX + b, then E[Y] = aE[X] + b and var(Y) = a^2 var(X).
■ var(X) = E[X^2] − (E[X])^2.
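
Both properties can be checked mechanically; a sketch (mine) with the uniform PMF used in the earlier examples and arbitrary constants a and b:

```python
from fractions import Fraction

p_X = {x: Fraction(1, 9) for x in range(-4, 5)}

def E(g):
    return sum(g(x) * p for x, p in p_X.items())

a, b = 3, 7  # arbitrary constants for the check
mean = E(lambda x: x)
var = E(lambda x: x ** 2) - mean ** 2  # var(X) = E[X^2] - (E[X])^2

# Y = aX + b: the shift b moves the mean but drops out of the variance.
mean_Y = E(lambda x: a * x + b)
var_Y = E(lambda x: (a * x + b) ** 2) - mean_Y ** 2
assert mean_Y == a * mean + b
assert var_Y == a ** 2 * var
print(mean_Y, var_Y)  # 7 and 60
```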

SLIDE 30

Example 2.4. Average Speed

■ If the weather is good (with probability 0.6), Alice walks the 2 km to class at a speed of V = 5 km/hour; otherwise she rides her motorcycle at a speed of V = 30 km/hour.
■ What is the mean of the time T to get to class?
■ By the expected value rule, E[T] = E[2/V] = 0.6 · (2/5) + 0.4 · (2/30) = 4/15 hour. Note that this is not the same as 2/E[V].
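
A direct check of this computation (a sketch of mine), which also shows why E[2/V] differs from 2/E[V]:

```python
# Speed (km/h) -> probability, as given in Example 2.4.
scenarios = {5: 0.6, 30: 0.4}

E_T = sum(p * (2 / v) for v, p in scenarios.items())  # expected value rule
E_V = sum(p * v for v, p in scenarios.items())

print(E_T)      # 4/15 ≈ 0.2667 hours (16 minutes)
print(2 / E_V)  # 2/15 ≈ 0.1333 hours: not equal to E[2/V]
```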

SLIDE 31

Example 2.5. Mean and Variance of the Bernoulli

■ Toss a coin that comes up heads with probability p and tails with probability 1 − p. Let X be the associated rv.
■ Find E[X], E[X^2], and var(X).

SLIDE 32

Example 2.6. Discrete Uniform RV

■ What are the mean and variance associated with a roll of a fair six-sided die?

SLIDE 33

Example 2.7. The Mean of the Poisson

■ Find E[X] for a Poisson rv with PMF

p_X(k) = e^{-\lambda} \frac{\lambda^k}{k!}, \quad k = 0, 1, ...

SLIDE 34

2.5 Joint PMFs of Multiple Random Variables

SLIDE 35

Definition

■ Let X and Y be two discrete random variables associated with the same experiment.

■ If (x, y) is a pair of possible values of X and Y, the joint probability mass function p_{X,Y}(x, y) is the probability of the event {X = x, Y = y}:

p_{X,Y}(x, y) = P({X = x, Y = y}) = P({X = x} ∩ {Y = y}) = P({X = x} and {Y = y})

SLIDE 36

Marginal PMFs

■ We can calculate the PMFs of X and Y by

p_X(x) = \sum_{y} p_{X,Y}(x, y), \qquad p_Y(y) = \sum_{x} p_{X,Y}(x, y).

■ We sometimes refer to p_X(x) and p_Y(y) as the marginal PMFs.
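
A sketch (mine) of marginalization over a small joint PMF; the table is an assumed stand-in, since the slides' figure is not reproduced in this transcript:

```python
from fractions import Fraction
from collections import defaultdict

# An assumed joint PMF, stored as {(x, y): probability}.
joint = {(1, 1): Fraction(1, 4), (1, 2): Fraction(1, 4),
         (2, 1): Fraction(1, 8), (2, 2): Fraction(3, 8)}

def marginals(joint):
    """Sum the joint PMF over the other variable to get each marginal."""
    p_X, p_Y = defaultdict(Fraction), defaultdict(Fraction)
    for (x, y), p in joint.items():
        p_X[x] += p  # sum over y
        p_Y[y] += p  # sum over x
    return dict(p_X), dict(p_Y)

print(marginals(joint))  # ({1: 1/2, 2: 1/2}, {1: 3/8, 2: 5/8})
```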

SLIDE 37

Example 2.9

The following figure shows the joint PMF of two RVs X and Y. Find their marginal PMFs.

SLIDE 38

Functions of Multiple RVs

■ Function of multiple RVs:

Z = g(X, Y) \;\Rightarrow\; p_Z(z) = \sum_{\{(x,y) \mid z = g(x,y)\}} p_{X,Y}(x, y)

■ Mean of a function of multiple RVs:

E[g(X, Y)] = \sum_{x} \sum_{y} g(x, y)\, p_{X,Y}(x, y)

■ Linearity:

E[aX + bY + c] = aE[X] + bE[Y] + c
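
A sketch (mine, reusing the assumed joint PMF above) of both rules: the PMF of Z = g(X, Y) by grouping, and E[g(X, Y)] taken directly from the joint PMF.

```python
from fractions import Fraction
from collections import defaultdict

joint = {(1, 1): Fraction(1, 4), (1, 2): Fraction(1, 4),
         (2, 1): Fraction(1, 8), (2, 2): Fraction(3, 8)}

def pmf_of(joint, g):
    """PMF of Z = g(X, Y): group probability by the value of g."""
    p_Z = defaultdict(Fraction)
    for (x, y), p in joint.items():
        p_Z[g(x, y)] += p
    return dict(p_Z)

def E(joint, g):
    """E[g(X, Y)] computed directly from the joint PMF."""
    return sum(g(x, y) * p for (x, y), p in joint.items())

g = lambda x, y: x + 2 * y
print(pmf_of(joint, g))  # PMF of Z = X + 2Y
print(E(joint, g))       # E[Z] = 19/4
```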

SLIDE 39

Example 2.9

The following figure shows the joint PMF of two RVs X and Y. Find the PMF of Z = X + 2Y and E[Z].

SLIDE 40

More than Two Random Variables

■ Joint PMF:

p_{X_1,X_2,...,X_n}(x_1, x_2, ..., x_n) = P({X_1 = x_1, X_2 = x_2, ..., X_n = x_n})

■ Linearity:

E[a_1 X_1 + a_2 X_2 + \cdots + a_n X_n] = a_1 E[X_1] + a_2 E[X_2] + \cdots + a_n E[X_n]

SLIDE 41

Example 2.10. Mean of the Binomial RV

■ Y is a binomial RV with parameters (n, p). Then Y is the sum of n Bernoulli RVs, Y = X_1 + X_2 + \cdots + X_n, where P(X_i = 1) = p for each i.
■ The mean of Y is

E[Y] = E[X_1] + E[X_2] + \cdots + E[X_n] = np.

SLIDE 42

Example 2.11. The Hat Problem.

■ n people throw their hats in a box and then each picks one hat at random.
■ Each hat can be picked by only one person, and each assignment of hats to persons is equally likely.
■ What is the expected value of X, the number of people that get back their own hat?
■ Let the random variable X_i = 1 if the ith person selects his/her own hat, and X_i = 0 otherwise. Then P(X_i = 1) = 1/n and E[X_i] = 1/n.
■ X = X_1 + X_2 + \cdots + X_n, so by linearity E[X] = n · (1/n) = 1. (Linearity does not require the X_i to be independent, and here they are not.)
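
A simulation sketch (mine) of the hat problem; the empirical mean stays near 1 regardless of n:

```python
import random

def hats_returned(n):
    """One trial: a uniformly random assignment of n hats to n people."""
    assignment = list(range(n))
    random.shuffle(assignment)
    return sum(1 for person, hat in enumerate(assignment) if person == hat)

n, trials = 10, 100_000
avg = sum(hats_returned(n) for _ in range(trials)) / trials
print(avg)  # ~1.0, matching E[X] = 1 for every n
```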

SLIDE 43

2.6 Conditioning

SLIDE 44

Conditioning an RV on an Event

■ The conditional PMF of a random variable X, conditioned on a particular event A with P(A) > 0, is defined by

p_{X|A}(x) = P(X = x \mid A) = \frac{P(\{X = x\} \cap A)}{P(A)}

■ The events {X = x} ∩ A are disjoint for distinct x; therefore

P(A) = \sum_{x} P(\{X = x\} \cap A)

SLIDE 45

Example 2.12

■ Let X be the roll of a fair six-sided die and let A be the event that the roll is an even number. Find p_{X|A}(x).
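
A sketch of Example 2.12 (my code): condition the fair-die PMF on A = {2, 4, 6} by restricting and renormalizing.

```python
from fractions import Fraction

p_X = {x: Fraction(1, 6) for x in range(1, 7)}  # fair six-sided die
A = {2, 4, 6}                                   # the roll is even

P_A = sum(p for x, p in p_X.items() if x in A)
p_X_given_A = {x: (p / P_A if x in A else Fraction(0))
               for x, p in p_X.items()}
print(p_X_given_A)  # 1/3 on each of 2, 4, 6; 0 elsewhere
```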

SLIDE 46

Conditioning an RV on Another RV

■ Let X and Y be two random variables associated with the same experiment. The conditional PMF p_{X|Y}(x|y) is defined by

p_{X|Y}(x|y) = P(X = x \mid Y = y) = \frac{P(X = x, Y = y)}{P(Y = y)} = \frac{p_{X,Y}(x, y)}{p_Y(y)}
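
A sketch (mine, again on the assumed joint PMF): conditioning on Y = y just renormalizes the y-slice of the joint PMF.

```python
from fractions import Fraction

joint = {(1, 1): Fraction(1, 4), (1, 2): Fraction(1, 4),
         (2, 1): Fraction(1, 8), (2, 2): Fraction(3, 8)}

def conditional_pmf(joint, y):
    """p_{X|Y}(x | y): restrict the joint PMF to Y = y and renormalize."""
    p_y = sum(p for (_, yy), p in joint.items() if yy == y)
    return {x: p / p_y for (x, yy), p in joint.items() if yy == y}

print(conditional_pmf(joint, 1))  # {1: 2/3, 2: 1/3}
print(conditional_pmf(joint, 2))  # {1: 2/5, 2: 3/5}
```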

SLIDE 47

Visualization

SLIDE 48

Example 2.9 revisited

The following figure shows the joint PMF of two RVs X and Y. Find p_{X|Y}(x|y) and p_{Y|X}(y|x).

SLIDE 49

Conditional Expectation

■ The conditional expectation of X given an event A with P(A) > 0 is defined by

E[X \mid A] = \sum_{x} x\, p_{X|A}(x)

■ For a function g(X),

E[g(X) \mid A] = \sum_{x} g(x)\, p_{X|A}(x)

■ The conditional expectation of X given a value y of Y is defined by

E[X \mid Y = y] = \sum_{x} x\, p_{X|Y}(x|y)

SLIDE 50

Conditional Expectation

■ If A_1, ..., A_n form a partition of the sample space with P(A_i) > 0 for all i, then

E[X] = \sum_{i} E[X \mid A_i]\, P(A_i)

and

E[X \mid B] = \sum_{i} E[X \mid A_i \cap B]\, P(A_i \mid B)

■ “Total expectation theorem”:

E[X] = \sum_{y} E[X \mid Y = y]\, p_Y(y)
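
A sketch (mine) verifying the total expectation theorem on the assumed joint PMF: averaging E[X | Y = y] over p_Y recovers E[X].

```python
from fractions import Fraction
from collections import defaultdict

joint = {(1, 1): Fraction(1, 4), (1, 2): Fraction(1, 4),
         (2, 1): Fraction(1, 8), (2, 2): Fraction(3, 8)}

p_Y = defaultdict(Fraction)
for (x, y), p in joint.items():
    p_Y[y] += p

def E_X_given(y):
    """E[X | Y = y] from the renormalized y-slice of the joint PMF."""
    return sum(x * p / p_Y[y] for (x, yy), p in joint.items() if yy == y)

total = sum(E_X_given(y) * p for y, p in p_Y.items())
direct = sum(x * p for (x, _), p in joint.items())
assert total == direct
print(total)  # E[X] = 3/2 either way
```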

SLIDE 51

Example 2.9 revisited

The following figure shows the joint PMF of two RVs X and Y. Find E[X | X < Y], E[X^2 | X < Y], var(X | X < Y), and E[X | Y].

SLIDE 52

2.7 Independence

SLIDE 53

Independence of a Random Variable from an Event

■ The random variable X is independent of the event A if

P(X = x, A) = P(X = x) P(A) = p_X(x) P(A), for all x.

■ Since P(X = x, A) = p_{X|A}(x) P(A), the definition of independence is equivalent to

p_{X|A}(x) = p_X(x), for all x.

SLIDE 54

Example 2.19

Consider two independent tosses of a fair coin. Let X be the number of heads and let A be the event that the number of heads is even. Are X and A independent?

SLIDE 55

Independence of Random Variables

■ Two random variables X and Y are independent if

p_{X,Y}(x, y) = p_X(x) p_Y(y), for all x, y.

■ Since p_{X,Y}(x, y) = p_{X|Y}(x|y) p_Y(y), the definition of independence is equivalent to

p_{X|Y}(x|y) = p_X(x), for all x, y.

■ If X and Y are independent, then
◆ E[XY] = E[X] E[Y]
◆ E[g(X) h(Y)] = E[g(X)] E[h(Y)]
◆ var(X + Y) = var(X) + var(Y)
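
A sketch (mine, with assumed marginals): building a joint PMF as the product of marginals makes X and Y independent, and E[XY] = E[X] E[Y] follows.

```python
from fractions import Fraction
from itertools import product

# Assumed marginals; independence means the joint is their product.
p_X = {0: Fraction(1, 3), 1: Fraction(2, 3)}
p_Y = {1: Fraction(1, 2), 2: Fraction(1, 2)}
joint = {(x, y): p_X[x] * p_Y[y] for x, y in product(p_X, p_Y)}

E_X = sum(x * p for x, p in p_X.items())
E_Y = sum(y * p for y, p in p_Y.items())
E_XY = sum(x * y * p for (x, y), p in joint.items())

assert E_XY == E_X * E_Y  # holds because the joint PMF factorizes
print(E_X, E_Y, E_XY)     # 2/3, 3/2, 1
```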

SLIDE 56

Independence of Several Random Variables

■ Three random variables X, Y, and Z are independent if

p_{X,Y,Z}(x, y, z) = p_X(x) p_Y(y) p_Z(z), for all x, y, z.

◆ Then f(X), g(Y), and h(Z) are independent.
◆ g(X, Y) and h(Z) are independent.
◆ In general, g(X, Y) and h(Y, Z) are NOT independent.

■ If X_1, X_2, ..., X_n are independent random variables, then

var(X_1 + X_2 + \cdots + X_n) = var(X_1) + var(X_2) + \cdots + var(X_n).

SLIDE 57

Example 2.20. Variance of Binomial RV

■ If Y is a binomial rv with parameters (n, p), then Y = X_1 + X_2 + \cdots + X_n, where X_1, X_2, ..., X_n are independent Bernoulli RVs with parameter p.
■ E[X_k] = p, E[X_k^2] = p, var(X_k) = p − p^2 = p(1 − p).
■ var(Y) = np(1 − p).

SLIDE 58

Variance of Poisson RV

■ If Y is a binomial rv with parameters (n, p), then

E[Y] = np, var(Y) = np(1 − p) = np − np^2.

■ A Poisson RV is the limiting case as n → ∞ and p → 0 with np = λ.
■ In this limiting case, we have

E[Y] = λ, var(Y) = np(1 − p) = λ − pλ → λ.