Basic Probability Concepts James H. Steiger Department of - - PowerPoint PPT Presentation

basic probability concepts
SMART_READER_LITE
LIVE PREVIEW

Basic Probability Concepts James H. Steiger Department of - - PowerPoint PPT Presentation

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Basic Probability Concepts James H. Steiger Department of Psychology and Human Development Vanderbilt University Multilevel Regression


slide-1
SLIDE 1

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Basic Probability Concepts

James H. Steiger

Department of Psychology and Human Development Vanderbilt University

Multilevel Regression Modeling, 2009

Multilevel Basic Probability Concepts

slide-2
SLIDE 2

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

An Introduction to R

1 Random Variables

Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

2 Probability Distributions

Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

3 Sampling Distributions 4 Confidence Intervals

The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Multilevel Basic Probability Concepts

slide-3
SLIDE 3

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Random Variables

Random Variables The term random variable has a technical definition that we discussed in Psychology 310 For our purposes, it will suffice to consider a random variable to be a random process with numerical outcomes that occur according to a distribution law Example (Uniform (0,1) Random Variable) A random process that generates numbers so that all values between 0 and 1, inclusive, are equally likely to occur is said to have a U(0,1) distribution.

Multilevel Basic Probability Concepts

slide-4
SLIDE 4

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Random Variables

Random Variables The term random variable has a technical definition that we discussed in Psychology 310 For our purposes, it will suffice to consider a random variable to be a random process with numerical outcomes that occur according to a distribution law Example (Uniform (0,1) Random Variable) A random process that generates numbers so that all values between 0 and 1, inclusive, are equally likely to occur is said to have a U(0,1) distribution.

Multilevel Basic Probability Concepts

slide-5
SLIDE 5

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Random Variables

Random Variables The term random variable has a technical definition that we discussed in Psychology 310 For our purposes, it will suffice to consider a random variable to be a random process with numerical outcomes that occur according to a distribution law Example (Uniform (0,1) Random Variable) A random process that generates numbers so that all values between 0 and 1, inclusive, are equally likely to occur is said to have a U(0,1) distribution.

Multilevel Basic Probability Concepts

slide-6
SLIDE 6

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Random Variables

Random Variables The term random variable has a technical definition that we discussed in Psychology 310 For our purposes, it will suffice to consider a random variable to be a random process with numerical outcomes that occur according to a distribution law Example (Uniform (0,1) Random Variable) A random process that generates numbers so that all values between 0 and 1, inclusive, are equally likely to occur is said to have a U(0,1) distribution.

Multilevel Basic Probability Concepts

slide-7
SLIDE 7

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Manifest and Latent Variables

Manifest and Latent Variables In advanced applications, we will refer to manifest and latent random variables A variable is manifest if it can be measured directly A variable is latent if it is an assumed quantity that cannot be measured directly The dividing line between manifest and latent variables is

  • ften rather imprecise

Example (Manifest Variable) Your grade on an exam is a manifest random variable.

Multilevel Basic Probability Concepts

slide-8
SLIDE 8

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Manifest and Latent Variables

Manifest and Latent Variables In advanced applications, we will refer to manifest and latent random variables A variable is manifest if it can be measured directly A variable is latent if it is an assumed quantity that cannot be measured directly The dividing line between manifest and latent variables is

  • ften rather imprecise

Example (Manifest Variable) Your grade on an exam is a manifest random variable.

Multilevel Basic Probability Concepts

slide-9
SLIDE 9

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Manifest and Latent Variables

Manifest and Latent Variables In advanced applications, we will refer to manifest and latent random variables A variable is manifest if it can be measured directly A variable is latent if it is an assumed quantity that cannot be measured directly The dividing line between manifest and latent variables is

  • ften rather imprecise

Example (Manifest Variable) Your grade on an exam is a manifest random variable.

Multilevel Basic Probability Concepts

slide-10
SLIDE 10

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Manifest and Latent Variables

Manifest and Latent Variables In advanced applications, we will refer to manifest and latent random variables A variable is manifest if it can be measured directly A variable is latent if it is an assumed quantity that cannot be measured directly The dividing line between manifest and latent variables is

  • ften rather imprecise

Example (Manifest Variable) Your grade on an exam is a manifest random variable.

Multilevel Basic Probability Concepts

slide-11
SLIDE 11

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Manifest and Latent Variables

Manifest and Latent Variables In advanced applications, we will refer to manifest and latent random variables A variable is manifest if it can be measured directly A variable is latent if it is an assumed quantity that cannot be measured directly The dividing line between manifest and latent variables is

  • ften rather imprecise

Example (Manifest Variable) Your grade on an exam is a manifest random variable.

Multilevel Basic Probability Concepts

slide-12
SLIDE 12

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Manifest and Latent Variables

Manifest and Latent Variables In advanced applications, we will refer to manifest and latent random variables A variable is manifest if it can be measured directly A variable is latent if it is an assumed quantity that cannot be measured directly The dividing line between manifest and latent variables is

  • ften rather imprecise

Example (Manifest Variable) Your grade on an exam is a manifest random variable.

Multilevel Basic Probability Concepts

slide-13
SLIDE 13

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Continuous and Discrete Random Variables

A continuous random variable has an uncountably infinite number of possible outcomes because it can take on all values over some range of the number line A discrete random variable takes on only a countable number of discrete outcomes As we saw in Psychology 310, discrete random variables can assign a probability to a particular numerical outcome, while continuous random variables cannot Example (Discrete Random Variable) Suppose you assign the number 1 to all people born male, and 2 to all people born female. This random variable is discrete, because it takes on only the values 1 and 2.

Multilevel Basic Probability Concepts

slide-14
SLIDE 14

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Continuous and Discrete Random Variables

A continuous random variable has an uncountably infinite number of possible outcomes because it can take on all values over some range of the number line A discrete random variable takes on only a countable number of discrete outcomes As we saw in Psychology 310, discrete random variables can assign a probability to a particular numerical outcome, while continuous random variables cannot Example (Discrete Random Variable) Suppose you assign the number 1 to all people born male, and 2 to all people born female. This random variable is discrete, because it takes on only the values 1 and 2.

Multilevel Basic Probability Concepts

slide-15
SLIDE 15

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Continuous and Discrete Random Variables

A continuous random variable has an uncountably infinite number of possible outcomes because it can take on all values over some range of the number line A discrete random variable takes on only a countable number of discrete outcomes As we saw in Psychology 310, discrete random variables can assign a probability to a particular numerical outcome, while continuous random variables cannot Example (Discrete Random Variable) Suppose you assign the number 1 to all people born male, and 2 to all people born female. This random variable is discrete, because it takes on only the values 1 and 2.

Multilevel Basic Probability Concepts

slide-16
SLIDE 16

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Continuous and Discrete Random Variables

A continuous random variable has an uncountably infinite number of possible outcomes because it can take on all values over some range of the number line A discrete random variable takes on only a countable number of discrete outcomes As we saw in Psychology 310, discrete random variables can assign a probability to a particular numerical outcome, while continuous random variables cannot Example (Discrete Random Variable) Suppose you assign the number 1 to all people born male, and 2 to all people born female. This random variable is discrete, because it takes on only the values 1 and 2.

Multilevel Basic Probability Concepts

slide-17
SLIDE 17

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Informal Definition Manifest and Latent Random Variables Continuous and Discrete Random Variables

Continuous and Discrete Random Variables

A continuous random variable has an uncountably infinite number of possible outcomes because it can take on all values over some range of the number line A discrete random variable takes on only a countable number of discrete outcomes As we saw in Psychology 310, discrete random variables can assign a probability to a particular numerical outcome, while continuous random variables cannot Example (Discrete Random Variable) Suppose you assign the number 1 to all people born male, and 2 to all people born female. This random variable is discrete, because it takes on only the values 1 and 2.

Multilevel Basic Probability Concepts

slide-18
SLIDE 18

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

Using Probability Distributions Probability distributions are frequently used to provide succinct models for quantities of scientific interest We observe distributions of data, and assess how well the distributions conform to the specified model While observing the distribution of the data, we may hypothesize the general family of the distribution, but leave

  • pen the question of the values of the parameters

In that case, we talk of free parameters to be estimated

Multilevel Basic Probability Concepts

slide-19
SLIDE 19

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

Using Probability Distributions Probability distributions are frequently used to provide succinct models for quantities of scientific interest We observe distributions of data, and assess how well the distributions conform to the specified model While observing the distribution of the data, we may hypothesize the general family of the distribution, but leave

  • pen the question of the values of the parameters

In that case, we talk of free parameters to be estimated

Multilevel Basic Probability Concepts

slide-20
SLIDE 20

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

Using Probability Distributions Probability distributions are frequently used to provide succinct models for quantities of scientific interest We observe distributions of data, and assess how well the distributions conform to the specified model While observing the distribution of the data, we may hypothesize the general family of the distribution, but leave

  • pen the question of the values of the parameters

In that case, we talk of free parameters to be estimated

Multilevel Basic Probability Concepts

slide-21
SLIDE 21

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

Using Probability Distributions Probability distributions are frequently used to provide succinct models for quantities of scientific interest We observe distributions of data, and assess how well the distributions conform to the specified model While observing the distribution of the data, we may hypothesize the general family of the distribution, but leave

  • pen the question of the values of the parameters

In that case, we talk of free parameters to be estimated

Multilevel Basic Probability Concepts

slide-22
SLIDE 22

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

Using Probability Distributions Probability distributions are frequently used to provide succinct models for quantities of scientific interest We observe distributions of data, and assess how well the distributions conform to the specified model While observing the distribution of the data, we may hypothesize the general family of the distribution, but leave

  • pen the question of the values of the parameters

In that case, we talk of free parameters to be estimated

Multilevel Basic Probability Concepts

slide-23
SLIDE 23

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

More Complex Applications

Using Probability Distributions In more complex applications, such as multilevel modeling, we may model data emanating from a particular distribution family at one level (say kids within a school) At another level, we might model the parameters for the schools as having a distribution across schools For example, we might hypothesize that the parameters across schools have a normal distribution In that case, the size of the variance of that distribution would indicate how much the schools show variation on a particular characteristic In the slides that follow, we shall examine some of the more useful distributions we will encounter early in the course

Multilevel Basic Probability Concepts

slide-24
SLIDE 24

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

More Complex Applications

Using Probability Distributions In more complex applications, such as multilevel modeling, we may model data emanating from a particular distribution family at one level (say kids within a school) At another level, we might model the parameters for the schools as having a distribution across schools For example, we might hypothesize that the parameters across schools have a normal distribution In that case, the size of the variance of that distribution would indicate how much the schools show variation on a particular characteristic In the slides that follow, we shall examine some of the more useful distributions we will encounter early in the course

Multilevel Basic Probability Concepts

slide-25
SLIDE 25

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

More Complex Applications

Using Probability Distributions In more complex applications, such as multilevel modeling, we may model data emanating from a particular distribution family at one level (say kids within a school) At another level, we might model the parameters for the schools as having a distribution across schools For example, we might hypothesize that the parameters across schools have a normal distribution In that case, the size of the variance of that distribution would indicate how much the schools show variation on a particular characteristic In the slides that follow, we shall examine some of the more useful distributions we will encounter early in the course

Multilevel Basic Probability Concepts

slide-26
SLIDE 26

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

More Complex Applications

Using Probability Distributions In more complex applications, such as multilevel modeling, we may model data emanating from a particular distribution family at one level (say kids within a school) At another level, we might model the parameters for the schools as having a distribution across schools For example, we might hypothesize that the parameters across schools have a normal distribution In that case, the size of the variance of that distribution would indicate how much the schools show variation on a particular characteristic In the slides that follow, we shall examine some of the more useful distributions we will encounter early in the course

Multilevel Basic Probability Concepts

slide-27
SLIDE 27

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

More Complex Applications

Using Probability Distributions In more complex applications, such as multilevel modeling, we may model data emanating from a particular distribution family at one level (say kids within a school) At another level, we might model the parameters for the schools as having a distribution across schools For example, we might hypothesize that the parameters across schools have a normal distribution In that case, the size of the variance of that distribution would indicate how much the schools show variation on a particular characteristic In the slides that follow, we shall examine some of the more useful distributions we will encounter early in the course

Multilevel Basic Probability Concepts

slide-28
SLIDE 28

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Using Probability Distributions

More Complex Applications

Using Probability Distributions In more complex applications, such as multilevel modeling, we may model data emanating from a particular distribution family at one level (say kids within a school) At another level, we might model the parameters for the schools as having a distribution across schools For example, we might hypothesize that the parameters across schools have a normal distribution In that case, the size of the variance of that distribution would indicate how much the schools show variation on a particular characteristic In the slides that follow, we shall examine some of the more useful distributions we will encounter early in the course

Multilevel Basic Probability Concepts

slide-29
SLIDE 29

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Normal Distribution

The Normal Distribution The normal distribution is a widely used continuous distribution The normal distribution family is a two-parameter family Each normal distribution is characterized by two parameters, the mean µ and the standard deviation σ. Shaped like a bell, the normal pdf is sometimes referred to as the bell curve The central limit theorem, discussed on pages 13–14 of Gelman & Hill, explains why many quantities have a distribution that is approximately normal The normal distribution family is closed under linear transformations, i.e., any normal distribution may be transformed into any other normal distribution by a linear transformation

Multilevel Basic Probability Concepts

slide-30
SLIDE 30

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Normal Distribution

The Normal Distribution The normal distribution is a widely used continuous distribution The normal distribution family is a two-parameter family Each normal distribution is characterized by two parameters, the mean µ and the standard deviation σ. Shaped like a bell, the normal pdf is sometimes referred to as the bell curve The central limit theorem, discussed on pages 13–14 of Gelman & Hill, explains why many quantities have a distribution that is approximately normal The normal distribution family is closed under linear transformations, i.e., any normal distribution may be transformed into any other normal distribution by a linear transformation

Multilevel Basic Probability Concepts

slide-31
SLIDE 31

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Normal Distribution

The Normal Distribution The normal distribution is a widely used continuous distribution The normal distribution family is a two-parameter family Each normal distribution is characterized by two parameters, the mean µ and the standard deviation σ. Shaped like a bell, the normal pdf is sometimes referred to as the bell curve The central limit theorem, discussed on pages 13–14 of Gelman & Hill, explains why many quantities have a distribution that is approximately normal The normal distribution family is closed under linear transformations, i.e., any normal distribution may be transformed into any other normal distribution by a linear transformation

Multilevel Basic Probability Concepts

slide-32
SLIDE 32

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Normal Distribution

The Normal Distribution The normal distribution is a widely used continuous distribution The normal distribution family is a two-parameter family Each normal distribution is characterized by two parameters, the mean µ and the standard deviation σ. Shaped like a bell, the normal pdf is sometimes referred to as the bell curve The central limit theorem, discussed on pages 13–14 of Gelman & Hill, explains why many quantities have a distribution that is approximately normal The normal distribution family is closed under linear transformations, i.e., any normal distribution may be transformed into any other normal distribution by a linear transformation

Multilevel Basic Probability Concepts

slide-33
SLIDE 33

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Normal Distribution

The Normal Distribution The normal distribution is a widely used continuous distribution The normal distribution family is a two-parameter family Each normal distribution is characterized by two parameters, the mean µ and the standard deviation σ. Shaped like a bell, the normal pdf is sometimes referred to as the bell curve The central limit theorem, discussed on pages 13–14 of Gelman & Hill, explains why many quantities have a distribution that is approximately normal The normal distribution family is closed under linear transformations, i.e., any normal distribution may be transformed into any other normal distribution by a linear transformation

Multilevel Basic Probability Concepts

slide-34
SLIDE 34

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Normal Distribution

The Normal Distribution The normal distribution is a widely used continuous distribution The normal distribution family is a two-parameter family Each normal distribution is characterized by two parameters, the mean µ and the standard deviation σ. Shaped like a bell, the normal pdf is sometimes referred to as the bell curve The central limit theorem, discussed on pages 13–14 of Gelman & Hill, explains why many quantities have a distribution that is approximately normal The normal distribution family is closed under linear transformations, i.e., any normal distribution may be transformed into any other normal distribution by a linear transformation

Multilevel Basic Probability Concepts

slide-35
SLIDE 35

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Normal Distribution

The Normal Distribution The normal distribution is a widely used continuous distribution The normal distribution family is a two-parameter family Each normal distribution is characterized by two parameters, the mean µ and the standard deviation σ. Shaped like a bell, the normal pdf is sometimes referred to as the bell curve The central limit theorem, discussed on pages 13–14 of Gelman & Hill, explains why many quantities have a distribution that is approximately normal The normal distribution family is closed under linear transformations, i.e., any normal distribution may be transformed into any other normal distribution by a linear transformation

Multilevel Basic Probability Concepts

slide-36
SLIDE 36

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Multivariate Normal Distribution

The Multivariate Normal Distribution The multivariate normal distribution is a continuous multivariate distribution having two matrix parameters, the vector of means µ and the covariance matrix Σ Any linear combination of multi-normal variables has a normal distribution As we saw in Psychology 310, the mean and variance of the linear combination is determined by µ, Σ, and the linear weights

Multilevel Basic Probability Concepts

slide-37
SLIDE 37

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Multivariate Normal Distribution

The Multivariate Normal Distribution The multivariate normal distribution is a continuous multivariate distribution having two matrix parameters, the vector of means µ and the covariance matrix Σ Any linear combination of multi-normal variables has a normal distribution As we saw in Psychology 310, the mean and variance of the linear combination is determined by µ, Σ, and the linear weights

Multilevel Basic Probability Concepts

slide-38
SLIDE 38

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Multivariate Normal Distribution

The Multivariate Normal Distribution The multivariate normal distribution is a continuous multivariate distribution having two matrix parameters, the vector of means µ and the covariance matrix Σ Any linear combination of multi-normal variables has a normal distribution As we saw in Psychology 310, the mean and variance of the linear combination is determined by µ, Σ, and the linear weights

Multilevel Basic Probability Concepts

slide-39
SLIDE 39

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Multivariate Normal Distribution

The Multivariate Normal Distribution The multivariate normal distribution is a continuous multivariate distribution having two matrix parameters, the vector of means µ and the covariance matrix Σ Any linear combination of multi-normal variables has a normal distribution As we saw in Psychology 310, the mean and variance of the linear combination is determined by µ, Σ, and the linear weights

Multilevel Basic Probability Concepts

slide-40
SLIDE 40

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Lognormal Distribution

The Lognormal Distribution If X is normally distributed, then y = ex is said to have a lognormal distribution. If Y is lognormally distributed, the logarithm of Y has a normal distribution In R, dlnorm gives the density, plnorm gives the distribution function, qlnorm gives the quantile function, and rlnorm generates random deviates

Multilevel Basic Probability Concepts

slide-41
SLIDE 41

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Lognormal Distribution

Some Basic Facts The Lognormal Distribution It is common, when referring to a normal distribution, to use the abbreviations N (µ, σ) or N (µ, σ2). It is important to realize that, when referring to a lognormal distribution for a variable Y , the convention is to refer to the parameters µ and σ from the corresponding normal variable X = ln(Y ) In this case, the actual mean and variance of Y are not µ and σ2, but rather are E(Y ) = eµ+ 1

2σ2,

Var(Y ) = (eσ2 − 1)e2µ+σ2

Multilevel Basic Probability Concepts

slide-42
SLIDE 42

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Lognormal Distribution

Some Basic Facts The Lognormal Distribution It is common, when referring to a normal distribution, to use the abbreviations N (µ, σ) or N (µ, σ2). It is important to realize that, when referring to a lognormal distribution for a variable Y , the convention is to refer to the parameters µ and σ from the corresponding normal variable X = ln(Y ) In this case, the actual mean and variance of Y are not µ and σ2, but rather are E(Y ) = eµ+ 1

2σ2,

Var(Y ) = (eσ2 − 1)e2µ+σ2

Multilevel Basic Probability Concepts

slide-43
SLIDE 43

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Lognormal Distribution

Some Basic Facts The Lognormal Distribution It is common, when referring to a normal distribution, to use the abbreviations N (µ, σ) or N (µ, σ2). It is important to realize that, when referring to a lognormal distribution for a variable Y , the convention is to refer to the parameters µ and σ from the corresponding normal variable X = ln(Y ) In this case, the actual mean and variance of Y are not µ and σ2, but rather are E(Y ) = eµ+ 1

2σ2,

Var(Y ) = (eσ2 − 1)e2µ+σ2

Multilevel Basic Probability Concepts

slide-44
SLIDE 44

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Lognormal Distribution

Some Basic Facts The Lognormal Distribution It is common, when referring to a normal distribution, to use the abbreviations N (µ, σ) or N (µ, σ2). It is important to realize that, when referring to a lognormal distribution for a variable Y , the convention is to refer to the parameters µ and σ from the corresponding normal variable X = ln(Y ) In this case, the actual mean and variance of Y are not µ and σ2, but rather are E(Y ) = eµ+ 1

2σ2,

Var(Y ) = (eσ2 − 1)e2µ+σ2

Multilevel Basic Probability Concepts

slide-45
SLIDE 45

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Lognormal Distribution

Example (The Lognormal Distribution) Here is a picture comparing the lognormal and corresponding normal distribution.

−3 −2 −1 1 2 3 0.0 0.2 0.4 0.6 0.8 1.0 x f(x)

Multilevel Basic Probability Concepts

slide-46
SLIDE 46

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Lognormal Distribution

Applications

Applications of the Lognormal When independent processes combine multiplicatively, the result can be lognormally distributed For a detailed and entertaining discussion of the lognormal distribution, see the article by Limpert, Stahel, and Abbt (2001) in the reading list

Multilevel Basic Probability Concepts

slide-47
SLIDE 47

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Lognormal Distribution

Applications

Applications of the Lognormal When independent processes combine multiplicatively, the result can be lognormally distributed For a detailed and entertaining discussion of the lognormal distribution, see the article by Limpert, Stahel, and Abbt (2001) in the reading list

Multilevel Basic Probability Concepts

slide-48
SLIDE 48

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Lognormal Distribution

Applications

Applications of the Lognormal When independent processes combine multiplicatively, the result can be lognormally distributed For a detailed and entertaining discussion of the lognormal distribution, see the article by Limpert, Stahel, and Abbt (2001) in the reading list

Multilevel Basic Probability Concepts

slide-49
SLIDE 49

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Binomial Distribution

The Binomial Distribution This discrete distribution is one of the foundations of modern categorical data analysis The binomial random variable X represents the number of “successes” in N outcomes of a binomial process A binomial process is characterized by

N independent trials Only two outcomes, arbitrarily designated “success” and “failure” Probabilities of success and failure remain constant over trials

Many interesting real world processes only approximately meet the above specifications Nevertheless, the binomial is often an excellent approximation

Multilevel Basic Probability Concepts

slide-50
SLIDE 50

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Binomial Distribution

The Binomial Distribution This discrete distribution is one of the foundations of modern categorical data analysis The binomial random variable X represents the number of “successes” in N outcomes of a binomial process A binomial process is characterized by

N independent trials Only two outcomes, arbitrarily designated “success” and “failure” Probabilities of success and failure remain constant over trials

Many interesting real world processes only approximately meet the above specifications Nevertheless, the binomial is often an excellent approximation

Multilevel Basic Probability Concepts

slide-51
SLIDE 51

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Binomial Distribution

The Binomial Distribution This discrete distribution is one of the foundations of modern categorical data analysis The binomial random variable X represents the number of “successes” in N outcomes of a binomial process A binomial process is characterized by

N independent trials Only two outcomes, arbitrarily designated “success” and “failure” Probabilities of success and failure remain constant over trials

Many interesting real world processes only approximately meet the above specifications Nevertheless, the binomial is often an excellent approximation

Multilevel Basic Probability Concepts

slide-52
SLIDE 52

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Binomial Distribution

The Binomial Distribution This discrete distribution is one of the foundations of modern categorical data analysis The binomial random variable X represents the number of “successes” in N outcomes of a binomial process A binomial process is characterized by

N independent trials Only two outcomes, arbitrarily designated “success” and “failure” Probabilities of success and failure remain constant over trials

Many interesting real world processes only approximately meet the above specifications Nevertheless, the binomial is often an excellent approximation

Multilevel Basic Probability Concepts

slide-53
SLIDE 53

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Binomial Distribution

The Binomial Distribution This discrete distribution is one of the foundations of modern categorical data analysis The binomial random variable X represents the number of “successes” in N outcomes of a binomial process A binomial process is characterized by

N independent trials Only two outcomes, arbitrarily designated “success” and “failure” Probabilities of success and failure remain constant over trials

Many interesting real world processes only approximately meet the above specifications Nevertheless, the binomial is often an excellent approximation

Multilevel Basic Probability Concepts

slide-54
SLIDE 54

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Binomial Distribution

The Binomial Distribution This discrete distribution is one of the foundations of modern categorical data analysis The binomial random variable X represents the number of “successes” in N outcomes of a binomial process A binomial process is characterized by

N independent trials Only two outcomes, arbitrarily designated “success” and “failure” Probabilities of success and failure remain constant over trials

Many interesting real world processes only approximately meet the above specifications Nevertheless, the binomial is often an excellent approximation

Multilevel Basic Probability Concepts

slide-55
SLIDE 55

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Binomial Distribution

The Binomial Distribution This discrete distribution is one of the foundations of modern categorical data analysis The binomial random variable X represents the number of “successes” in N outcomes of a binomial process A binomial process is characterized by

N independent trials Only two outcomes, arbitrarily designated “success” and “failure” Probabilities of success and failure remain constant over trials

Many interesting real world processes only approximately meet the above specifications Nevertheless, the binomial is often an excellent approximation

Multilevel Basic Probability Concepts

slide-56
SLIDE 56

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Binomial Distribution

The Binomial Distribution This discrete distribution is one of the foundations of modern categorical data analysis The binomial random variable X represents the number of “successes” in N outcomes of a binomial process A binomial process is characterized by

N independent trials Only two outcomes, arbitrarily designated “success” and “failure” Probabilities of success and failure remain constant over trials

Many interesting real world processes only approximately meet the above specifications Nevertheless, the binomial is often an excellent approximation

Multilevel Basic Probability Concepts

slide-57
SLIDE 57

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Binomial Distribution

The Binomial Distribution This discrete distribution is one of the foundations of modern categorical data analysis The binomial random variable X represents the number of “successes” in N outcomes of a binomial process A binomial process is characterized by

N independent trials Only two outcomes, arbitrarily designated “success” and “failure” Probabilities of success and failure remain constant over trials

Many interesting real world processes only approximately meet the above specifications Nevertheless, the binomial is often an excellent approximation

Multilevel Basic Probability Concepts

slide-58
SLIDE 58

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Characteristics of the Binomial Distribution

Characteristics of the Binomial Distribution The binomial distribution is a two-parameter family, N is the number of trials, p the probability of success The binomial has pdf Pr(X = r) = N r

  • pr(1 − p)N −r

The mean and variance of the binomial are E(X ) = Np Var(X ) = Np(1 − p)

Multilevel Basic Probability Concepts

slide-59
SLIDE 59

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Characteristics of the Binomial Distribution

Characteristics of the Binomial Distribution The binomial distribution is a two-parameter family, N is the number of trials, p the probability of success The binomial has pdf Pr(X = r) = N r

  • pr(1 − p)N −r

The mean and variance of the binomial are E(X ) = Np Var(X ) = Np(1 − p)

Multilevel Basic Probability Concepts

slide-60
SLIDE 60

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Characteristics of the Binomial Distribution

Characteristics of the Binomial Distribution The binomial distribution is a two-parameter family, N is the number of trials, p the probability of success The binomial has pdf Pr(X = r) = N r

  • pr(1 − p)N −r

The mean and variance of the binomial are E(X ) = Np Var(X ) = Np(1 − p)

Multilevel Basic Probability Concepts

slide-61
SLIDE 61

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Characteristics of the Binomial Distribution

Characteristics of the Binomial Distribution The binomial distribution is a two-parameter family, N is the number of trials, p the probability of success The binomial has pdf Pr(X = r) = N r

  • pr(1 − p)N −r

The mean and variance of the binomial are E(X ) = Np Var(X ) = Np(1 − p)

Multilevel Basic Probability Concepts

slide-62
SLIDE 62

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Characteristics of the Binomial Distribution

Normal Approximation to the Binomial The B(N , p) distribution is well approximated by a N (Np, Np(1 − p)) distribution as long as p is not too far removed from .5 and N is reasonably large A good rule of thumb is that both Np and N (1 − p must be greater than 5 The approximation can be further improved by correcting for continuity

Multilevel Basic Probability Concepts

slide-63
SLIDE 63

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Characteristics of the Binomial Distribution

Normal Approximation to the Binomial The B(N , p) distribution is well approximated by a N (Np, Np(1 − p)) distribution as long as p is not too far removed from .5 and N is reasonably large A good rule of thumb is that both Np and N (1 − p must be greater than 5 The approximation can be further improved by correcting for continuity

Multilevel Basic Probability Concepts

slide-64
SLIDE 64

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Characteristics of the Binomial Distribution

Normal Approximation to the Binomial The B(N , p) distribution is well approximated by a N (Np, Np(1 − p)) distribution as long as p is not too far removed from .5 and N is reasonably large A good rule of thumb is that both Np and N (1 − p must be greater than 5 The approximation can be further improved by correcting for continuity

Multilevel Basic Probability Concepts

slide-65
SLIDE 65

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Characteristics of the Binomial Distribution

Normal Approximation to the Binomial The B(N , p) distribution is well approximated by a N (Np, Np(1 − p)) distribution as long as p is not too far removed from .5 and N is reasonably large A good rule of thumb is that both Np and N (1 − p must be greater than 5 The approximation can be further improved by correcting for continuity

Multilevel Basic Probability Concepts

slide-66
SLIDE 66

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Poisson Distribution

The Poisson Distribution When events arrive without any systematic “clustering,” i.e., they arrive with a known average rate in a fixed time period but each event arrives at a time independent of the time since the last event, the exact integer number of events can be modeled with the Poisson distribution The Poisson is a single parameter family, the parameter being λ, the expected number of events in the interval of interest For a Poisson random variable X , the probability of exactly r events is Pr(X = r) = λre−λ r!

Multilevel Basic Probability Concepts

slide-67
SLIDE 67

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Poisson Distribution

The Poisson Distribution When events arrive without any systematic “clustering,” i.e., they arrive with a known average rate in a fixed time period but each event arrives at a time independent of the time since the last event, the exact integer number of events can be modeled with the Poisson distribution The Poisson is a single parameter family, the parameter being λ, the expected number of events in the interval of interest For a Poisson random variable X , the probability of exactly r events is Pr(X = r) = λre−λ r!

Multilevel Basic Probability Concepts

slide-68
SLIDE 68

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Poisson Distribution

The Poisson Distribution When events arrive without any systematic “clustering,” i.e., they arrive with a known average rate in a fixed time period but each event arrives at a time independent of the time since the last event, the exact integer number of events can be modeled with the Poisson distribution The Poisson is a single parameter family, the parameter being λ, the expected number of events in the interval of interest For a Poisson random variable X , the probability of exactly r events is Pr(X = r) = λre−λ r!

Multilevel Basic Probability Concepts

slide-69
SLIDE 69

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

The Poisson Distribution

The Poisson Distribution When events arrive without any systematic “clustering,” i.e., they arrive with a known average rate in a fixed time period but each event arrives at a time independent of the time since the last event, the exact integer number of events can be modeled with the Poisson distribution The Poisson is a single parameter family, the parameter being λ, the expected number of events in the interval of interest For a Poisson random variable X , the probability of exactly r events is Pr(X = r) = λre−λ r!

Multilevel Basic Probability Concepts

slide-70
SLIDE 70

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing Probability Models The Normal Distribution The Multivariate Normal Distribution The Lognormal Distribution The Binomial Distribution The Poisson Distribution

Characteristics of the Poisson Distribution

Characteristics of the Poisson Distribution The Poisson is used widely to model occurrences of low probability events A random variable X having a Poisson distribution with parameter λ has mean and variance given by E(X ) = λ Var(X ) = λ

Multilevel Basic Probability Concepts

slide-71
SLIDE 71

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

Sampling Distributions As discussed in your introductory course, we frequently sample from a population and obtain a statistic as an estimate of some key quantity Over repeated samples, these estimates show variability This variability is like noise, degrading the signal that is the parameter The known or hypothetical sampling distribution of the statistic allows us to gauge how accurate our parameter estimate is (at least in the long run)

Multilevel Basic Probability Concepts

slide-72
SLIDE 72

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

Sampling Distributions As discussed in your introductory course, we frequently sample from a population and obtain a statistic as an estimate of some key quantity Over repeated samples, these estimates show variability This variability is like noise, degrading the signal that is the parameter The known or hypothetical sampling distribution of the statistic allows us to gauge how accurate our parameter estimate is (at least in the long run)

Multilevel Basic Probability Concepts

slide-73
SLIDE 73

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

Sampling Distributions As discussed in your introductory course, we frequently sample from a population and obtain a statistic as an estimate of some key quantity Over repeated samples, these estimates show variability This variability is like noise, degrading the signal that is the parameter The known or hypothetical sampling distribution of the statistic allows us to gauge how accurate our parameter estimate is (at least in the long run)

Multilevel Basic Probability Concepts

slide-74
SLIDE 74

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

Sampling Distributions As discussed in your introductory course, we frequently sample from a population and obtain a statistic as an estimate of some key quantity Over repeated samples, these estimates show variability This variability is like noise, degrading the signal that is the parameter The known or hypothetical sampling distribution of the statistic allows us to gauge how accurate our parameter estimate is (at least in the long run)

Multilevel Basic Probability Concepts

slide-75
SLIDE 75

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

Sampling Distributions As discussed in your introductory course, we frequently sample from a population and obtain a statistic as an estimate of some key quantity Over repeated samples, these estimates show variability This variability is like noise, degrading the signal that is the parameter The known or hypothetical sampling distribution of the statistic allows us to gauge how accurate our parameter estimate is (at least in the long run)

Multilevel Basic Probability Concepts

slide-76
SLIDE 76

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

An Example

Sampling Distributions — An Example Suppose we take an opinion poll of N = 100 people at random, and 47% of them favor some position The question is, what does that tell us about the proportion

  • f people in the population favoring the position?

Multilevel Basic Probability Concepts

slide-77
SLIDE 77

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

An Example

Sampling Distributions — An Example Suppose we take an opinion poll of N = 100 people at random, and 47% of them favor some position The question is, what does that tell us about the proportion

  • f people in the population favoring the position?

Multilevel Basic Probability Concepts

slide-78
SLIDE 78

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

An Example

Sampling Distributions — An Example Suppose we take an opinion poll of N = 100 people at random, and 47% of them favor some position The question is, what does that tell us about the proportion

  • f people in the population favoring the position?

Multilevel Basic Probability Concepts

slide-79
SLIDE 79

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

An Example

Sampling Distributions — An Example In your introductory course, you learned as a simple consequence of the binomial distribution that if the population proportion is p, the sample proportion ˆ p has a sampling distribution that is approximately normal, with mean p and variance p(1 − p)/N For any hypothesized value of p, this tells us, through our knowledge of the normal distribution, how likely we would be to observe a value of .47 We can use this, in turn, to evaluate which values of p are “reasonable” in some sense

Multilevel Basic Probability Concepts

slide-80
SLIDE 80

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

An Example

Sampling Distributions — An Example In your introductory course, you learned as a simple consequence of the binomial distribution that if the population proportion is p, the sample proportion ˆ p has a sampling distribution that is approximately normal, with mean p and variance p(1 − p)/N For any hypothesized value of p, this tells us, through our knowledge of the normal distribution, how likely we would be to observe a value of .47 We can use this, in turn, to evaluate which values of p are “reasonable” in some sense

Multilevel Basic Probability Concepts

slide-81
SLIDE 81

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

An Example

Sampling Distributions — An Example In your introductory course, you learned as a simple consequence of the binomial distribution that if the population proportion is p, the sample proportion ˆ p has a sampling distribution that is approximately normal, with mean p and variance p(1 − p)/N For any hypothesized value of p, this tells us, through our knowledge of the normal distribution, how likely we would be to observe a value of .47 We can use this, in turn, to evaluate which values of p are “reasonable” in some sense

Multilevel Basic Probability Concepts

slide-82
SLIDE 82

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Sampling Distributions

An Example

Sampling Distributions — An Example In your introductory course, you learned as a simple consequence of the binomial distribution that if the population proportion is p, the sample proportion ˆ p has a sampling distribution that is approximately normal, with mean p and variance p(1 − p)/N For any hypothesized value of p, this tells us, through our knowledge of the normal distribution, how likely we would be to observe a value of .47 We can use this, in turn, to evaluate which values of p are “reasonable” in some sense

Multilevel Basic Probability Concepts

slide-83
SLIDE 83

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals

Confidence Intervals A confidence interval is a numerical interval constructed on the basis of data Such an interval is called a 95% (or .95) confidence interval if it is constructed so that it contains the true parameter value at least 95% of the time in the long run There are a variety of methods available for constructing confidence intervals

Multilevel Basic Probability Concepts

slide-84
SLIDE 84

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals

Confidence Intervals A confidence interval is a numerical interval constructed on the basis of data Such an interval is called a 95% (or .95) confidence interval if it is constructed so that it contains the true parameter value at least 95% of the time in the long run There are a variety of methods available for constructing confidence intervals

Multilevel Basic Probability Concepts

slide-85
SLIDE 85

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals

Confidence Intervals A confidence interval is a numerical interval constructed on the basis of data Such an interval is called a 95% (or .95) confidence interval if it is constructed so that it contains the true parameter value at least 95% of the time in the long run There are a variety of methods available for constructing confidence intervals

Multilevel Basic Probability Concepts

slide-86
SLIDE 86

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals

Confidence Intervals A confidence interval is a numerical interval constructed on the basis of data Such an interval is called a 95% (or .95) confidence interval if it is constructed so that it contains the true parameter value at least 95% of the time in the long run There are a variety of methods available for constructing confidence intervals

Multilevel Basic Probability Concepts

slide-87
SLIDE 87

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Simple Normal Theory Confidence Intervals

Normal Theory Confidence Intervals In Psychology 310 we leared about simple symmetric confidence intervals based on the normal distribution If a statistic ˆ θ used to estimate a parameter θ has a normal sampling distribution with mean θ and sampling variance Var(ˆ θ), then we may construct a 95% confidence interval for θ as ˆ θ ± 1.96

  • Var(ˆ

θ) In general, a consistent estimator Var(ˆ θ) may be substituted for Var(ˆ θ) in the above

Multilevel Basic Probability Concepts

slide-88
SLIDE 88

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Simple Normal Theory Confidence Intervals

Normal Theory Confidence Intervals In Psychology 310 we leared about simple symmetric confidence intervals based on the normal distribution If a statistic ˆ θ used to estimate a parameter θ has a normal sampling distribution with mean θ and sampling variance Var(ˆ θ), then we may construct a 95% confidence interval for θ as ˆ θ ± 1.96

  • Var(ˆ

θ) In general, a consistent estimator Var(ˆ θ) may be substituted for Var(ˆ θ) in the above

Multilevel Basic Probability Concepts

slide-89
SLIDE 89

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Simple Normal Theory Confidence Intervals

Normal Theory Confidence Intervals In Psychology 310 we leared about simple symmetric confidence intervals based on the normal distribution If a statistic ˆ θ used to estimate a parameter θ has a normal sampling distribution with mean θ and sampling variance Var(ˆ θ), then we may construct a 95% confidence interval for θ as ˆ θ ± 1.96

  • Var(ˆ

θ) In general, a consistent estimator Var(ˆ θ) may be substituted for Var(ˆ θ) in the above

Multilevel Basic Probability Concepts

slide-90
SLIDE 90

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Simple Normal Theory Confidence Intervals

Normal Theory Confidence Intervals In Psychology 310 we leared about simple symmetric confidence intervals based on the normal distribution If a statistic ˆ θ used to estimate a parameter θ has a normal sampling distribution with mean θ and sampling variance Var(ˆ θ), then we may construct a 95% confidence interval for θ as ˆ θ ± 1.96

  • Var(ˆ

θ) In general, a consistent estimator Var(ˆ θ) may be substituted for Var(ˆ θ) in the above

Multilevel Basic Probability Concepts

slide-91
SLIDE 91

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals on Linear Combinations

Confidence Intervals on Linear Combinations As we saw in Psychology 310, frequently linear combinations of parameters are of interest In that case, we can construct appropriate point estimates, standard errors, test statistics, and confidence intervals Methods are discussed in detail in the Psychology 310 handout, A Unified Approach to Some Common Statistical Tests

Multilevel Basic Probability Concepts

slide-92
SLIDE 92

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals on Linear Combinations

Confidence Intervals on Linear Combinations As we saw in Psychology 310, frequently linear combinations of parameters are of interest In that case, we can construct appropriate point estimates, standard errors, test statistics, and confidence intervals Methods are discussed in detail in the Psychology 310 handout, A Unified Approach to Some Common Statistical Tests

Multilevel Basic Probability Concepts

slide-93
SLIDE 93

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals on Linear Combinations

Confidence Intervals on Linear Combinations As we saw in Psychology 310, frequently linear combinations of parameters are of interest In that case, we can construct appropriate point estimates, standard errors, test statistics, and confidence intervals Methods are discussed in detail in the Psychology 310 handout, A Unified Approach to Some Common Statistical Tests

Multilevel Basic Probability Concepts

slide-94
SLIDE 94

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals on Linear Combinations

Confidence Intervals on Linear Combinations As we saw in Psychology 310, frequently linear combinations of parameters are of interest In that case, we can construct appropriate point estimates, standard errors, test statistics, and confidence intervals Methods are discussed in detail in the Psychology 310 handout, A Unified Approach to Some Common Statistical Tests

Multilevel Basic Probability Concepts

slide-95
SLIDE 95

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

Confidence Intervals Via Simulation In some cases, we are interested in a function of parameters We know the distribution of individual parameter estimates, but we don’t have a convenient expression for the distribution of the function of the parameter estimates In this case, we can simulate the distribution of the function of parameter estimates using random number generation To generate the 95% confidence interval, we extract the .025 and .975 quantiles of the resulting simulated data

Multilevel Basic Probability Concepts

slide-96
SLIDE 96

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

Confidence Intervals Via Simulation In some cases, we are interested in a function of parameters We know the distribution of individual parameter estimates, but we don’t have a convenient expression for the distribution of the function of the parameter estimates In this case, we can simulate the distribution of the function of parameter estimates using random number generation To generate the 95% confidence interval, we extract the .025 and .975 quantiles of the resulting simulated data

Multilevel Basic Probability Concepts

slide-97
SLIDE 97

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

Confidence Intervals Via Simulation In some cases, we are interested in a function of parameters We know the distribution of individual parameter estimates, but we don’t have a convenient expression for the distribution of the function of the parameter estimates In this case, we can simulate the distribution of the function of parameter estimates using random number generation To generate the 95% confidence interval, we extract the .025 and .975 quantiles of the resulting simulated data

Multilevel Basic Probability Concepts

slide-98
SLIDE 98

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

Confidence Intervals Via Simulation In some cases, we are interested in a function of parameters We know the distribution of individual parameter estimates, but we don’t have a convenient expression for the distribution of the function of the parameter estimates In this case, we can simulate the distribution of the function of parameter estimates using random number generation To generate the 95% confidence interval, we extract the .025 and .975 quantiles of the resulting simulated data

Multilevel Basic Probability Concepts

slide-99
SLIDE 99

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

Confidence Intervals Via Simulation In some cases, we are interested in a function of parameters We know the distribution of individual parameter estimates, but we don’t have a convenient expression for the distribution of the function of the parameter estimates In this case, we can simulate the distribution of the function of parameter estimates using random number generation To generate the 95% confidence interval, we extract the .025 and .975 quantiles of the resulting simulated data

Multilevel Basic Probability Concepts

slide-100
SLIDE 100

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) An example of the simulation approach can be found on page 20 of Gelman & Hill They assume that, with N = 500 per group, the distribution of the sample proportion can be approximated very accurately with a normal distribution In the problem of interest, the experimenter has observed sample proportions ˆ p1 and ˆ p2, each based on samples of 500 However, the experimenter wishes to construct a confidence interval on p1/p2.

Multilevel Basic Probability Concepts

slide-101
SLIDE 101

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) An example of the simulation approach can be found on page 20 of Gelman & Hill They assume that, with N = 500 per group, the distribution of the sample proportion can be approximated very accurately with a normal distribution In the problem of interest, the experimenter has observed sample proportions ˆ p1 and ˆ p2, each based on samples of 500 However, the experimenter wishes to construct a confidence interval on p1/p2.

Multilevel Basic Probability Concepts

slide-102
SLIDE 102

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) An example of the simulation approach can be found on page 20 of Gelman & Hill They assume that, with N = 500 per group, the distribution of the sample proportion can be approximated very accurately with a normal distribution In the problem of interest, the experimenter has observed sample proportions ˆ p1 and ˆ p2, each based on samples of 500 However, the experimenter wishes to construct a confidence interval on p1/p2.

Multilevel Basic Probability Concepts

slide-103
SLIDE 103

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) An example of the simulation approach can be found on page 20 of Gelman & Hill They assume that, with N = 500 per group, the distribution of the sample proportion can be approximated very accurately with a normal distribution In the problem of interest, the experimenter has observed sample proportions ˆ p1 and ˆ p2, each based on samples of 500 However, the experimenter wishes to construct a confidence interval on p1/p2.

Multilevel Basic Probability Concepts

slide-104
SLIDE 104

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) An example of the simulation approach can be found on page 20 of Gelman & Hill They assume that, with N = 500 per group, the distribution of the sample proportion can be approximated very accurately with a normal distribution In the problem of interest, the experimenter has observed sample proportions ˆ p1 and ˆ p2, each based on samples of 500 However, the experimenter wishes to construct a confidence interval on p1/p2.

Multilevel Basic Probability Concepts

slide-105
SLIDE 105

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) The experimenter proceeds by constructing 10000 independent replications of ˆ p1 and 10000 replications of ˆ p2 For each pair, the ratio ˆ p1/ˆ p2 is computed This creates a set of 10000 replications of the ratio of proportions The 95% confidence interval is then constructed from the .025 and .975 quantiles of this set of 10000 ratios

Multilevel Basic Probability Concepts

slide-106
SLIDE 106

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) The experimenter proceeds by constructing 10000 independent replications of ˆ p1 and 10000 replications of ˆ p2 For each pair, the ratio ˆ p1/ˆ p2 is computed This creates a set of 10000 replications of the ratio of proportions The 95% confidence interval is then constructed from the .025 and .975 quantiles of this set of 10000 ratios

Multilevel Basic Probability Concepts

slide-107
SLIDE 107

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) The experimenter proceeds by constructing 10000 independent replications of ˆ p1 and 10000 replications of ˆ p2 For each pair, the ratio ˆ p1/ˆ p2 is computed This creates a set of 10000 replications of the ratio of proportions The 95% confidence interval is then constructed from the .025 and .975 quantiles of this set of 10000 ratios

Multilevel Basic Probability Concepts

slide-108
SLIDE 108

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) The experimenter proceeds by constructing 10000 independent replications of ˆ p1 and 10000 replications of ˆ p2 For each pair, the ratio ˆ p1/ˆ p2 is computed This creates a set of 10000 replications of the ratio of proportions The 95% confidence interval is then constructed from the .025 and .975 quantiles of this set of 10000 ratios

Multilevel Basic Probability Concepts

slide-109
SLIDE 109

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing The Classic Normal Theory Approach Confidence Intervals on Linear Transformations Confidence Intervals Via Simulation

Confidence Intervals Via Simulation

An Example

Example (Confidence Intervals Via Simulation) The experimenter proceeds by constructing 10000 independent replications of ˆ p1 and 10000 replications of ˆ p2 For each pair, the ratio ˆ p1/ˆ p2 is computed This creates a set of 10000 replications of the ratio of proportions The 95% confidence interval is then constructed from the .025 and .975 quantiles of this set of 10000 ratios

Multilevel Basic Probability Concepts

slide-110
SLIDE 110

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Hypothesis Testing

Hypothesis Testing Gelman and Hill make a number of interesting points in their brief discussion They suggest viewing a hypothesis as a model about the data Testing the hypothesis involves comparing the behavior of the data with the data predicted by the model For example, if proportions are showing their standard random variation, this implies something about the size of that variation They examine this notion in an extensive example

Multilevel Basic Probability Concepts

slide-111
SLIDE 111

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Hypothesis Testing

Hypothesis Testing Gelman and Hill make a number of interesting points in their brief discussion They suggest viewing a hypothesis as a model about the data Testing the hypothesis involves comparing the behavior of the data with the data predicted by the model For example, if proportions are showing their standard random variation, this implies something about the size of that variation They examine this notion in an extensive example

Multilevel Basic Probability Concepts

slide-112
SLIDE 112

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Hypothesis Testing

Hypothesis Testing Gelman and Hill make a number of interesting points in their brief discussion They suggest viewing a hypothesis as a model about the data Testing the hypothesis involves comparing the behavior of the data with the data predicted by the model For example, if proportions are showing their standard random variation, this implies something about the size of that variation They examine this notion in an extensive example

Multilevel Basic Probability Concepts

slide-113
SLIDE 113

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Hypothesis Testing

Hypothesis Testing Gelman and Hill make a number of interesting points in their brief discussion They suggest viewing a hypothesis as a model about the data Testing the hypothesis involves comparing the behavior of the data with the data predicted by the model For example, if proportions are showing their standard random variation, this implies something about the size of that variation They examine this notion in an extensive example

Multilevel Basic Probability Concepts

slide-114
SLIDE 114

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Hypothesis Testing

Hypothesis Testing Gelman and Hill make a number of interesting points in their brief discussion They suggest viewing a hypothesis as a model about the data Testing the hypothesis involves comparing the behavior of the data with the data predicted by the model For example, if proportions are showing their standard random variation, this implies something about the size of that variation They examine this notion in an extensive example

Multilevel Basic Probability Concepts

slide-115
SLIDE 115

Random Variables Probability Distributions Sampling Distributions Confidence Intervals Hypothesis Testing

Hypothesis Testing

Hypothesis Testing Gelman and Hill make a number of interesting points in their brief discussion They suggest viewing a hypothesis as a model about the data Testing the hypothesis involves comparing the behavior of the data with the data predicted by the model For example, if proportions are showing their standard random variation, this implies something about the size of that variation They examine this notion in an extensive example

Multilevel Basic Probability Concepts