

1. Biostatistics Preparatory Course: Methods and Computing, Lecture 9: Maximum Likelihood & the Bootstrap. Harvard University, Department of Biostatistics.

2. Overview: Maximum Likelihood Estimation
   Consider estimating a parameter θ given a sample of data, {X_1, ..., X_n}.
   What is maximum likelihood estimation? A statistical method that estimates θ as the value that maximizes the likelihood of obtaining the observed data.
   That is, the maximum likelihood estimator (MLE) provides the greatest amount of agreement between the selected model and the data.

3. Overview: Maximum Likelihood Estimation
   What is the likelihood function?
   In math: L(θ) = f(x_1, ..., x_n | θ), where f(·) denotes the joint density of the data.
   In words: the function that gives the probability (relative frequency) of observing the data as a function of θ.
   The MLE is defined as θ̂_MLE = arg max_θ L(θ).

4. Simple Setting
   We will focus on the setting of iid observations, that is, {X_1, ..., X_n} is a simple random sample.
   The likelihood then simplifies to L(θ) = ∏_{i=1}^{n} f(x_i | θ).
   In practice, we typically maximize the log of the likelihood, ℓ(θ) = log L(θ) = Σ_{i=1}^{n} log f(x_i | θ), since taking the derivative of a sum is typically easier than of a product, and the likelihood itself can be vanishingly small for large n (a computational issue).
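   A quick numerical illustration of that computational point, using simulated data (the sample below is an assumption for demonstration, not from the slides): the product of many densities underflows to zero in floating point, while the sum of log densities remains well behaved.

```r
# Illustration: raw likelihood vs. log-likelihood for n = 2000 N(0,1) draws
set.seed(1)
x <- rnorm(2000)                              # simulated standard-normal sample

prod(dnorm(x, mean = 0, sd = 1))              # underflows to 0
sum(dnorm(x, mean = 0, sd = 1, log = TRUE))   # finite log-likelihood (a large negative number)
```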

5. Why is maximum likelihood estimation so popular?
   Provides a unified framework for estimation.
   Under mild regularity conditions, MLEs are:
   1. consistent → converge to the true value in probability as n → ∞, i.e. lim_{n→∞} P(|θ̂ − θ| ≤ ε) = 1 for all ε > 0
   2. asymptotically normal → √n(θ̂ − θ) ∼ N(0, σ²) for large n
   3. asymptotically efficient → achieve the lowest possible variance for large n
   4. invariant → if θ̂ is the MLE for θ, then g(θ̂) is the MLE for g(θ)
   Many algorithms exist for maximum likelihood estimation.

6. Steps to find the MLE
   1. Write out the likelihood L(θ) = f(x_1, ..., x_n | θ)
   2. Simplify the log-likelihood ℓ(θ) = log L(θ)
   3. Take the derivative of ℓ(θ) with respect to the parameter of interest, θ
   4. Set the derivative equal to 0
   5. Solve for θ (this is your θ̂_MLE)
   6. Check that θ̂_MLE is a maximum, i.e. ∂²ℓ(θ)/∂θ² < 0

7. MLE Exercises

8. MLE Exercises
   1. Suppose we have an iid sample {X_1, ..., X_100} with X_i ∼ Bernoulli(p). Find the MLE for p. Recall that the density for a Bernoulli random variable can be written as p^{X_i} (1 − p)^{1 − X_i}.

9. MLE Exercises (continued)
   2. Suppose we have an iid sample {X_1, ..., X_n} with X_i ∼ N(µ, σ²). Find the MLE for µ. Recall that the density for a normal random variable can be written as (1 / (√(2π) σ)) exp(−(X_i − µ)² / (2σ²)).
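   For reference, the first exercise can be worked through with the six steps above; a sketch of the derivation (not part of the original slides):

```latex
\begin{align*}
\mathcal{L}(p) &= \prod_{i=1}^{100} p^{X_i}(1-p)^{1-X_i}
               = p^{\sum_i X_i}\,(1-p)^{100-\sum_i X_i},\\
\ell(p) &= \left(\sum_{i=1}^{100} X_i\right)\log p
         + \left(100-\sum_{i=1}^{100} X_i\right)\log(1-p),\\
\frac{\partial \ell(p)}{\partial p}
        &= \frac{\sum_i X_i}{p} - \frac{100-\sum_i X_i}{1-p} = 0
        \quad\Longrightarrow\quad
        \hat{p}_{\mathrm{MLE}} = \frac{1}{100}\sum_{i=1}^{100} X_i = \bar{X}.
\end{align*}
```

   The second derivative, −Σ_i X_i / p² − (100 − Σ_i X_i) / (1 − p)², is negative, confirming a maximum; the same argument with general n gives p̂ = X̄.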

10. MLE Exercises in R
   We are going to use R to derive the MLE in more complex cases.
   In the previous two examples, we found a closed-form solution (the MLE) for our parameters.
   Sometimes there is no closed-form solution, so we need to use numerical optimization methods to estimate our parameter of interest.

11. The optim function
   A general-purpose optimization function that implements various methods.
   It will find the values of some parameters that minimize some function.
   You need to specify (see the sketch after this list):
   - the parameters that you want to estimate
   - the function (in our case, the negative log-likelihood; why negative? because optim minimizes by default, and minimizing −ℓ(θ) is the same as maximizing ℓ(θ))
   - the method (I typically use "BFGS")
   - starting values for your parameters (use random numbers)
   - other values that you need to pass into your function
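   As a concrete illustration, here is a minimal sketch of this workflow for the Bernoulli exercise above; the simulated data, the logit reparameterization, and the starting value are choices made for this example, not part of the slides.

```r
# Sketch: numerical MLE for a Bernoulli success probability p via optim()
set.seed(1)
x <- rbinom(100, size = 1, prob = 0.3)     # simulated 0/1 data (illustrative)

# Negative log-likelihood, written on the logit scale so BFGS can search
# over the whole real line while p = plogis(theta) stays in (0, 1)
negloglik <- function(theta, x) {
  p <- plogis(theta)
  -sum(dbinom(x, size = 1, prob = p, log = TRUE))
}

fit <- optim(par = rnorm(1),     # random starting value
             fn  = negloglik,
             x   = x,            # extra argument passed through to negloglik
             method = "BFGS")

plogis(fit$par)                  # numerical MLE for p
mean(x)                          # closed-form MLE, for comparison
```

   The numerical answer should agree with the closed-form MLE, mean(x), up to optimizer tolerance.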

12. MLE Exercises

13. The Bootstrap
   What is the bootstrap? A widely applicable, computer-intensive resampling method used to compute standard errors, confidence intervals, and significance tests.
   Why is it important? The exact sampling distribution of an estimator can be difficult to obtain. Asymptotic expansions are sometimes easier, but expressions for standard errors based on large-sample theory may not perform well in finite samples.

14. Motivating Analogy
   The bootstrap samples should relate to the original sample just as the original sample relates to the population.

15. Overview: The Bootstrap Principle
   Without additional information, the sample contains all we know about the underlying distribution, so resampling from the sample is the best approximation to sampling from the true distribution.

16. The Bootstrap Principle
   Suppose X = {X_1, ..., X_n} is a sample used to estimate some parameter θ = T(P) of the underlying distribution P.
   To make inference on θ, we are interested in the properties of our estimator θ̂ = S(X) for θ.
   If we knew P, we could draw samples {X^(b) : b = 1, ..., B} from P and use Monte Carlo to estimate the sampling distribution of θ̂ (sound familiar?).
   We don't, so we do the next best thing and resample from the original sample, i.e. from the empirical distribution P̂.
   We expect the empirical distribution to estimate the underlying distribution well, by the Glivenko-Cantelli Theorem.

17. Bootstrap procedure
   Goal: find the standard error and confidence interval for some θ̂ = S(D), where D encodes our observed data. An R sketch of the procedure follows this list.
   - Select B independent bootstrap resamples D*(b), each consisting of N data values drawn with replacement from the data.
   - Compute the estimate from each bootstrap resample: θ̂*(b) = S(D*(b)), b = 1, ..., B.
   - Estimate the standard error se(θ̂) by the sample standard deviation of the B replications θ̂*(b).
   - Estimate the confidence interval with the 100(1 − α)% percentile bootstrap CI, (θ̂_L, θ̂_U) = (θ̂*_{α/2}, θ̂*_{1−α/2}).
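   A minimal R sketch of these steps, assuming made-up data and using the sample median as the statistic of interest (both are illustrative choices, not from the slides):

```r
# Nonparametric bootstrap for the standard error and percentile CI of a statistic
set.seed(1)
x <- rexp(50, rate = 1)          # made-up observed data
B <- 2000                        # number of bootstrap resamples

boot_est <- replicate(B, {
  resample <- sample(x, size = length(x), replace = TRUE)  # draw n values with replacement
  median(resample)                                         # recompute the statistic
})

sd(boot_est)                                 # bootstrap estimate of se(theta-hat)
quantile(boot_est, c(0.025, 0.975))          # 95% percentile bootstrap CI
```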

18. Bootstrap exercise
