Course 02402 Introduction to Statistics Lecture 10: Simulation based statistical methods Per Bruun Brockhoff
DTU Informatics Building 305 - room 110 Danish Technical University 2800 Lyngby – Denmark e-mail: pbb@imm.dtu.dk
Per Bruun Brockhoff (pbb@imm.dtu.dk) Introduction to Statistics, Lecture 10 Fall 2012 1 / 27
Overview
1 Introduction to simulation
Example 1
2 Propagation of error
Example 1, cont.
3 Confidence intervals using simulation: Bootstrapping
Example 2, one-sample Two-sample situation Example 3
4 Hypothesis testing using simulation
By bootstrap confidence intervals One-sample setup, Example 2, cont. Hypothesis testing using permutation tests Two-sample setup, Example 3, cont.
Per Bruun Brockhoff (pbb@imm.dtu.dk) Introduction to Statistics, Lecture 10 Fall 2012 2 / 27 Introduction to simulation
Motivation Table 8.1 has a "missing link”: Small samples that are NOT from a normal distribution In the old days: non-parametric tests, e.g. chapter 14. More common now: Simulation based statistics:
Confidence intervals are much easier to achieve They are much easier to apply in more complicated situations They better reflect today’s reality: they are simply now used in many contexts
Require : Use of computer - R is a super tool for this!
Per Bruun Brockhoff (pbb@imm.dtu.dk) Introduction to Statistics, Lecture 10 Fall 2012 4 / 27 Introduction to simulation
What is simulation really?
(Pseudo) random numbers generated from a computer A random number generator is an algorithm that can generate xi+1 from xi A sequence of numbers appears random Require a "start" called a "seed" (Using the computer clock) Basically the uniform distribution is simulated in this way, and then:
If U ∼ Uniform(0.1) and F is a distribution function for any probability distribution, then F −1(U) follow the distribution given by F
Per Bruun Brockhoff (pbb@imm.dtu.dk) Introduction to Statistics, Lecture 10 Fall 2012 5 / 27