
Statistical methods
Tomaž Podobnik
Oddelek za fiziko, FMF, UNI-LJ; Odsek za eksperimentalno fiziko delcev, IJS

Contents:
I. Prologue,
II. Mathematical Preliminaries,
III. Frequency Interpretation of Probability Distributions,
IV. Confidence Intervals,
V. Testing of Hypotheses,
VI. Inverse Probability Distributions,
VII. Interpretation of Inverse Probability Distributions,
VIII. Time Series and Dynamical Models.

4/15/2010

II. Mathematical Preliminaries:
1. Motivation,
2. Probability spaces,
3. Conditional probabilities,
4. Random variables,
5. Probability distributions,
6. Transformations of probability distributions,
7. Conditional distributions,
8. Parametric families of (direct) probability distributions,
9. The Central Limit Theorem,
10. Invariant parametric families.

Theorem 2 (CLT, Lévy). Consider i.i.d. $X_1, \ldots, X_n$, with $\langle x_i \rangle = \langle x \rangle$ and $\mathrm{Var}(x_i) = \mathrm{Var}(x) < \infty$. Then,

$$\lim_{n \to \infty} \bar{x}_n \sim N\!\left(\langle x \rangle, \frac{\mathrm{Var}(x)}{n}\right), \qquad \bar{x}_n \equiv \frac{1}{n} \sum_{i=1}^{n} x_i .$$

$$s_n^2 \equiv \frac{1}{n-1} \sum_{i=1}^{n} \left(x_i - \bar{x}_n\right)^2 .$$

Proposition. Consider i.i.d. $\{X_1, \ldots, X_n\}$, and suppose that $\langle x \rangle$, $\langle x^2 \rangle$, $\langle x^3 \rangle$, $\langle x^4 \rangle$ all exist and are finite. Then,

$$\langle s_n^2 \rangle = \mathrm{Var}(x) .$$
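Theorem 2 and the Proposition can be checked numerically. The following is a minimal sketch, assuming uniform variates on [0, 1) (so that the mean is 1/2 and Var(x) = 1/12); the function names, the sample size n = 50 and the number of repetitions are illustrative choices, not part of the slides.

```python
import random

def sample_mean_and_s2(xs):
    """Return the sample mean and the unbiased sample variance s_n^2 of xs."""
    n = len(xs)
    mean = sum(xs) / n
    s2 = sum((x - mean) ** 2 for x in xs) / (n - 1)
    return mean, s2

def clt_check(n=50, trials=5000, seed=1):
    """Repeat an n-point uniform[0,1) experiment `trials` times (assumed setup)."""
    rng = random.Random(seed)
    means, s2s = [], []
    for _ in range(trials):
        xs = [rng.random() for _ in range(n)]
        mean, s2 = sample_mean_and_s2(xs)
        means.append(mean)
        s2s.append(s2)
    grand_mean = sum(means) / trials
    var_of_means = sum((m - grand_mean) ** 2 for m in means) / trials
    mean_of_s2 = sum(s2s) / trials
    return grand_mean, var_of_means, mean_of_s2

grand_mean, var_of_means, mean_of_s2 = clt_check()
print("mean of sample means:", grand_mean)        # should be close to 1/2
print("variance of sample means:", var_of_means)  # should be close to (1/12)/50
print("average of s_n^2:", mean_of_s2)            # should be close to 1/12
```

The spread of the sample means shrinks like Var(x)/n, while the average of s_n^2 stays near Var(x), in line with the two statements above.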

III. Frequency Interpretation of Probability Distributions:

“In order to make the theory operational, we must introduce a concept of probability that links the mathematics to an external world of measurable phenomena.” (A. Stuart, J. K. Ord (1994), § 8.8, p. 290.)

“The most striking achievement of the physical sciences is prediction.” (G. Pólya (1954), Chap. XIV, § 4, p. 64.)

“The pure mathematician can do what he pleases, but the applied mathematician must be at least partially sane.” (M. Kline (1980), Mathematics: The Loss of Certainty, Chap. XIII, p. 285.)

III. Frequency Interpretation of Probability Distributions:
1. Example,
2. Binary random sequence,
3. Random sequence of real numbers,
4. Monte Carlo methods.

1. Example 1 (Bertrand's paradox).

A straw is tossed at random so that the line determined by the straw intersects the unit circle. What is the expected length $\langle l \rangle$ of the chord thus defined?

J. L. Bertrand (1889), Calcul des probabilités, pp. 4-5.
J. B. Paris (1994), The Uncertain Reasoner's Companion, Chap. 6, pp. 71-72.
E. T. Jaynes (2003), Probability Theory, § 12.4.4, pp. 386-394.

[Figure: unit circle centred at (0, 0) with a chord of length l; labelled are the points (x_1, y_1), (x_2, y_2) and (−1, 0), and the quantities h and θ.]

a) $f(h) = \begin{cases} 1; & 0 \le h \le 1 \\ 0; & \text{otherwise} \end{cases} \;\Rightarrow\; \langle l \rangle = \dfrac{\pi}{2} \approx 1.57$;

b) $f(\theta) = \begin{cases} \dfrac{2}{\pi}; & 0 \le \theta \le \dfrac{\pi}{2} \\ 0; & \text{otherwise} \end{cases} \;\Rightarrow\; \langle l \rangle = \dfrac{4}{\pi} \approx 1.27$;

c) $f(x_2) = \begin{cases} \dfrac{1}{2}; & -1 \le x_2 \le 1 \\ 0; & \text{otherwise} \end{cases} \;\Rightarrow\; \langle l \rangle = \dfrac{4}{3} \approx 1.33$.
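The three expected chord lengths can be reproduced by simulation. The sketch below assumes a geometric reading of the three parametrisations that is not spelled out in the text: h is the distance of the chord from the centre (l = 2√(1 − h²)), θ is the angle between the chord and the diameter through one of its endpoints (l = 2 cos θ), and x_2 is the abscissa of the second endpoint when the first is fixed at (−1, 0) (l = √(2(1 + x_2))). These readings are chosen because they reproduce the quoted values; all function names are illustrative.

```python
import math
import random

def chord_length_a(rng):
    # (a) assumed: distance h of the chord from the centre is uniform on [0, 1]
    h = rng.random()
    return 2.0 * math.sqrt(1.0 - h * h)

def chord_length_b(rng):
    # (b) assumed: angle theta is uniform on [0, pi/2], chord length 2 cos(theta)
    theta = rng.uniform(0.0, math.pi / 2.0)
    return 2.0 * math.cos(theta)

def chord_length_c(rng):
    # (c) assumed: one endpoint at (-1, 0), abscissa x_2 of the other uniform on [-1, 1]
    x2 = rng.uniform(-1.0, 1.0)
    return math.sqrt(2.0 * (1.0 + x2))

def mean_chord(sampler, n=100000, seed=0):
    """Average chord length over n pseudo-random tosses."""
    rng = random.Random(seed)
    return sum(sampler(rng) for _ in range(n)) / n

for label, sampler, exact in (
    ("a", chord_length_a, math.pi / 2.0),
    ("b", chord_length_b, 4.0 / math.pi),
    ("c", chord_length_c, 4.0 / 3.0),
):
    print(label, mean_chord(sampler), "exact:", exact)
```

Each notion of "at random" is a perfectly valid distribution, yet the three give different averages, which is the point of the paradox.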

2. Binary random sequences.

Consider an infinite binary sequence

1,0,1,1,0,1,0,0,0,1,0,1,1,0,1,...

with equal relative frequencies of appearance of 1's and 0's,

$$\nu_1 = \nu_0 = \frac{1}{2} ;$$

or, more precisely,

$$\lim_{n \to \infty} P\!\left( \left| \frac{n_1}{n} - \frac{1}{2} \right| < \varepsilon \right) = 1 .$$

We say that $\nu_1 = \nu_0 = 1/2$ is true almost everywhere with respect to the Bernoulli measure Bn(1/2) on the space of infinite binary sequences, called Cantor space (Bn(1/2) on the Cantor space is isomorphic to the Lebesgue measure on the interval [0,1]).

For a Bn(1/2)-typical binary sequence we would further expect that

$$\nu_{1,1} = \nu_{1,0} = \nu_{0,1} = \nu_{0,0} = \frac{1}{4} ,$$

$$\nu_{1,1,1} = \nu_{1,1,0} = \nu_{1,0,1} = \nu_{0,1,1} = \nu_{1,0,0} = \nu_{0,1,0} = \nu_{0,0,1} = \nu_{0,0,0} = \frac{1}{8} ,$$

$$\vdots$$

holds Bn(1/2)-almost everywhere.

That is, we would naively expect a Bn(1/2)-typical binary sequence to satisfy all properties that are true Bn(1/2)-almost everywhere. Unfortunately, such a definition is vacuous: every single sequence has Bn(1/2)-measure zero, so "differing from that particular sequence" is itself a property true almost everywhere, and no sequence can satisfy all such properties at once.
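The block-frequency conditions above can be illustrated on a long finite prefix. This is only a sketch: it uses Python's pseudo-random generator as a stand-in for a Bn(1/2)-typical sequence and counts overlapping blocks; the function name and the sequence length are illustrative assumptions.

```python
import random
from collections import Counter
from itertools import product

def block_frequencies(bits, k):
    """Relative frequencies of all overlapping k-bit blocks in `bits`."""
    total = len(bits) - k + 1
    counts = Counter(tuple(bits[i:i + k]) for i in range(total))
    # Counter returns 0 for blocks that never occur
    return {block: counts[block] / total for block in product((0, 1), repeat=k)}

rng = random.Random(2010)
bits = [rng.getrandbits(1) for _ in range(200000)]

for k in (1, 2, 3):
    freqs = block_frequencies(bits, k)
    print(k, {"".join(map(str, b)): round(f, 4) for b, f in freqs.items()})
```

For blocks of length k the frequencies should all lie close to 2^(-k), i.e. 1/2, 1/4 and 1/8 here.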

Definition 1 (Bn(1/2)-random binary sequence). An infinite binary sequence is called (Martin-Löf) Bn(1/2)-random iff it is not rejected by the Martin-Löf test (i.e., if it satisfies a (special) countable sequence of properties true Bn(1/2)-almost everywhere).

P. Martin-Löf (1966), Inform. Control 9, 602-619.

The limiting frequencies $\nu_1$ and $\nu_0$ need not be the same, e.g., $\nu_1 = 2/3$ and $\nu_0 = 1/3$.

Definition 2 (Bn($\nu_1$)-random binary sequence). An infinite binary sequence is called (Martin-Löf) Bn($\nu_1$)-random iff it is not rejected by the Martin-Löf test (i.e., if it satisfies a countable sequence of properties true Bn($\nu_1$)-almost everywhere).

Remark 1. No finite binary sequence is random.

3. Real random sequences.

Given a probability space $(\mathbb{R}^n, \mathcal{B}^n, \Pr_X)$, a set $A \in \mathcal{B}^n$ and an infinite sequence $x_1, x_2, \ldots$ ($x_i \in \mathbb{R}^n$) give rise to a binary sequence $b_1, b_2, \ldots$, where

$$b_i = \begin{cases} 1; & x_i \in A \\ 0; & \text{otherwise.} \end{cases}$$

Definition 3 ($\Pr_X$-random sequence). Given a probability space $(\mathbb{R}^n, \mathcal{B}^n, \Pr_X)$, an infinite sequence $x_1, x_2, \ldots$ ($x_i \in \mathbb{R}^n$) is $\Pr_X$-random iff for every $A \in \mathcal{B}^n$ the corresponding binary sequence $b_1, b_2, \ldots$ is Bn[$\Pr_X(A)$]-random.

In this way, the probability distribution $\Pr_X$ on $\mathbb{R}^n$ coincides with the (frequency) distribution of the sequence $x_1, x_2, \ldots$, which is characteristic of the frequency interpretation of probability.

Remark 2. Every finite sequence is non-random. Consequently, the randomness of QM cannot be verified; it can only be postulated.

Remark 3. Every (possibly infinite) sequence that results from an algorithm is non-random. Consequently, none of the numbers from random number generators based on algorithms is truly random. Rather, they are pseudo-random numbers.

There are random number generators based on QM processes such as, for example, radioactive decays. The numbers from these generators may be (parts of) truly random sequences.
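Remark 3 can be made concrete with a small sketch: an algorithmic generator started from the same seed reproduces exactly the same sequence, so its output is fully determined by the algorithm and the seed. The helper name and the seed value below are illustrative assumptions; Python's `random.Random` is used only as one example of such a generator.

```python
import random

def pseudo_random_bits(seed, n):
    """Return n bits from an algorithmic generator started at `seed`."""
    rng = random.Random(seed)
    return [rng.getrandbits(1) for _ in range(n)]

first = pseudo_random_bits(seed=42, n=32)
second = pseudo_random_bits(seed=42, n=32)

# The two runs agree bit for bit: the sequence is reproducible.
print(first == second)
```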

4. Monte Carlo methods.

Basis: Generator of (pseudo-)random numbers, uniformly distributed on an interval, often [0,1].

MC integration:

1. $x_i = x_a + \mathrm{rndm} \times (x_b - x_a)$
2. $y_i = \mathrm{rndm}' \times f_{\max}$
3. $y_i \le f(x_i) \;\Rightarrow\; N_{acc} = N_{acc} + 1$

$$\int_{x_a}^{x_b} f(x)\, dx = \frac{N_{acc}}{N_{gen}} \times f_{\max} \times (x_b - x_a)$$

[Figure: plot of f(x) versus x, with f_max, x_a and x_b marked.]
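The three steps of MC integration translate almost line by line into code. The following is a minimal sketch, assuming a non-negative integrand bounded by f_max on [x_a, x_b]; the function name `mc_integrate` and the test integrand sin(x) on [0, π] (exact integral 2) are illustrative choices.

```python
import math
import random

def mc_integrate(f, x_a, x_b, f_max, n_gen, rng):
    """Hit-or-miss estimate of the integral of f over [x_a, x_b].

    Assumes 0 <= f(x) <= f_max on the whole interval.
    """
    n_acc = 0
    for _ in range(n_gen):
        x = x_a + rng.random() * (x_b - x_a)  # step 1
        y = rng.random() * f_max              # step 2
        if y <= f(x):                         # step 3
            n_acc += 1
    return n_acc / n_gen * f_max * (x_b - x_a)

estimate = mc_integrate(math.sin, 0.0, math.pi, 1.0, 200000, random.Random(7))
print("estimate:", estimate, "exact:", 2.0)
```

The statistical error of the estimate falls off like 1/√N_gen, so each additional digit of precision costs roughly a hundred times more generated points.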

(Pseudo-)random numbers for arbitrary $f_X(x)$:

$V_X = [x_a, x_b]$

1. $x_i = x_a + \mathrm{rndm} \times (x_b - x_a)$
2. $y_i = \mathrm{rndm}' \times f_{\max}$
3. $y_i \le f_X(x_i) \;\Rightarrow\; \text{accept } x_i$

$$\{x_i\}_{\text{accepted}} \sim f_X(x)$$

[Figure: plot of f_X(x) versus x, with f_max, x_a and x_b marked.]
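The accept-reject recipe can be sketched as follows. The target density f_X(x) = 2x on [0, 1] (with f_max = 2 and mean 2/3) is a hypothetical example chosen because its moments are easy to check; the function names are likewise illustrative.

```python
import random

def sample_accept_reject(f_x, x_a, x_b, f_max, n_gen, rng):
    """Generate n_gen candidates and return the accepted x values.

    Assumes 0 <= f_x(x) <= f_max on [x_a, x_b].
    """
    accepted = []
    for _ in range(n_gen):
        x = x_a + rng.random() * (x_b - x_a)  # step 1
        y = rng.random() * f_max              # step 2
        if y <= f_x(x):                       # step 3: accept x
            accepted.append(x)
    return accepted

def triangular_pdf(x):
    # hypothetical target density f_X(x) = 2x on [0, 1]
    return 2.0 * x

xs = sample_accept_reject(triangular_pdf, 0.0, 1.0, 2.0, 100000, random.Random(3))
sample_mean = sum(xs) / len(xs)
print("accepted:", len(xs), "sample mean:", sample_mean)  # mean should be near 2/3
```

About half of the candidates are accepted here, because the area under f_X is half the area of the bounding rectangle.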

Low efficiencies may represent a serious problem:

$V_X = [x_a, x_b]$

1. $x_i = x_a + \mathrm{rndm} \times (x_b - x_a)$
2. $y_i = \mathrm{rndm}' \times f_{\max}$
3. $y_i \le f_X(x_i) \;\Rightarrow\; \text{accept } x_i$

$$S_{rec} = [x_a, x_b] \times f_{\max}$$

$$\frac{N_{acc}}{N_{gen}} = \frac{S_{shad}}{S_{rec}}$$

[Figure: plot of f_X(x) versus x, with f_max, x_a and x_b marked.]
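How small the ratio N_acc/N_gen can become is easy to see for a sharply peaked density. The sketch below assumes that S_shad is the area under f_X and S_rec the area of the bounding rectangle, and uses a hypothetical unnormalised Gaussian peak of width σ = 0.01 on [−1, 1]; the names `acceptance_efficiency`, `narrow_peak` and `SIGMA` are illustrative.

```python
import math
import random

def acceptance_efficiency(f_x, x_a, x_b, f_max, n_gen, rng):
    """Fraction N_acc / N_gen of candidates accepted by the accept-reject steps."""
    n_acc = 0
    for _ in range(n_gen):
        x = x_a + rng.random() * (x_b - x_a)
        y = rng.random() * f_max
        if y <= f_x(x):
            n_acc += 1
    return n_acc / n_gen

SIGMA = 0.01  # hypothetical width of the peak

def narrow_peak(x):
    # unnormalised Gaussian with maximum 1 at x = 0
    return math.exp(-0.5 * (x / SIGMA) ** 2)

efficiency = acceptance_efficiency(narrow_peak, -1.0, 1.0, 1.0, 200000, random.Random(11))
# area under the peak divided by the area of the rectangle [-1, 1] x [0, 1]
expected = SIGMA * math.sqrt(2.0 * math.pi) / 2.0
print("efficiency:", efficiency, "expected:", expected)
```

With these numbers only about one candidate in eighty is accepted, so most of the generated random numbers are wasted.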
