Statistical methods, Tomaž Podobnik, Oddelek za fiziko, FMF, UNI-LJ (PowerPoint PPT Presentation)

SLIDE 1

Statistical methods

Tomaž Podobnik, Oddelek za fiziko, FMF, UNI-LJ; Odsek za eksperimentalno fiziko delcev, IJS

SLIDE 2

I. Prologue:
  • 1. Literature,
  • 2. Basic Desiderata,
  • 3. Example.

4/15/2010 2

SLIDE 3

  • 1. Literature:
  • no single textbook covers all topics appropriately,
  • relevant references will be given for each of the topics separately.

SLIDE 4

  • 2. Basic Desiderata:

A. Every system must be (self-) consistent.
B. Every scientific (empirical) theory must be operational (i.e., it must specify operations that ensure falsifiability of its predictions).

  • K. Popper (1959), The Logic of Scientific Discovery, § 24, pp. 91-92.
  • J. Neyman (1937), Phil. Trans. R. Soc., A 236, 333-380.
  • G. Pólya (1954), Mathematics and Plausible Reasoning, Vol. 2 – Patterns of Plausible Inference, Chap. XIV, § 4, p. 64, and Chap. XV, § 4, p. 117.

SLIDE 5

Definition 1 (Inconsistency). A system is inconsistent iff, within the rules of the system, a conclusion can be reasoned out in more than one way, and not all the ways lead to the same result.

Definition 2 (Consistency). A system that is not inconsistent (i.e., a system without a single demonstrated inconsistency) is said to be consistent.

SLIDE 6

  • 3. Example 1. A decay of an atom (nucleus, particle) at time t1:

$$f_I(t_1\,|\,\tau) = \frac{1}{\tau}\exp\left\{-\frac{t_1}{\tau}\right\};\qquad \tau \in \mathbb{R}^{+},\qquad \int_0^{\infty} f_I(t_1\,|\,\tau)\,dt_1 = 1.$$

The probability of a decay within the time interval $A = (t_a, t_b)$:

$$P(A\,|\,\tau) = \int_{t_a}^{t_b} f_I(t\,|\,\tau)\,dt;\qquad P(\Omega\,|\,\tau) = 1.$$

[Figure: $f_I(t\,|\,\tau)$ vs. t, with the interval $(t_a, t_b)$ marked.]
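Example 1 can be checked numerically. Below is a minimal Python sketch (illustrative only, not part of the lecture; the function and variable names are mine): the exponential decay density, its normalization, and the interval probability $P(A\,|\,\tau)$ from the analytic cdf.

```python
import math

def decay_pdf(t, tau):
    """f_I(t|tau) = (1/tau) * exp(-t/tau), the exponential decay density."""
    return math.exp(-t / tau) / tau

def interval_prob(t_a, t_b, tau):
    """P(A|tau) for A = (t_a, t_b), using the analytic cdf F(t) = 1 - exp(-t/tau)."""
    F = lambda t: 1.0 - math.exp(-t / tau)
    return F(t_b) - F(t_a)

tau = 2.0
# Midpoint-rule check of the normalization integral over (0, 50*tau):
dt = 1e-3
numeric = sum(decay_pdf(k * dt + dt / 2, tau) * dt for k in range(int(50 * tau / dt)))
p = interval_prob(1.0, 3.0, tau)
```

With τ = 2, the interval probability comes out as e^(−0.5) − e^(−1.5), and the numerical normalization is 1 up to truncation error.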

SLIDE 7

Given t1, what is the value of the parameter $\tau$?

The Maximum Likelihood Method (MLM):

  • R. A. Fisher (1922), Phil. Trans. R. Soc., A 222, 309-368.

Likelihood function: $L(\tau; t_1) \equiv f_I(t_1\,|\,\tau)$.

"The principle of Maximum Likelihood states that, when confronted with a choice of $\tau$, we choose that one (if any) which maximizes L."

  • A. Stuart, J. K. Ord (1994), Kendall's Advanced Theory of Statistics, Vol. 1 – Distribution Theory, § 8.22, p. 300.
  • I. Kuščer, A. Kodre (1994), Matematika v fiziki in tehniki, § 11.8, p. 309.
  • R. Jamnik (1975), Verjetnostni račun in statistika, § 24, pp. 529-530. In I. Vidav (ed.), Višja matematika II.

SLIDE 8

General method:

$$\hat\tau:\quad \left.\frac{\partial}{\partial\tau}\ln L(\tau; t_1)\right|_{\tau = \hat\tau} = 0 \quad\Longrightarrow\quad \hat\tau = t_1.$$

What justifies the MLM?

SLIDE 9

(The fallacy of) The limiting property of $\hat\tau$:

Set of measurements $\{t_1, \ldots, t_n\}$: i.i.d.,

$$L(\tau; t_1, \ldots, t_n) = f_I(t_1, \ldots, t_n\,|\,\tau) = \prod_{i=1}^{n} f_I(t_i\,|\,\tau),$$

$$\hat\tau:\quad \left.\frac{\partial}{\partial\tau}\ln L(\tau; t_1, \ldots, t_n)\right|_{\tau = \hat\tau} = 0 \quad\Longrightarrow\quad \hat\tau = \frac{1}{n}\sum_{i=1}^{n} t_i,$$

$$\lim_{n\to\infty} f(\hat\tau\,|\,\tau) \sim N\!\left(\tau,\ \frac{\tau}{\sqrt{n}}\right).$$
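The limiting behaviour above can be illustrated with a seeded simulation (a sketch of my own, not from the slides): for i.i.d. exponential decay times the ML estimate is the sample mean, which concentrates around the true τ as n grows.

```python
import random

random.seed(42)

tau_true = 2.0
n = 100_000
# i.i.d. decay times t_i ~ Exp(tau_true); random.expovariate takes the rate 1/tau.
samples = [random.expovariate(1.0 / tau_true) for _ in range(n)]

# The ML estimate solves d/dtau ln L = 0, giving tau_hat = mean of the t_i.
tau_hat = sum(samples) / n
```

The standard deviation of the estimate is roughly τ/√n ≈ 0.006 here, so τ̂ lands very close to 2.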

SLIDE 10

The conditional probability fallacy:

$$f_I(t_1\,|\,\tau) \neq f_I(\tau\,|\,t_1).$$

Interested in: $f(\tau\,|\,t_1) = \max$; calculated: $f_I(t_1\,|\,\tau) = \max$ $\Rightarrow$ an implicit assumption. Note also that, as a function of the parameter,

$$\int_0^{\infty} f_I(t_1\,|\,\tau)\,d\tau = \infty.$$

Example (L. Lyons):
A: "The first person I'll meet is female."
B: "The first person I'll meet is pregnant."

$$P(A\,|\,B) \neq P(B\,|\,A).$$
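Lyons' example can be made concrete with a toy population (the counts below are entirely made up for illustration):

```python
# Toy population for L. Lyons' example:
# A = "person is female", B = "person is pregnant".
n_total = 1000
n_female = 500           # P(A) = 0.5
n_pregnant = 20          # assume every pregnant person is female, so A ∩ B = B

p_B_given_A = n_pregnant / n_female    # P(B|A) = 20/500 = 0.04
p_A_given_B = n_pregnant / n_pregnant  # P(A|B) = 1.0
```

The two conditional probabilities are wildly different, exactly as the slide warns.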

SLIDE 11

The fallacy of point estimates:

The parameter $\tau$ may take on every value in a continuum, and the measure of a single point $\hat\tau$ in the continuum is 0. For verifiable predictions we must turn to interval estimations.

"Ignorance is preferable to error; and he is less remote from the truth who believes nothing, than he who believes what is wrong." (T. Jefferson (1781), Notes on Virginia.)

SLIDE 12

Contents:

  • I. Prologue,
  • II. Mathematical Preliminaries,
  • III. Frequency Interpretation of Probability Distributions,
  • IV. Confidence Intervals,
  • V. Testing of Hypotheses,
  • VI. Inverse Probability Distributions,
  • VII. Interpretation of Inverse Probability Distributions,
  • VIII. Time Series and Dynamical Models.

SLIDE 13

  • II. Mathematical Preliminaries:
  • 1. Motivation,
  • 2. Probability spaces,
  • 3. Conditional probabilities,
  • 4. Random variables,
  • 5. Probability distributions,
  • 6. Transformations of probability distributions,
  • 7. Conditional distributions,
  • 8. Parametric families of (direct) probability distributions,
  • 9. The Central Limit Theorem,
  • 10. Invariant parametric families.

SLIDE 14

  • 1. Motivation:
  • (self-) consistency,
  • "Jede axiomatische (abstrakte) Theorie läßt bekanntlich unbegrenzt viele konkrete Interpretationen zu." ["Every axiomatic (abstract) theory admits, as is well known, an unlimited number of concrete interpretations."] (A. N. Kolmogorov (1933), Grundbegriffe der Wahrscheinlichkeitsrechnung, Chap. I, p. 1.)

  • A. Stuart, J. K. Ord (1994), Kendall's Advanced Theory of Statistics, Vol. 1 – Distribution Theory.
  • M. M. Rao (1993), Conditional Measures and Applications.

SLIDE 15

  • 2. Probability spaces:

$\Omega$ = universal set (non-empty, abstract); $\Sigma = \{A, B, C, \ldots\}$, $A, B, C, \ldots \subseteq \Omega$.

Definition 3 (σ-algebra). $\Sigma$ is called a σ-algebra (σ-field) on $\Omega$ iff:
i) $\Omega \in \Sigma$;
ii) $A \in \Sigma \Rightarrow \Omega \setminus A \in \Sigma$;
iii) $A_i \in \Sigma \Rightarrow \bigcup_{i=1}^{\infty} A_i \in \Sigma$.

Definition 4 (Measurable space). An ordered pair $(\Omega, \Sigma)$, consisting of a universal set $\Omega$ and a σ-algebra $\Sigma$ on $\Omega$, is called a measurable space.

SLIDE 16

Example 2 (Measurable space). $(\Omega, \Sigma)$, where $\Sigma = \{\Omega, \emptyset\}$.
Example 3 (Measurable space). $(\mathbb{R}^n, \mathcal{B}^n)$, where $\mathcal{B}^n$ is the Borel σ-algebra (the smallest σ-algebra containing all n-dimensional open rectangles).

Definition 5 (Probability measure). A real-valued function P on a σ-algebra $\Sigma$ is called a probability measure iff:
i) $P(A) \geq 0$, $A \in \Sigma$;
ii) $P(\Omega) = 1$;
iii) $P\!\left(\bigcup_{i=1}^{\infty} A_i\right) = \sum_{i=1}^{\infty} P(A_i)$ for $A_i \cap A_j = \emptyset$, $i \neq j$.

Definition 6 (Probability space). An ordered triple $(\Omega, \Sigma, P)$, consisting of a universal set $\Omega$, a σ-algebra $\Sigma$ on $\Omega$, and a probability (measure) P on $\Sigma$, is called a probability space.

SLIDE 17

  • 3. Conditional probability:

Definition 7 (Conditional probability). Consider a probability space $(\Omega, \Sigma, P)$ and sets $A, B \in \Sigma$, $P(B) > 0$. Then, the conditional probability $P(A\,|\,B)$ is defined as

$$P(A\,|\,B) \equiv \frac{P(A \cap B)}{P(B)}.$$

Definition 8 (Independent sets). Consider a probability space $(\Omega, \Sigma, P)$ and sets $A, B \in \Sigma$. The sets are (P-) independent iff

$$P(A \cap B) = P(A)\,P(B).$$

Definition 9 (Mutually exclusive and exhaustive sets). Consider a space $(\Omega, \Sigma, P)$ and a set $\{B_1, \ldots, B_n\}$, $B_i \in \Sigma$ and $P(B_i) > 0$. When

$$B_i \cap B_j = \emptyset \ (i \neq j) \quad\text{and}\quad \bigcup_{i=1}^{n} B_i = \Omega,$$

the sets $B_i$ are called mutually exclusive and ($\Omega$-) exhaustive.

SLIDE 18

Proposition 1 (Law of Total Probability, LTP). Let $(\Omega, \Sigma, P)$ be a probability space and $\{B_j\}$ mutually exclusive and $\Omega$-exhaustive sets. Then $P(A)$, $A \in \Sigma$, decomposes as

$$P(A) = \sum_{j=1}^{n} P(B_j)\,P(A\,|\,B_j).$$

Proof.

$$P(A) = P(A \cap \Omega) = P\!\left(A \cap \bigcup_{j=1}^{n} B_j\right) = P\!\left(\bigcup_{j=1}^{n} (A \cap B_j)\right) = \sum_{j=1}^{n} P(A \cap B_j) = \sum_{j=1}^{n} P(B_j)\,P(A\,|\,B_j).\ \square$$

Theorem 1 (Bayes). Let $(\Omega, \Sigma, P)$ be a probability space, $\{B_j\}$ mutually exclusive and $\Omega$-exhaustive sets, and $P(A) > 0$, $A \in \Sigma$. Then,

$$P(B_i\,|\,A) = \frac{P(B_i)\,P(A\,|\,B_i)}{\sum_{j=1}^{n} P(B_j)\,P(A\,|\,B_j)}.$$

Proof. $P(B_i)\,P(A\,|\,B_i) = P(A \cap B_i) = P(A)\,P(B_i\,|\,A)$ follows from Def. 7, whereas $P(A)$ decomposes according to the LTP. $\square$
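The LTP and Bayes' theorem can be traced numerically. A small sketch with two made-up hypotheses (the priors and likelihoods below are hypothetical numbers, chosen only for illustration):

```python
# Two mutually exclusive, exhaustive hypotheses B_1, B_2
# with priors P(B_j) and likelihoods P(A|B_j).
priors = [0.3, 0.7]
likelihoods = [0.9, 0.2]

# Law of Total Probability: P(A) = sum_j P(B_j) * P(A|B_j)
p_A = sum(p * l for p, l in zip(priors, likelihoods))

# Bayes: P(B_i|A) = P(B_i) * P(A|B_i) / P(A)
posteriors = [p * l / p_A for p, l in zip(priors, likelihoods)]
```

As required, the posteriors sum to one because P(A) is exactly the LTP normalization.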

SLIDE 19

  • 4. Random variables:

Definition 10 (Scalar random variable). Given a probability space $(\Omega, \Sigma, P)$, let a function $X: \Omega \to \mathbb{R}$ be $\Sigma$-measurable, i.e.

$$A_{X \le x} \equiv \{\omega \in \Omega : X(\omega) \le x\} \in \Sigma;\qquad x \in \mathbb{R}.$$

Then, X is called a (real-valued, scalar) random variable, while x is called a realization of X.

Definition 11 (Random vector). Given a probability space $(\Omega, \Sigma, P)$, let a function $\mathbf{X}: \Omega \to \mathbb{R}^n$ be $\Sigma$-measurable,

$$A_{\mathbf{X} \le \mathbf{x}} \equiv \{\omega \in \Omega : \mathbf{X}(\omega) \le \mathbf{x}\} \in \Sigma;\qquad \mathbf{x} \in \mathbb{R}^n.$$

Then, $\mathbf{X}$ is called a (real-valued) random vector, while $\mathbf{x}$ is called a realization of $\mathbf{X}$.

Remark 1. The notion of chance ('physical randomness') is avoided in Defs. 10 and 11.

SLIDE 20

  • 5. Probability distributions:

Definition 12 (Probability distribution). A function $\Pr: \mathcal{B}^1 \to [0,1]$, called a probability distribution, is defined as the image measure of P by the random variable X, such that

$$\Pr(S^1) = P\bigl[X^{-1}(S^1)\bigr] \equiv P \circ X^{-1}(S^1);\qquad S^1 \in \mathcal{B}^1,\quad X^{-1}(S^1) \in \Sigma,$$

i.e. $(\Omega, \Sigma, P) \xrightarrow{X} (\mathbb{R}, \mathcal{B}^1, \Pr)$.

Remark 2. Probability vs. Quantum Mechanics:

$$\hat{X}\,|x\rangle = x\,|x\rangle;\qquad \psi(x) \equiv \langle x\,|\,\psi\rangle,\qquad \langle\psi\,|\,\psi\rangle = \int \psi^{*}(x)\,\psi(x)\,dx = 1;\qquad \langle\Psi\,|\,\Phi\rangle, \ldots, \langle\Psi\,|\,\Psi\rangle = 1.$$

SLIDE 21

Definition 13 (Cumulative distribution function, cdf). Given a probability space $(\Omega, \Sigma, P)$, a random variable X, and a set $A_{X \le x} \equiv \{\omega \in \Omega : X(\omega) \le x\}$, the (cumulative) distribution function $F_X: \mathbb{R} \to [0,1]$ is defined as

$$F_X(x) \equiv P(A_{X \le x}).$$

Properties of $F_X(x)$:
  • non-decreasing,
  • $F_X(-\infty) = 0$, $F_X(+\infty) = 1$,
  • $\Pr(S) = F_X(x_2) - F_X(x_1)$; $S = (x_1, x_2] \subseteq \mathbb{R}$.

SLIDE 22

Example 4 (Discrete distribution). $F_X(x)$ discontinuous at $x = x_i$. [Figure: step-like cdf $F_X(x)$ vs. x.]

Definition 14 (Probability mass function, pmf). Given a discrete distribution with $F_X(x)$ discontinuous at $x = x_i$, the probability mass function $p_X(x)$ is defined as

$$p_X(x) \equiv \Pr(X = x).$$

SLIDE 23

Example 5 (Continuous distribution). $F_X(x)$ a continuous function. [Figure: smooth cdf $F_X(x)$ vs. x.]

Definition 15 (Probability density function, pdf). The cdf of a continuous distribution is expressible as a (Lebesgue) integral of a (non-negative) probability density function $f_X(x)$:

$$F_X(x) = \int_{-\infty}^{x} f_X(x')\,dx'.$$

Also,

$$f_X(x) = \frac{dF_X(x)}{dx} = [F_X(x)]' \quad\text{and}\quad \Pr(S) = \int_{S} f_X(x)\,dx;\qquad S \subseteq \mathbb{R}.$$

SLIDE 24

Definition 16 (Mean of a distribution).

$$\langle x\rangle \equiv \begin{cases} \int x\,f_X(x)\,dx \\ \sum_i x_i\,p_X(x_i) \end{cases}$$

Definition 17 (Variance of a distribution).

$$\mathrm{Var}(x) \equiv \bigl\langle (x - \langle x\rangle)^2 \bigr\rangle \equiv \begin{cases} \int (x - \langle x\rangle)^2 f_X(x)\,dx \\ \sum_i (x_i - \langle x\rangle)^2\,p_X(x_i) \end{cases}$$

Definition 18 (Support of a continuous distribution).

$$V_X \equiv \{x \in \mathbb{R} : f_X(x) > 0\}.$$
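Defs. 16 and 17 can be exercised directly, once for a pmf and once for a pdf. A small sketch of mine (stdlib only; the continuous case integrates the exponential density of Example 1 by the midpoint rule):

```python
import math

# Discrete case: a fair die, p_X(x_i) = 1/6.
xs = [1, 2, 3, 4, 5, 6]
mean_d = sum(x / 6 for x in xs)                     # <x> = 3.5
var_d = sum((x - mean_d) ** 2 / 6 for x in xs)      # Var = 35/12

# Continuous case: exponential with tau = 2; <x> = tau and Var(x) = tau^2,
# checked here by numerically integrating Defs. 16 and 17.
tau = 2.0
dt = 1e-3
ts = [k * dt + dt / 2 for k in range(int(60 * tau / dt))]
f = [math.exp(-t / tau) / tau for t in ts]
mean_c = sum(t * fi * dt for t, fi in zip(ts, f))
var_c = sum((t - mean_c) ** 2 * fi * dt for t, fi in zip(ts, f))
```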

SLIDE 25

  • Defs. 12 – 18 extend without change to random vectors:

$$\Pr(\mathbf{S}),\ F_{\mathbf{X}}(\mathbf{x}),\ p_{\mathbf{X}}(\mathbf{x}),\ f_{\mathbf{X}}(\mathbf{x}),\ \langle\mathbf{x}\rangle,\ \mathrm{Var}(\mathbf{x}),\ V_{\mathbf{X}}.$$

Definition 19 (Independent random variables). Components of a random vector $\mathbf{X} = (X_1, X_2)$ are called independent random variables iff

$$F_{\mathbf{X}}(\mathbf{x}) = F_{X_1}(x_1)\,F_{X_2}(x_2).$$

If, in addition, the functions $F_{X_1}$ and $F_{X_2}$ are the same, $X_1$ and $X_2$ are called independent identically distributed random variables (i.i.d.).

Definition 20 (Marginal distributions). Consider a random vector $\mathbf{X} = (X_1, X_2)$. The marginal cdf $F_{X_1}(x_1)$ and pdf $f_{X_1}(x_1)$ are defined as

$$F_{X_1}(x_1) \equiv F_{\mathbf{X}}(x_1, +\infty) \quad\text{and}\quad f_{X_1}(x_1) \equiv \int f_{\mathbf{X}}(x_1, x_2)\,dx_2.$$

SLIDE 26

  • 6. Transformations of probability distributions:

Proposition 2. Let X and Y be continuous random variables, defined on a probability space $(\Omega, \Sigma, P)$, and let s be a function on $V_X$, such that $Y = s(X)$:

$$(\Omega, \Sigma, P) \xrightarrow{X} (\mathbb{R}, \mathcal{B}, \Pr\nolimits_X) \xrightarrow{s} (\mathbb{R}, \mathcal{B}, \Pr\nolimits_Y).$$

Let, in addition, $s'(x) \neq 0\ \forall x \in V_X$. Then,

$$F_Y(y) = \begin{cases} F_X[s^{-1}(y)]; & s'(x) > 0 \\ 1 - F_X[s^{-1}(y)]; & s'(x) < 0 \end{cases} \qquad (1)$$

$$f_Y(y) = f_X[s^{-1}(y)]\,\bigl|[s^{-1}(y)]'\bigr|. \qquad (2)$$

SLIDE 27

Proof.

$$A_{Y \le y} = \{\omega \in \Omega : Y(\omega) \le y\} = \{\omega : s[X(\omega)] \le y\} = \begin{cases} \{\omega : X(\omega) \le s^{-1}(y)\}; & s'(x) > 0 \\ \{\omega : X(\omega) \ge s^{-1}(y)\}; & s'(x) < 0 \end{cases} = \begin{cases} A_{X \le s^{-1}(y)}; & s'(x) > 0 \\ A_{X \ge s^{-1}(y)}; & s'(x) < 0. \end{cases}\ \square$$

  • Eq. (2) extends to random vectors:

$$f_{\mathbf{Y}}(\mathbf{y}) = f_{\mathbf{X}}[\mathbf{s}^{-1}(\mathbf{y})]\,\left|\frac{\partial\,\mathbf{s}^{-1}(\mathbf{y})}{\partial\,\mathbf{y}}\right|.$$
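Eq. (2) is easy to check on a concrete monotone map. A sketch of my own choosing (not from the slides): with X ~ Uniform(0,1) and y = s(x) = −ln x, the transformation formula must reproduce the Exponential(1) density e^(−y).

```python
import math

# X ~ Uniform(0,1): f_X = 1 on (0,1).
def f_X(x):
    return 1.0 if 0.0 < x < 1.0 else 0.0

# y = s(x) = -ln x  =>  s^{-1}(y) = exp(-y), with derivative -exp(-y).
def f_Y(y):
    """f_Y(y) = f_X(s^{-1}(y)) * |[s^{-1}(y)]'|, per Eq. (2)."""
    s_inv = math.exp(-y)
    jacobian = abs(-math.exp(-y))
    return f_X(s_inv) * jacobian

# Should equal exp(-y) at each test point:
vals = [f_Y(y) for y in (0.5, 1.0, 2.0)]
```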

SLIDE 28

Example 6 (Convolution). Let $X_1$ and $X_2$ be independent, $f_{\mathbf{X}}(x_1, x_2) = f_{X_1}(x_1)\,f_{X_2}(x_2)$, and let

$$S \equiv X_1 + X_2,\quad X' \equiv X_1:\qquad f_{X'S}(x_1, s) = f_{X_1}(x_1)\,f_{X_2}(s - x_1)$$

$$\Longrightarrow\qquad f_S(s) = \int f_{X_1}(x_1)\,f_{X_2}(s - x_1)\,dx_1.$$

Properties of convolutions:

$$1.\ \langle s\rangle = \langle x_1\rangle + \langle x_2\rangle;\qquad 2.\ \mathrm{Var}(s) = \mathrm{Var}(x_1) + \mathrm{Var}(x_2);$$

$$3.\ X_1 \sim N(\mu_1, \sigma_1),\ X_2 \sim N(\mu_2, \sigma_2) \ \Longrightarrow\ S \sim N\!\left(\mu_1 + \mu_2,\ \sqrt{\sigma_1^2 + \sigma_2^2}\right).$$
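The three convolution properties can be verified by a seeded Monte Carlo sketch (illustrative; the parameter values are my own):

```python
import random
import statistics

random.seed(1)

n = 100_000
# Independent X1 ~ N(1, 2) and X2 ~ N(3, 1.5); S = X1 + X2.
x1 = [random.gauss(1.0, 2.0) for _ in range(n)]
x2 = [random.gauss(3.0, 1.5) for _ in range(n)]
s = [a + b for a, b in zip(x1, x2)]

mean_s = statistics.fmean(s)       # expected: 1 + 3 = 4
var_s = statistics.pvariance(s)    # expected: 2^2 + 1.5^2 = 6.25
```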

SLIDE 29

  • 7. Conditional probability distributions:

Let $\mathbf{X} = (Y, Z)$ with joint density $f_{\mathbf{X}}(y, z)$, and

$$A \equiv (-\infty, y] \times \mathbb{R},\qquad B \equiv \mathbb{R} \times (z, z+h];\qquad \Pr(B) = \int_{z}^{z+h} f_Z(z')\,dz' > 0.$$

Recall:

$$\Pr(A\,|\,B) = \frac{\Pr(A \cap B)}{\Pr(B)} = \frac{\displaystyle\int_{z}^{z+h}\!\left[\int_{-\infty}^{y} f_{\mathbf{X}}(y', z')\,dy'\right] dz'}{\displaystyle\int_{z}^{z+h} f_Z(z')\,dz'}.$$

a) In the limit $h \to 0$:

$$\Pr(A\,|\,B) \to \frac{\int_{-\infty}^{y} f_{\mathbf{X}}(y', z)\,dy'}{f_Z(z)};$$

b) $(Y, Z) \in A \Leftrightarrow Y \in (-\infty, y]$, while $B$ shrinks to $\{Z = z\}$, so

$$\Pr(A\,|\,B) \to F(y\,|\,Z = z) \equiv F(y\,|\,z) = \int_{-\infty}^{y} f(y'\,|\,z)\,dy'.$$

SLIDE 30

$$\Longrightarrow\qquad f(y\,|\,z) = \frac{f_{\mathbf{X}}(y, z)}{f_Z(z)};\qquad f_Z(z) = \int f_{\mathbf{X}}(y, z)\,dy > 0.$$

  • R. Jamnik (1975), § 11, pp. 475-477.
  • I. Kuščer, A. Kodre (1994), § 11.3, pp. 281-283.

Borel-Kolmogorov paradox:
  • A. N. Kolmogorov (1933), Chap. V, § 2, pp. 44-45.
  • M. M. Rao (1993), § 3.2, pp. 65-66.

Consistent definition (Kolmogorov):
  • M. M. Rao (1993), § 2.1, pp. 25-26 and pp. 29-30; § 2.4, pp. 51-54.

SLIDE 31

Example 7 (Transformation of a conditional probability distribution).

$$\mathbf{X} = (X_1, X_2),\ f_{\mathbf{X}}(x_1, x_2);\qquad \mathbf{Y} = \mathbf{s}(\mathbf{X}):\ Y_1 = s_1(X_1),\ Y_2 = s_2(X_2),\quad [s_{1,2}^{-1}(y_{1,2})]' \neq 0$$

$$\Longrightarrow\qquad f(y_1\,|\,y_2) = f\bigl[s_1^{-1}(y_1)\,\big|\,s_2^{-1}(y_2)\bigr]\,\bigl|[s_1^{-1}(y_1)]'\bigr|.$$

SLIDE 32

  • 8. Parametric families:

Definition 21 (Parametric family). The term parametric family stands for a collection of probability distributions that differ only in the value θ of a parameter Θ:

$$I = \{\Pr\nolimits_{\theta} : \theta \in V_{\Theta}\}.$$

Example 8 (Reparametrization). Let a probability distribution for a continuous random variable X belong to a family I, $f_X(x) = f_I(x, \theta)$, and let

$$y \equiv s(x),\ s'(x) \neq 0 \quad\text{and}\quad \lambda \equiv \tilde{s}(\theta),\ \tilde{s}'(\theta) \neq 0.$$

Then,

$$f_{I'}(y, \lambda) = f_I\bigl[s^{-1}(y), \tilde{s}^{-1}(\lambda)\bigr]\,\bigl|[s^{-1}(y)]'\bigr|.$$

Remark 3. Due to a complete analogy between the transformations in Examples 7 and 8 we define

$$f_I(x, \theta) \equiv f_I(x\,|\,\theta).$$

SLIDE 33

Example 10 (Families of discrete distributions).

  • 1. Binomial:

$$p_I(n\,|\,N, \theta) = \binom{N}{n}\,\theta^{n}(1-\theta)^{N-n};\qquad \theta \in [0,1],\ n, N \in \mathbb{N}_0,\ n \le N.$$

  • 2. Poisson:

$$p_I(n\,|\,\mu) = \frac{\mu^{n} e^{-\mu}}{n!};\qquad \mu \in \mathbb{R}^{+},\ n \in \mathbb{N}_0.$$

  • 3. Multinomial:

$$p_I(n_1, \ldots, n_k\,|\,N, \theta_1, \ldots, \theta_k) = \frac{N!}{n_1!\cdots n_k!}\,\theta_1^{n_1}\cdots\theta_k^{n_k};$$

$$\theta_i \in [0,1],\ n_i \in \mathbb{N}_0,\ k \ge 2;\qquad \sum_{i=1}^{k}\theta_i = 1,\quad \sum_{i=1}^{k} n_i = N.$$
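The binomial and Poisson pmfs above can be coded directly and sanity-checked for normalization and mean (a stdlib sketch of mine; the parameter values are arbitrary):

```python
import math

def binom_pmf(n, N, theta):
    """p_I(n|N, theta) = C(N,n) theta^n (1-theta)^(N-n)."""
    return math.comb(N, n) * theta**n * (1 - theta)**(N - n)

def poisson_pmf(n, mu):
    """p_I(n|mu) = mu^n e^{-mu} / n!."""
    return mu**n * math.exp(-mu) / math.factorial(n)

N, theta = 10, 0.3
binom_total = sum(binom_pmf(n, N, theta) for n in range(N + 1))          # = 1
binom_mean = sum(n * binom_pmf(n, N, theta) for n in range(N + 1))       # = N*theta

mu = 4.0
poisson_total = sum(poisson_pmf(n, mu) for n in range(60))               # ≈ 1
poisson_mean = sum(n * poisson_pmf(n, mu) for n in range(60))            # ≈ mu
```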

SLIDE 34

Example 11 (Families of continuous distributions).

  • 1. Uniform:

$$f_I(x\,|\,\mu, \sigma) = \begin{cases} \dfrac{1}{2\sigma}; & -1 \le \dfrac{x-\mu}{\sigma} \le 1 \\ 0; & \text{otherwise} \end{cases}\qquad x, \mu \in \mathbb{R},\ \sigma \in \mathbb{R}^{+};$$

  • 2. Triangular:

$$f_I(x\,|\,\mu, \sigma) = \begin{cases} \dfrac{1}{\sigma}\left(1 + \dfrac{x-\mu}{\sigma}\right); & -1 \le \dfrac{x-\mu}{\sigma} < 0 \\ \dfrac{1}{\sigma}\left(1 - \dfrac{x-\mu}{\sigma}\right); & 0 \le \dfrac{x-\mu}{\sigma} \le 1 \\ 0; & \text{otherwise.} \end{cases}$$

SLIDE 35

Example 11 (Continued).

  • 3. Normal (Gauss):

$$f_I(x\,|\,\mu, \sigma) = \frac{1}{\sqrt{2\pi}\,\sigma}\exp\left\{-\frac{(x-\mu)^2}{2\sigma^2}\right\};\qquad x \sim N(\mu, \sigma);$$

  • 4. Cauchy:

$$f_I(x\,|\,\mu, \sigma) = \frac{1}{\pi\sigma}\left[1 + \left(\frac{x-\mu}{\sigma}\right)^2\right]^{-1};$$

  • 5. Exponential:

$$f_I(t\,|\,\tau) = \frac{1}{\tau}\exp\left\{-\frac{t}{\tau}\right\};\qquad t, \tau \in \mathbb{R}^{+};$$

  • 6. Weibull:

$$f_I(t\,|\,\tau, \theta) = \frac{\theta}{\tau}\left(\frac{t}{\tau}\right)^{\theta-1}\exp\left\{-\left(\frac{t}{\tau}\right)^{\theta}\right\};\qquad t, \tau, \theta \in \mathbb{R}^{+};$$

$$x \equiv \ln t,\qquad \mu \equiv \ln\tau,\qquad \sigma \equiv \theta^{-1}.$$

SLIDE 36

[Figure: densities $f_I$ of $N(0,1)$ and $C(0,1)$ vs. x.]

Definition 22 (Location and scale parameters). When

$$F_I(x\,|\,\mu, \sigma) \equiv \int_{-\infty}^{x} f_I(x'\,|\,\mu, \sigma)\,dx' = \Phi\!\left(\frac{x-\mu}{\sigma}\right),$$

$\mu$ is called a location and $\sigma$ a scale parameter.

SLIDE 37

Example 12 (Exponential family revisited).

$$f_I(t\,|\,\tau) = \frac{1}{\tau}\exp\left\{-\frac{t}{\tau}\right\};\ t, \tau \in \mathbb{R}^{+} \ \Longrightarrow\ F_I(t\,|\,\tau) = 1 - \exp\left\{-\frac{t}{\tau}\right\};$$

$$x \equiv \ln t,\ \mu \equiv \ln\tau \ \Longrightarrow\ F_{I'}(x\,|\,\mu) = 1 - \exp\left\{-e^{x-\mu}\right\} \equiv \Phi(x - \mu).$$

Example 11 (Continued).

  • 7. Multidimensional Normal:

$$f_I(\mathbf{x}\,|\,\mathbf{m}, V) = \frac{1}{(2\pi)^{n/2}\sqrt{\det V}}\exp\left\{-\frac{1}{2}(\mathbf{x}-\mathbf{m})^{T} V^{-1}(\mathbf{x}-\mathbf{m})\right\};$$

$\mathbf{x}, \mathbf{m} \in \mathbb{R}^n$; $V$ an $n \times n$ symmetric, positive-definite matrix.
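Example 12 can be verified numerically: under x = ln t, μ = ln τ the exponential cdf becomes a pure location family. A small sketch of mine:

```python
import math

def F_exp(t, tau):
    """Exponential cdf: F_I(t|tau) = 1 - exp(-t/tau)."""
    return 1.0 - math.exp(-t / tau)

def F_loc(x, mu):
    """Transformed cdf: F_{I'}(x|mu) = 1 - exp(-exp(x - mu))."""
    return 1.0 - math.exp(-math.exp(x - mu))

tau = 2.0
# The two cdfs must agree at corresponding points t <-> x = ln t:
checks = [abs(F_exp(t, tau) - F_loc(math.log(t), math.log(tau)))
          for t in (0.1, 1.0, 5.0)]

# Location property: F_loc depends only on the difference x - mu.
shift = abs(F_loc(1.0, 0.5) - F_loc(1.0 + 3.0, 0.5 + 3.0))
```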

SLIDE 38

  • 9. The Central Limit Theorem (CLT):

Example 13 (Binomial vs. Normal distribution).

$$p_I(n\,|\,N, \theta) = \binom{N}{n}\,\theta^{n}(1-\theta)^{N-n};\qquad \mu = N\theta,\quad \sigma = \sqrt{N\theta(1-\theta)}:$$

$$F_I(n\,|\,N, \theta) = \sum_{i=0}^{n} p_I(i\,|\,N, \theta) \approx \int_{-\infty}^{n+0.5} \frac{1}{\sqrt{2\pi}\,\sigma}\exp\left\{-\frac{(x'-\mu)^2}{2\sigma^2}\right\} dx' = F_{I'}(n+0.5\,|\,\mu, \sigma).$$

[Figure: $F_I(n\,|\,N, \theta)$ compared with $F_{I'}(x\,|\,\mu, \sigma)$ vs. x.]

SLIDE 39

Theorem 2 (CLT, Lévy). Consider $X_1, \ldots, X_n$ i.i.d. with $\langle x_i\rangle = \langle x\rangle$ and $\mathrm{Var}(x_i) = \mathrm{Var}(x) < \infty$. Then,

$$\lim_{n\to\infty} \bar{x}_n \sim N\!\left(\langle x\rangle,\ \sqrt{\frac{\mathrm{Var}(x)}{n}}\right);\qquad \bar{x}_n \equiv \frac{1}{n}\sum_{i=1}^{n} x_i.$$

  • Proof. By invoking characteristic functions.

(Counter-)Example 14 (CLT vs. Cauchy family). $\mathrm{Var}(x_i)$ does not exist;

$$\mathrm{FWHM}(\bar{x}_n) = \mathrm{FWHM}(x_i) \quad\Longrightarrow\quad \bar{x}_n \sim C\bigl(\mu,\ \mathrm{FWHM}(x)/2\bigr).$$
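Theorem 2 and the Cauchy counterexample can both be seen in a seeded simulation (my own sketch; sample sizes and seeds are arbitrary). For exponential draws the spread of sample means shrinks like 1/√n; for Cauchy draws the sample mean is Cauchy again, so a robust width measure (the interquartile range, used here because the moments diverge) does not shrink at all.

```python
import math
import random
import statistics

random.seed(7)

def sample_means(draw, n, reps):
    """Means of `reps` samples, each of size n, drawn by `draw()`."""
    return [statistics.fmean(draw() for _ in range(n)) for _ in range(reps)]

# Exponential(tau=1): Var exists, so means of n=100 draws concentrate
# near <x> = 1 with spread ~ 1/sqrt(100) = 0.1 (Theorem 2).
exp_means = sample_means(lambda: random.expovariate(1.0), 100, 2000)
exp_spread = statistics.pstdev(exp_means)

# Standard Cauchy via inverse-cdf sampling: tan(pi*(U - 1/2)).
def cauchy():
    return math.tan(math.pi * (random.random() - 0.5))

c_means_10 = sample_means(cauchy, 10, 1000)
c_means_1000 = sample_means(cauchy, 1000, 1000)

def iqr(xs):
    """Interquartile range: quartiles are at ±1 for the standard Cauchy, IQR = 2."""
    q = statistics.quantiles(xs, n=4)
    return q[2] - q[0]
```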

SLIDE 40

  • 10. Invariant families:

Example 15 (Invariance of a location-scale family).

$$F_I(x\,|\,\mu, \sigma) = \Phi\!\left(\frac{x-\mu}{\sigma}\right);\qquad l[(a,b), x] \equiv a x + b,\quad l[(a,b), (\mu, \sigma)] \equiv (a\mu + b,\ a\sigma);\quad (a, b) \in \mathbb{R}^{+} \times \mathbb{R} = G:$$

$$F_{I'}\bigl(l[(a,b), x]\,\big|\,l[(a,b), (\mu, \sigma)]\bigr) = F_I(x\,|\,\mu, \sigma).$$

Example 16 (Invariance under inversion). The Normal and the Cauchy family are invariant under simultaneous inversions of x and $\mu$. We say that they have positive parity.

SLIDE 41

Example 17 (Invariance of the exponential family).

$$F_I(t\,|\,\tau) = 1 - \exp\left\{-\frac{t}{\tau}\right\};\qquad l[a, t] \equiv a t,\quad l[a, \tau] \equiv a\tau;\quad a \in \mathbb{R}^{+} = G:$$

$$F_I\bigl(l[a, t]\,\big|\,l[a, \tau]\bigr) = F_I(t\,|\,\tau).$$

Proposition 3 (Invariance under Lie group). A family of probability distributions of a scalar random variable that is invariant under a scalar Lie group is reducible to a location family.
