HYPOTHESIS TESTING PART I RECAP & OUTLOOK BAYESIAN PARAMETER - PowerPoint PPT Presentation

INTRODUCTION TO DATA ANALYSIS HYPOTHESIS TESTING PART I

RECAP & OUTLOOK BAYESIAN PARAMETER ESTIMATION FREQUENTIST HYPOTHESIS TESTING ▸ model captures prior beliefs ▸ model captures a hypothetically M M about data-generating process assumed data-generating process ▸ prior over latent parameters ▸ fix parameter value of interest ▸ likelihood of data ▸ likelihood of data ▸ Bayesian posterior inference using ▸ single out some aspect of the data as most important (test statistic) observed data D obs ▸ look at distribution of test statistic ▸ compare posterior beliefs to some given the assumed model parameter value of interest (sampling distribution) ▸ check likelihood of test statistic applied to the observed data D obs

CAVEAT ! FREQUENTIST HYPOTHESIS TESTING ▸ there are at least three flavors of frequentist hypothesis testing ▸ Fisher ▸ Neyman-Pearson ▸ modern hybrid NHST [null-hypothesis significance testing] ▸ not every text book is clear on these differences and/or which flavor it endorses ▸ there is also no unanimity of practice between or within research fields

LEARNING GOALS ▸ understand basic idea of frequentist hypothesis testing ▸ understand what a p-value is ▸ definition, one- vs two-sided ▸ test statistic & sampling distribution ▸ relation to confidence intervals ▸ significance levels & -error α

p -value

PRELIMINARIES ▸ research hypothesis: theoretically implied answer to a main question of interest for research ▸ e.g., truth-judgements of sentences with presupposition failure at chance level? (King of France) ▸ e.g., faster reactions in reaction time trials than in go/No-go trials? (Mental Chronometry) ▸ null hypothesis: specific assumption made for purposes of analysis ▸ fix parameter value in a data-generating model for technical reasons ▸ analogy: useful assumption in mathematical proof (e.g., in reductio ad absurdum) ▸ alternative hypothesis: the antagonist of the null hypothesis, specified to relate the null hypothesis to the research hypothesis

P-VALUE

Binomial Model

BAYESIAN BINOMIAL MODEL (AS ORIGINALLY INTRODUCED) θ ∼ Beta(…) N θ k ∼ Binomial( θ , N ) k

BAYESIAN BINOMIAL MODEL (EXTENDED) θ θ ∼ Beta(…) x i ∼ Bernoulli( θ 0 ) x i N N ∑ k = x i i =1 k

FREQUENTIST BINOMIAL MODEL [doted line = “working assumption”] θ 0 x i ∼ Bernoulli( θ 0 ) [likelihood of “raw” data] N ∑ k = x i [test statistic (derived from “raw” data)] i =1 x i N FACT: The sampling distribution of is: k k ∼ Binomial( θ 0 , N ) k

⃗ FREQUENTIST BINOMIAL MODEL ▸ null-hypothesis: θ = θ 0 θ 0 ▸ test statistic: derived from “raw” data k x ▸ the most important (numerical) aspect of the data for the current testing purposes x i N ▸ sampling distribution: likelihood of observing a particular value of in this model k ▸ notice: the observed data has not yet made D obs any appearance k remark: sometimes summary statistics of other than the ▸ D obs test statistic might be used in the model

FREQUENTIST BINOMIAL MODEL ▸ likelihood of data: random variable 𝒠 | H 0 θ 0 N ∏ P ( 𝒠 | H 0 = ⟨ x 1 , …, x N ⟩ ) = Bernoulli( x i , θ 0 ) i =1 ▸ sampling distribution: random variable T | H 0 x i N P ( T | H 0 = k ) = Binomial( k , θ 0 , N ) k

Binomial p-values

BINOMIAL TEST ▸ 24/7 example: and N = 24 k = 7 ▸ t ( D obs ) = 7 P ( T | H 0 = k ) = Binomial( k , θ 0 , N ) ▸ ▸ p-value definition: p ( D obs ) = P ( T | H 0 ⪰ H 0, a t ( D obs )) we know this ??? we know this What counts as “more extreme evidence against the null hypothesis” is a context-sensitive notion that depends on the null-hypothesis and the alternative hypothesis because only when put together do null- and alternative hypothesis address the research question in the background.

BINOMIAL TEST ▸ compare two research questions ▸ we still use a point-valued null- hypothesis for technical reasons 1. Is the coin fair? ▸ the alternative hypothesis is ▸ H 0 : θ = 0.5 important to fix the meaning of ⪰ H 0, a ▸ H a : θ ≠ 0.5 2. Is the coin biased towards heads? ▸ H 0 : θ = 0.5 ▸ H a : θ < 0.5

BINOMIAL TEST ▸ Case 1: Is the coin fair? ▸ H 0 : θ = 0.5 ▸ H a : θ ≠ 0.5 ▸ which values of are k more extreme evidence against ? H 0

BINOMIAL TEST ▸ Case 1: Is the coin fair? ▸ H 0 : θ = 0.5 ▸ H a : θ ≠ 0.5 ▸ which values of are k more extreme evidence against ? H 0 ▸ anything that’s even less likely to occur

BINOMIAL TEST

BINOMIAL TEST ▸ Case 2: Is the coin biased towards heads? ▸ H 0 : θ = 0.5 ▸ H a : θ < 0.5 ▸ which values of are k more extreme evidence against ? H 0

BINOMIAL TEST ▸ Case 2: Is the coin biased towards heads? ▸ H 0 : θ = 0.5 ▸ H a : θ < 0.5 ▸ which values of are k more extreme evidence against ? H 0 ▸ anything even more in favor of H a

BINOMIAL TEST

p -value revisit

P-VALUE

significance α and -errors

SIGNIFICANCE LEVELS ▸ standardly we fix a significance level before the test α ▸ common values of are: α ▸ α = 0.05 ▸ α = 0.01 ▸ α = 0.001 ▸ if the p -value for the observed data passes the pre-established threshold of significance, we say that the test result was significant ▸ a significant test result is conventionally regarded as “strong enough” evidence against the null-hypothesis, so that we can reject the null hypothesis as a viable explanation of the data ▸ non-significant results are interpreted differently in different approaches (more later)

α -ERROR ▸ an -error (aka type-I error) occurs when we reject a true null hypothesis α ▸ by definition this type of error occurs, in the long run, with a proportion of no more than α ▸ it is in this way that frequentist statistic is subscribed and cherishes a regime of long-term error control on research results ▸ Bayesian approaches (usually) are not concerned with long-term error control

HYPOTHESIS TESTING PART I RECAP & OUTLOOK BAYESIAN PARAMETER - PowerPoint PPT Presentation

INTRODUCTION TO DATA ANALYSIS HYPOTHESIS TESTING PART I RECAP & OUTLOOK BAYESIAN PARAMETER ESTIMATION FREQUENTIST HYPOTHESIS TESTING model captures prior beliefs model captures a hypothetically M M about data-generating process

STAT 113 Hypothesis Testing I Colin Reimer Dawson Oberlin College October 5, 2017 1 / 17

Chapter 6 Hypothesis Testing What is Hypothesis Testing? the use of statistical

Chapter 6 Hypothesis Testing What is Hypothesis Testing? the use of statistical

STAT 215 Hypothesis Testing I Colin Reimer Dawson Oberlin College September 7, 2017 1 / 14

CME/STATS 195 CME/STATS 195 Lecture 7: Hypothesis Testing and Lecture 7: Hypothesis Testing and

Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis

Cluster Validity Hypothesis Random Graph Hypothesis Random Label Hypothesis Relative Criteria

Testing Specification testing Michel Bierlaire Introduction to choice models Differences from

Hypothesis Testing Mark Lunt Centre for Epidemiology Versus Arthritis University of Manchester

Hypothesis tests with binomial example STAT 587 (Engineering) Iowa State University October 2,

t -tests STAT 587 (Engineering) Iowa State University October 2, 2020 Statistical hypothesis

Testing 6.1 Specification testing Michel Bierlaire A short reminder on hypothesis testing

Hypothesis testing get data that differ from the null hypothesis. If the data would be quite

Lecture 4: Hypothesis Testing Ani Manichaikul amanicha@jhsph.edu 20 April 2007 1 / 69 Steps of

Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312, Spring 2019 Heckman

Bayesian hypothesis testing Dr. Jarad Niemi STAT 544 - Iowa State University March 7, 2019

Introduction to Android Development What is Android? Android is the customizable, easy to

Refresher on x ts and the plot () f u nction Arna u d Amsellem The R Trader VISU AL IZIN G TIME

Strengthening Ontarios Innovation System: The Role of Ontarios Innovation Agenda ISRN

Using Geometric Singular Perturbation Theory to Understand Singular Shocks Barbara Lee Keyfitz

Chromosome tracing with OligoFISSEQ Marc A. Marti-Renom CNAG-CRG ICREA Huy Nguyen

Class exercise Single-nucleotide polymorphism A single-nucleotide polymorphism (SNP,

BootcampR AN INTRODUCTION TO R Jason A. Heppler, PhD University of Nebraska at Omaha March 17,

HOW TO BECOME AN EFFECTIVE GROUP FACILITATOR How do I prepare? Know your Know your Know your