Introduction Marc H. Mehlman marcmehlman@yahoo.com University of - PowerPoint PPT Presentation

Introduction Marc H. Mehlman marcmehlman@yahoo.com University of New Haven “To understand God’s thoughts, we must study statistics, for these are the measure of his purpose.” – Florence Nightingale “Statistics: the mathematical theory of ignorance.” – Morris Kline “Statistics means never having to say you’re certain.” – Anonymous Marc Mehlman Marc Mehlman (University of New Haven) Introduction 1 / 23

Table of Contents Introduction 1 Studies 2 Blocking 3 Data Collection 4 Random Samples vs Simple Random Samples 5 Correlation 6 Error 7 Marc Mehlman Marc Mehlman (University of New Haven) Introduction 2 / 23

Introduction Introduction Introduction “Data. Data. Data. I can’t make bricks without clay.” – Sherlock Holmes “In God we trust. All others must bring data.” - W. Edwards Deming Marc Mehlman Marc Mehlman (University of New Haven) Introduction 3 / 23

Introduction Population and Sample The distinction between population and sample is basic to statistics. To make sense of any sample result, you must know what population the sample represents. The population in a statistical study is the entire group of The population in a statistical study is the entire group of individuals about which we want information. individuals about which we want information. A sample is the part of the population from which we actually A sample is the part of the population from which we actually collect information. We use information from a sample to draw collect information. We use information from a sample to draw conclusions about the entire population. conclusions about the entire population. Population Population Collect data from a representative Sample ... Sample Sample Make an Inference about the Population . 19 Marc Mehlman Marc Mehlman (University of New Haven) Introduction 4 / 23

Introduction Definition If the sample is the entire population, it is called the census . A variable is a measurable characteristic of individuals within the population. The distribution of a variable is the frequency it obtains it outputs. Data is a variable’s values from the sample. Statistics is the science of drawing inference from data about the population. Statistic’s Origins: Anecdotes and noticing patterns in random happenings (samples). Assumption: sample collected from a random subset of population, ie. sample is a random sample. Definition The population can be parameterized. For instance if one is interested in weights of all Americans of age x , then x is a parameter . Marc Mehlman Marc Mehlman (University of New Haven) Introduction 5 / 23

Introduction Example From the 50,000 residents of the town a Milford, 300 where selected randomly and asked if they have ever had cancer. The population is the 50,000 residents, the sample is the 300 randomly selected residents and the variable is the variable cancer/no cancer. It was too costly to contact all 50,000 residents so the actual distribution of cancer among the entire population is inferred from the distribution of the cancer of 300 randomly sampled residents. Definition (Types of Variables) qualitative (categorical): descriptive Examples: color of eyes, gender, city born in. quantitative: numeric Examples: height, miles per gallon, tem- perature, etc. Marc Mehlman Marc Mehlman (University of New Haven) Introduction 6 / 23

Introduction Definition (Types of Quantitative Variables) discrete: discrete range Examples: # of children someone has, number of coins in pocket continuous: continuous range Examples: weight, speed Definition (Categories of Data) nominal level of measurement labels – no ordering. Examples: colors or yes/no ordinal level of measurement ordered, but distances between data values are meaningless. Examples: grades, ranks interval level of measurement ordered, distance between data values have meaning, but no natural zero. Example: height of class members above sea level ratio level of measurement interval level + a natural zero (ratios meaningful). Example: height of class members above classroom floor. Marc Mehlman Marc Mehlman (University of New Haven) Introduction 7 / 23

Studies Studies Studies Marc Mehlman Marc Mehlman (University of New Haven) Introduction 8 / 23

Studies Observation vs. Experiment When our goal is to understand cause and effect, experiments are the only source of fully convincing data. The distinction between observational study and experiment is one of the most important in statistics. An observational study observes individuals and measures An observational study observes individuals and measures variables of interest but does not attempt to influence the variables of interest but does not attempt to influence the responses. The purpose is to describe some group or situation. responses. The purpose is to describe some group or situation. An experiment deliberately imposes some treatment on An experiment deliberately imposes some treatment on individuals to measure their responses. The purpose is to study individuals to measure their responses. The purpose is to study whether the treatment causes a change in the response. whether the treatment causes a change in the response. In contrast to observational studies, experiments don’t just observe individuals or ask them questions. They actively impose some treatment in order to measure the response. 5 Marc Mehlman Marc Mehlman (University of New Haven) Introduction 9 / 23

Studies Definition cross–sectional study data collected at one point in time retrospective study historical prospective or longitudinal study on going – collecting future data too. Example: Framingham Heart Study cohort study usually a longitudinal study – data is compared between cohorts (individuals who share similary characteristics or experience). Example: Minnesota Twin Family Study (MTFS) For an experiment: Definition single blinding subjects not aware if they are control group or treatment group. double blinding subjects and experimenters have no idea who is in control or treatment group. Marc Mehlman Marc Mehlman (University of New Haven) Introduction 10 / 23

Blocking Blocking Blocking Marc Mehlman Marc Mehlman (University of New Haven) Introduction 11 / 23

Blocking Blocking group like subjects together to reduce variance Example men with men and women with women. for instance, women maybe more prone to a disease than men. Or a treatment may help men and harm women. By blocking men and women apart, gender differences can be isolated. randomized block design vs completely randomized design – assign treatments randomly in each block versus assigning treatments to subjects at–large. “Block what you can; randomize what you cannot.” Moral: Group like subjects together to reduce variance. Marc Mehlman Marc Mehlman (University of New Haven) Introduction 12 / 23

Blocking Sample Size big = more reliable results. replication is the measure of reliable, ie., if the experiment is replicated one would get similar results. small cheaper, faster Marc Mehlman Marc Mehlman (University of New Haven) Introduction 13 / 23

Data Collection Data Collection Data Collection Marc Mehlman Marc Mehlman (University of New Haven) Introduction 14 / 23

Data Collection Data Collection Types of Sampling self–selected sample one could send out a questions in a mailing and only get answers from those who choose to reply. systematic sampling Example: every 50 th subject. convenience sampling sample those that are easiest to sample. stratified sampling sample from each strata (subgroup) cluster sampling sample everyone in randomly selected clusters stratified sampling gives better results – see blocking Multistage Sampling Example 1 take a random sample of size 8 of the states. 2 take a simple random sample of size 7 of the counties in each state. 3 take a random sample of 6 cities. 4 take a random sample of 5 voters. Marc Mehlman Marc Mehlman (University of New Haven) Introduction 15 / 23

Random Samples vs Simple Random Samples Random Samples vs Simple Random Samples Random Samples vs Simple Random Samples Marc Mehlman Marc Mehlman (University of New Haven) Introduction 16 / 23

Random Samples vs Simple Random Samples Random Samples vs Simple Random Samples Definition random sample , x 1 , · · · , x n 1 each subject is as likely to be x i as any other subject 2 the x 1 , · · · , x n are indep – just because x i = Bob does not mean that x j can not be Bob. simple random sample: out of N subjects chose n randomly so that the probability of any n subjects being chosen is the same as any other n � N N ! � subjects ( = n !( N − n )! ). n Marc Mehlman Marc Mehlman (University of New Haven) Introduction 17 / 23

Random Samples vs Simple Random Samples Random Samples vs Simple Random Samples Note: 1 subjects can be chosen more than once in a random sample - not so in a simple random sample. Simple random sample does not assume indep for the sample. 2 book’s definition is not quite right. Convention: When a simple random sample can be thought of as being a random sample? Answer: when sample size is 5% or less than the size of the entire population. Marc Mehlman Marc Mehlman (University of New Haven) Introduction 18 / 23

Correlation Correlation Correlation Marc Mehlman Marc Mehlman (University of New Haven) Introduction 19 / 23

Introduction Marc H. Mehlman marcmehlman@yahoo.com University of - PowerPoint PPT Presentation

Introduction Marc H. Mehlman marcmehlman@yahoo.com University of New Haven To understand Gods thoughts, we must study statistics, for these are the measure of his purpose. Florence Nightingale Statistics: the mathematical

INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION

Introduction ATV Introduction A T V Introduction A lphabet T V Introduction A lphabet

Brief Brief Introduction Introduction Brief Brief Introduction Introduction Zhengzhou

Brief Brief Introduction Introduction Brief Brief Introduction Introduction Zhengzhou

Shenzhen Cuilu jewelry Co., Ltd was founded in 1996 and its a large private enterprise

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Spectrum Painting Richard Shipman MW0RCZ ADARS 6th Jan 2020 Introduction Introduction

Introduction Introduction Introduction Introduction Outline Motivation Failures

Introduction Introduction Introduction Nationwide Cause for Concern 1

Team Introduction Experiments Outreach Problem Project Brainstorm Introduction Introduction

Lecture 1 Andreas Habegger Introduction Zynq Introduction Zynq Introduction Zynq PS vs. PL

Introduction to Web Design & Computer Principles Class 1 CSCI-UA 4 Introduction and Overview

Introduction to CICS Course introduction Course introduction What is CICS? What is an

INF5110 Compiler Construction Introduction Spring 2016 1 / 33 Outline 1. Introduction

INTRODUCTION I Syllabus INTRODUCTION I Syllabus I Why study labor economics? INTRODUCTION I

2018.06 01 SMILE5 Introduction S E 5 02 Alpha Cloud M I L 03 Company Introduction 04

Cardiovascular Health Banff 2012 CP1271671-5 Global Burden of Cardiovascular Disease 2002

BoXHED : B oosted e X act H azard E stimator with D ynamic covariates Xiaochen Wang Yale

Dag 2: Logistic regression Susanne Rosthj Biostatistisk Afdeling Institut for

FEASIBILITY STUDY School Committee Meeting April 25, 2018 PROJECT MANAGEMENT SMMA Agenda 1.

INTRODUCTION TO GENETIC EPIDEMIOLOGY (GBIO0015) Prof. Dr. Dr. K. Van Steen Introduction to

Intelligent Design Theory: The God-of-the-Gaps Rooted in Concordism Denis O. Lamoureux

Single molecule mechanical studies of acto-myosin Justin E. Molloy Francis Crick

Mathematical Modeling and Biology Bo Deng Introduction Examples of Models Bo Deng Consistency

Sambuz

Useful Links

Newsletter

Mail Us

Introduction Marc H. Mehlman marcmehlman@yahoo.com University of - PowerPoint PPT Presentation

Introduction Marc H. Mehlman marcmehlman@yahoo.com University of New Haven To understand Gods thoughts, we must study statistics, for these are the measure of his purpose. Florence Nightingale Statistics: the mathematical

INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION INTRODUCTION

Introduction ATV Introduction A T V Introduction A lphabet T V Introduction A lphabet

Brief Brief Introduction Introduction Brief Brief Introduction Introduction Zhengzhou

Brief Brief Introduction Introduction Brief Brief Introduction Introduction Zhengzhou

Shenzhen Cuilu jewelry Co., Ltd was founded in 1996 and its a large private enterprise

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Spectrum Painting Richard Shipman MW0RCZ ADARS 6th Jan 2020 Introduction Introduction

Introduction Introduction Introduction Introduction Outline Motivation Failures

Introduction Introduction Introduction Nationwide Cause for Concern 1

Team Introduction Experiments Outreach Problem Project Brainstorm Introduction Introduction

Lecture 1 Andreas Habegger Introduction Zynq Introduction Zynq Introduction Zynq PS vs. PL

Introduction to Web Design &amp; Computer Principles Class 1 CSCI-UA 4 Introduction and Overview

Introduction to CICS Course introduction Course introduction What is CICS? What is an

INF5110 Compiler Construction Introduction Spring 2016 1 / 33 Outline 1. Introduction

INTRODUCTION I Syllabus INTRODUCTION I Syllabus I Why study labor economics? INTRODUCTION I

2018.06 01 SMILE5 Introduction S E 5 02 Alpha Cloud M I L 03 Company Introduction 04

Cardiovascular Health Banff 2012 CP1271671-5 Global Burden of Cardiovascular Disease 2002

BoXHED : B oosted e X act H azard E stimator with D ynamic covariates Xiaochen Wang Yale

Dag 2: Logistic regression Susanne Rosthj Biostatistisk Afdeling Institut for

FEASIBILITY STUDY School Committee Meeting April 25, 2018 PROJECT MANAGEMENT SMMA Agenda 1.

INTRODUCTION TO GENETIC EPIDEMIOLOGY (GBIO0015) Prof. Dr. Dr. K. Van Steen Introduction to

Intelligent Design Theory: The God-of-the-Gaps Rooted in Concordism Denis O. Lamoureux

Single molecule mechanical studies of acto-myosin Justin E. Molloy Francis Crick

Mathematical Modeling and Biology Bo Deng Introduction Examples of Models Bo Deng Consistency

Sambuz

Useful Links

Newsletter

Mail Us

Introduction to Web Design & Computer Principles Class 1 CSCI-UA 4 Introduction and Overview