How to Lie with Statistics March 3, 2020 Data Science CSCI 1951A - PowerPoint PPT Presentation

How to Lie with Statistics March 3, 2020 Data Science CSCI 1951A Brown University Instructor: Ellie Pavlick HTAs: Josh Levin, Diane Mutako, Sol Zitter

Announcements

Today • Linear Regression Recap/Follow up • P-Hacking, Researcher Degrees of Freedom

Dummy Variables cholesterol yes breakfast constant meds 20 31 0 1 1 20 5 0 1 1 X = 20 40 0 1 1 why do we 25 18 1 0 1 have to do this? what no breakfast about pseudo- eucalyptus inverse?

statsmodels import statsmodels.api as sm y, X = read_data() X = sm.add_constant(X) model = sm.OLS(y, X) results = model.fit() print(results.summary()) https://www.statsmodels.org/dev/examples/notebooks/generated/ols.html https://www.statsmodels.org/dev/generated/statsmodels.regression.linear_model.OLS.html

statsmodels import statsmodels.api as sm import statsmodels.formula.api as smf # M has column headers w/ names M = read_data() X = sm.add_constant(X) eq = “chol ~ eucalyptus + meds + breakfast” model = smf.ols(formula=eq, data=M) results = model.fit() print(results.summary()) https://www.statsmodels.org/dev/examples/notebooks/generated/ols.html https://www.statsmodels.org/dev/generated/statsmodels.regression.linear_model.OLS.html

statsmodels import statsmodels.api as sm import statsmodels.formula.api as smf # M has column headers w/ names M = read_data() interaction term X = sm.add_constant(X) eq = “chol ~ eucalyptus + meds + breakfast + eucalyptus:meds” model = smf.ols(formula=eq, data=M) results = model.fit() print(results.summary()) https://www.statsmodels.org/dev/examples/notebooks/generated/ols.html https://www.statsmodels.org/dev/generated/statsmodels.regression.linear_model.OLS.html

statsmodels import statsmodels.api as sm import statsmodels.formula.api as smf # M has column headers w/ names M = read_data() squared terms X = sm.add_constant(X) eq = “chol ~ eucalyptus + meds + breakfast + eucalyptus^2” model = smf.ols(formula=eq, data=M) results = model.fit() print(results.summary()) https://www.statsmodels.org/dev/examples/notebooks/generated/ols.html https://www.statsmodels.org/dev/generated/statsmodels.regression.linear_model.OLS.html

statsmodels https://www.statsmodels.org/dev/examples/notebooks/generated/ols.html https://www.statsmodels.org/dev/generated/statsmodels.regression.linear_model.OLS.html

statsmodels overall fit of model (SSE) https://www.statsmodels.org/dev/examples/notebooks/generated/ols.html https://www.statsmodels.org/dev/generated/statsmodels.regression.linear_model.OLS.html

statsmodels coefficients (i.e. effect sizes) https://www.statsmodels.org/dev/examples/notebooks/generated/ols.html https://www.statsmodels.org/dev/generated/statsmodels.regression.linear_model.OLS.html

statsmodels p-values https://www.statsmodels.org/dev/examples/notebooks/generated/ols.html https://www.statsmodels.org/dev/generated/statsmodels.regression.linear_model.OLS.html

Clicker Question!

Today • Linear Regression Recap/Follow up • P-Hacking, Researcher Degrees of Freedom

You can find almost anything if you look hard enough. Per capita cheese consumption correlates with Number of people who died by becoming tangled in their bedsheets 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 800 deaths 33lbs Bedsheet tanglings Cheese consumed 600 deaths 31.5lbs ρ = 0.95 400 deaths 30lbs 28.5lbs 200 deaths 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 Bedsheet tanglings Cheese consumed tylervigen.com https://en.wikipedia.org/wiki/Data_dredging http://www.tylervigen.com/spurious-correlations

How to Lie with Statistics March 3, 2020 Data Science CSCI 1951A - PowerPoint PPT Presentation

How to Lie with Statistics March 3, 2020 Data Science CSCI 1951A Brown University Instructor: Ellie Pavlick HTAs: Josh Levin, Diane Mutako, Sol Zitter Announcements Today Linear Regression Recap/Follow up P-Hacking, Researcher

Lie nilpotent group algebras central series Lie nilpotency index and central series Computation

Lie Theory From Basics to the Heisenberg Lie Group Noah Migoski IU Math DRP April, 2020 Noah

Introduction to Lie Groups, Lie Algebra, and Representation Theory Dennica Mitev University of

Special geometry Simon G. Chiossi Special geometry with solvable Lie groups Lie groups

What Makes a Lie a Lie? Dr. Sara L. Uckelman s.l.uckelman@durham.ac.uk @SaraLUckelman 10 Jan

Statistics on Lie groups: using the pseudo-Riemannian framework? Nina Miolane, Xavier Pennec

Constructing n -Engel Lie rings Serena Cical` o University of Trento Advisor: Willem A. de

On the curvatures of subalgebras of nilpotent Lie algebras Ana Hini c Gali c La Trobe

Lie Foliations Producing Harmonic Morphisms Sigmundur Gudmundsson Department of Mathematics

Lie Theory without groups 2020 Erd s Memorial Lecture Fall Western Sectional Meeting, October

Wreath Lie Algebras Cristina Di Pietro Cristina Di Pietro 1 Lie Algebras, their

Analysis on singular spaces, Lie manifolds, and non-commutative geometry II Lie manifolds Victor

The Capelli eigenvalue problem for Lie superalgebras Hadi Salmasian Department of Mathematics

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

Linear connections on Lie groups The affine space of linear connections on a compact Lie group G

Lie Superalgebras and Sage Daniel Bump July 26, 2018 With the connivance of Brubaker, Schilling

Baumgartner, POLI 203 Spring 2016 Just Mercy, continued February 1, 2016 George Stinney

LEADING WITH LOVE Presented by: : Dr. r. Maria Church, CEO, Government Leadership Solutions Ro

Confronting Reality Strategic Thinking Team and Organizational Focus Productive

project and region. This section will build on that understanding by providing you with a

outside the Gospels Sayings of Jesus outside the Gospels Sayings of Jesus outside the Gospels

EHEALTH COMMISSION MEETING JULY 10, 2019 JULY AGENDA Call to Order Roll Call and Introductions

Introduction to Information Security CODATA School Hannah Short (CERN), Sebastian Lopienski

MITOCW | watch?v=yrmqYNvvIzs The following content is provided under a Creative Commons license.