Introd u ction to s w imming data C ASE STU D IE S IN STATISTIC AL - - PowerPoint PPT Presentation

introd u ction to s w imming data
SMART_READER_LITE
LIVE PREVIEW

Introd u ction to s w imming data C ASE STU D IE S IN STATISTIC AL - - PowerPoint PPT Presentation

Introd u ction to s w imming data C ASE STU D IE S IN STATISTIC AL TH IN K IN G J u stin Bois Lect u rer , Caltech The 2015 FINA World Championships 1 Photo b y Chan - Fan , CC - BY - SA -4.0 CASE STUDIES IN STATISTICAL THINKING Strokes at


slide-1
SLIDE 1

Introduction to swimming data

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

Justin Bois

Lecturer, Caltech

slide-2
SLIDE 2

CASE STUDIES IN STATISTICAL THINKING

The 2015 FINA World Championships

Photo by Chan-Fan, CC-BY-SA-4.0

1

slide-3
SLIDE 3

CASE STUDIES IN STATISTICAL THINKING

Strokes at the World Championships

Freestyle Breaststroke Buery Backstroke

slide-4
SLIDE 4

CASE STUDIES IN STATISTICAL THINKING

Events at the World Championships

Dened by gender, distance, stroke Example: men's 200 m freestyle

slide-5
SLIDE 5

CASE STUDIES IN STATISTICAL THINKING

Rounds of events

Heats: First round Seminals: Penultimate round in some events Finals: The nal round; the winner is champion

slide-6
SLIDE 6

CASE STUDIES IN STATISTICAL THINKING

Data source

Data are freely available from OMEGA at omegatiming.com

slide-7
SLIDE 7

CASE STUDIES IN STATISTICAL THINKING

Domain-specific knowledge is

Imperative An absolute pleasure

slide-8
SLIDE 8

Let's practice!

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

slide-9
SLIDE 9

Do swimmers go faster in the finals?

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

Justin Bois

Lecturer, Caltech

slide-10
SLIDE 10

CASE STUDIES IN STATISTICAL THINKING

Event Time Venue Date Round 100 m free 47.51 Beijing 2008-08-11 Final 200 m free 1:42.96 Beijing 2008-08-12 Final 400 m free 3:47.79 Indianapolis 2005-04-01 Final 100 m back 53.01 Indianapolis 2007-08-03 Final 200 m back 1:54.65 Indianapolis 2007-08-01 Final 100 m breast 1:02.57 Columbia 2008-02-17 Final 200 m breast 2:11.30 San Antonio 2015-08-10 Final 100 m y 49.82 Rome 2009-08-01 Final 200 m y 1:51.51 Rome 2009-29-07 Final

slide-11
SLIDE 11

CASE STUDIES IN STATISTICAL THINKING

Event Time Venue Date Round 50 m free 23.67 Budapest 2017-07-29 Seminal 100 m free 51.71 Budapest 2017-07-23 Final 200 m free 1.54.08 Rio de Janeiro 2016-08-09 Final 400 m free 4.06.04 Amiens 2014-03-16 Final 50 m back 27.80 Borås 2017-06-30 Final 100 m back 59.98 Eindhoven 2015-04-05 Final 50 m y 24.43 Borås 2014-07-05 Final 100 m y 55.48 Rio de Janeiro 2016-08-07 Final

slide-12
SLIDE 12

CASE STUDIES IN STATISTICAL THINKING

Your question

Do swimmers swim faster in the nals than in other rounds? Individual swimmers, or the whole eld? Faster than heats? Faster than seminals? For what strokes? For what distances?

slide-13
SLIDE 13

CASE STUDIES IN STATISTICAL THINKING

Your question

Do individual female swimmers swim faster in the nals compared to the seminals? Events: 50, 100, 200 meter freestyle, breaststroke, buery, backstroke

slide-14
SLIDE 14

CASE STUDIES IN STATISTICAL THINKING

Diff'rent strokes

slide-15
SLIDE 15

CASE STUDIES IN STATISTICAL THINKING

Fractional improvement

f = semifinals time semifinals time − finals time

slide-16
SLIDE 16

CASE STUDIES IN STATISTICAL THINKING

Your question(s)

Original question: Do swimmers swim faster in the nals than in other rounds? Sharpened questions: What is the fractional improvement of individual female swimmers from the seminals to the nals? Is the observed fractional improvement commensurate with there being no dierence in performance in the seminals and nals?

slide-17
SLIDE 17

Let's practice!

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

slide-18
SLIDE 18

How does the performance of swimmers decline

  • ver long events?

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

Justin Bois

Lecturer, Caltech

slide-19
SLIDE 19

CASE STUDIES IN STATISTICAL THINKING

More swimming background

Photo by Chan-Fan, CC-BY-SA-4.0

1

slide-20
SLIDE 20

CASE STUDIES IN STATISTICAL THINKING

More swimming background

Split: The time is takes to swim one length of the pool

slide-21
SLIDE 21

CASE STUDIES IN STATISTICAL THINKING

More swimming background

slide-22
SLIDE 22

CASE STUDIES IN STATISTICAL THINKING

More swimming background

Image: Miho NL, CC-BY-3.0

1

slide-23
SLIDE 23

CASE STUDIES IN STATISTICAL THINKING

More swimming background

slide-24
SLIDE 24

CASE STUDIES IN STATISTICAL THINKING

Slowing down

slide-25
SLIDE 25

CASE STUDIES IN STATISTICAL THINKING

Quantifying slowdown

Use women's 800 m freestyle heats Omit rst and last 100 meters Compute mean split time for each split number Perform linear regression to get slowdown per split Perform hypothesis test: can the slowdown be explained by random variation?

slide-26
SLIDE 26

CASE STUDIES IN STATISTICAL THINKING

Hypothesis tests for correlation

Posit null hypothesis: split time and split number are completely uncorrelated Simulate data assuming null hypothesis is true

scrambled_split_number = np.random.permutation( split_number )

Use Pearson correlation, denoted rho , as test statistic

rho = dcst.pearson_r(scrambled_split_number, splits)

Compute p-value as the fraction of replicates that have Pearson correlation at least as large as observed

slide-27
SLIDE 27

Let's practice!

C ASE STU D IE S IN STATISTIC AL TH IN K IN G