SLIDE 1

Descriptive statistics

PRACTICING STATISTICS INTERVIEW QUESTIONS IN R

Zuzanna Chmielewska

Actuary

SLIDES 2-5

Descriptive statistics

central tendency measures
variability measures

SLIDES 6-15

Central tendency measures
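
The summary slide of this chapter names the mean, median, and mode as central tendency measures. A minimal sketch of computing them in R follows; the vector `x` is made-up data, and `stat_mode` is a hypothetical helper, since base R's `mode()` reports an object's storage mode rather than the most frequent value.

```r
# Illustrative data (not taken from the slides)
x <- c(2, 3, 3, 5, 7, 10)

mean_x   <- mean(x)    # arithmetic mean: sum(x) / length(x)
median_x <- median(x)  # middle value; average of the two middle values when n is even

# Hypothetical helper: most frequent value (smallest one if several values tie)
stat_mode <- function(v) {
  counts <- table(v)
  as.numeric(names(counts)[which.max(counts)])
}
mode_x <- stat_mode(x)

print(c(mean = mean_x, median = median_x, mode = mode_x))
```

Here the mean (5) exceeds the median (4), which is typical of data with a longer right tail.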


SLIDE 22

Descriptive statistics

central tendency
variability

SLIDES 23-25

Variability

SLIDE 26

Variability measures

variance
standard deviation
range

SLIDE 27

Variability measures

variance

$$\sigma^2 = \frac{\sum (x_i - \mu)^2}{n}$$

SLIDE 28

Variance - numerical example

$$x_1 = 2,\quad x_2 = 5,\quad x_3 = 11$$

$$\mu = \frac{\sum_{i=1}^{n} x_i}{n} = \frac{2 + 5 + 11}{3} = 6$$

$$(x_1 - \mu)^2 = (2 - 6)^2 = (-4)^2 = 16$$

$$(x_2 - \mu)^2 = (5 - 6)^2 = (-1)^2 = 1$$

$$(x_3 - \mu)^2 = (11 - 6)^2 = (5)^2 = 25$$

$$\sum_{i=1}^{n} (x_i - \mu)^2 = 16 + 1 + 25 = 42$$

$$\frac{\sum_{i=1}^{n} (x_i - \mu)^2}{n} = \frac{42}{3} = 14$$
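
The arithmetic of this example can be checked in R. The sketch below follows the slide's formula, which divides by n (population variance); the variable names are illustrative. Note that base R's `var()` divides by n - 1 (sample variance), so it returns a different value for the same data.

```r
x <- c(2, 5, 11)
n <- length(x)

mu      <- sum(x) / n        # (2 + 5 + 11) / 3 = 6
sq_dev  <- (x - mu)^2        # 16, 1, 25
pop_var <- sum(sq_dev) / n   # 42 / 3 = 14, as on the slide

sample_var <- var(x)         # divides by n - 1: 42 / 2 = 21

print(pop_var)
print(sample_var)
```

In an interview setting it is worth stating which of the two definitions is being used before quoting a number.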

SLIDES 29-30

Variability measures

variance

$$\sigma^2 = \frac{\sum (x_i - \mu)^2}{n}$$

standard deviation

$$\sigma = \sqrt{\sigma^2}$$

range

$$\text{range} = \max - \min$$
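
A short sketch of the standard deviation and range formulas in R, reusing the data from the variance example. Two details are easy to miss: `sd()` uses the n - 1 denominator, so the population version is computed by hand here, and `range()` returns the pair (min, max) rather than their difference. The variable names are illustrative.

```r
x <- c(2, 5, 11)

pop_var <- mean((x - mean(x))^2)  # population variance: 14
pop_sd  <- sqrt(pop_var)          # population standard deviation: sqrt(14)

# range() returns c(min, max), so take the difference to get max - min
x_range <- diff(range(x))         # 11 - 2 = 9

print(pop_sd)
print(x_range)
```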

SLIDE 31

Summary

central tendency measures: mean, median, mode, skewness
variability measures: variance, standard deviation, range

SLIDE 32

Let's practice!

SLIDE 33

Categorical data

PRACTICING STATISTICS INTERVIEW QUESTIONS IN R

Zuzanna Chmielewska

Actuary


SLIDE 39

Factors in R

x1 <- c("AB", "A", "O", "AB", "B", "B")
lvls <- c("A", "B", "AB", "O")
x2 <- factor(x1, levels = lvls)
print(x2)
[1] AB A  O  AB B  B
Levels: A B AB O

SLIDE 40

Factors in R

x1 <- c("M", "L", "L", "XS", "XL", "S")
lvls <- c("XS", "S", "M", "L", "XL")
x2 <- factor(x1, levels = lvls, ordered = TRUE)
print(x2)
[1] M  L  L  XS XL S
Levels: XS < S < M < L < XL
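
One practical consequence of `ordered = TRUE` is that comparison operators and `max()` respect the level order instead of alphabetical order. A small sketch reusing the clothing sizes from the slide; the names `sizes`, `smaller_than_l`, `largest`, and `counts` are illustrative.

```r
sizes <- factor(c("M", "L", "L", "XS", "XL", "S"),
                levels = c("XS", "S", "M", "L", "XL"),
                ordered = TRUE)

# Comparisons follow the declared level order XS < S < M < L < XL
smaller_than_l <- sizes < "L"   # TRUE FALSE FALSE TRUE FALSE TRUE

# max() is defined for ordered factors (it fails for unordered ones)
largest <- max(sizes)           # XL

# Frequencies are reported in level order
counts <- table(sizes)

print(smaller_than_l)
print(largest)
print(counts)
```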


SLIDE 48

tapply(df$value, df$level, mean)
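
To make the `tapply()` call concrete, here is a minimal sketch with a made-up data frame. The column names `value` and `level` match the call on the slide, but the data itself is an assumption for illustration.

```r
# Illustrative data frame (not from the slides)
df <- data.frame(
  level = factor(c("A", "B", "A", "B", "O")),
  value = c(10, 20, 30, 40, 50)
)

# Mean of `value` within each category of `level`
group_means <- tapply(df$value, df$level, mean)   # A = 20, B = 30, O = 50

# Number of observations per category
level_counts <- table(df$level)                   # A = 2, B = 2, O = 1

print(group_means)
print(level_counts)
```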

SLIDE 49

Categorical data encoding

label encoding
one hot encoding
many more!

SLIDES 50-54

Label encoding
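
The label encoding slides are image-only in this transcript, so the following is an assumption about the idea rather than the author's code. Label encoding maps each category to an integer; in R, one way to sketch this is to convert a factor to its underlying integer codes. The variable names are illustrative.

```r
sizes <- c("M", "L", "L", "XS", "XL", "S")
lvls  <- c("XS", "S", "M", "L", "XL")

# Each category is replaced by the position of its level: XS = 1, ..., XL = 5
size_factor <- factor(sizes, levels = lvls)
labels <- as.integer(size_factor)   # 3 4 4 1 5 2

print(labels)
```

The integers imply an order, which suits ordinal data such as sizes but can mislead a model when the categories have no natural ranking.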

SLIDES 55-59

One hot encoding
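
The one hot encoding slides are also image-only here, so this sketch is an assumption about the technique, not the author's code. One hot encoding creates one 0/1 indicator column per category. In base R, `model.matrix()` with the intercept removed (`- 1`) produces such a matrix for a single factor; the blood type data is borrowed from the factor example earlier in the deck.

```r
blood <- factor(c("AB", "A", "O", "AB", "B", "B"),
                levels = c("A", "B", "AB", "O"))

# Removing the intercept gives one indicator column per level
one_hot <- model.matrix(~ blood - 1)
colnames(one_hot) <- levels(blood)   # replace "bloodA", ... with plain level names

print(one_hot)
```

Each row contains exactly one 1, in the column of that observation's category.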

SLIDE 60

Summary

types of categorical data
factors in R
categorical data analysis: table(), barplot(), tapply()
data encoding: label encoding, one hot encoding

SLIDE 61

Let's practice!

SLIDE 62

Time series

PRACTICING STATISTICS INTERVIEW QUESTIONS IN R

Zuzanna Chmielewska

Actuary

SLIDE 63

Time series

Application of time series:
finance
agriculture
energy
etc.


SLIDE 66

Time series

Time series analysis:
trends
seasonal variation
serial correlation
prediction model (e.g. ARIMA)


SLIDE 70

Time series - object

ts <- xts(x = values, order.by = dates)

SLIDE 71

plot(ts)
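
The slides use `values` and `dates` without defining them. A self-contained sketch, assuming the xts package is installed and using made-up data, could look like this. The object is named `series` here rather than `ts` to avoid confusion with base R's `ts()` function; that naming is a choice made for this example, not part of the original.

```r
library(xts)  # assumption: the xts package is installed

# Illustrative data: ten consecutive days starting 2010-01-01
dates  <- seq(from = as.Date("2010-01-01"), by = "1 day", length.out = 10)
values <- c(5, 7, 6, 8, 9, 11, 10, 12, 13, 15)

# An xts object pairs the values with a time index
series <- xts(x = values, order.by = dates)

print(head(series, 3))
plot(series)  # line plot with dates on the horizontal axis
```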

SLIDES 72-74

Analysis - subsetting

dates <- seq(from = as.Date("2010-01-01"), to = as.Date("2010-12-31"), by = "1 month")
ts[dates]
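
A runnable sketch of the subsetting step, assuming the xts package is installed. The daily series is made-up data, and the object is called `series` instead of `ts` for clarity. Besides subsetting by a vector of dates, as on the slide, xts also accepts ISO 8601 style strings such as `"2010-03"`.

```r
library(xts)  # assumption: the xts package is installed

# Illustrative daily series for 2010: value = day of the year
days   <- seq(from = as.Date("2010-01-01"), to = as.Date("2010-12-31"), by = "1 day")
series <- xts(x = seq_along(days), order.by = days)

# Subset by a vector of dates: the first day of every month
month_starts   <- seq(from = as.Date("2010-01-01"), to = as.Date("2010-12-31"), by = "1 month")
first_of_month <- series[month_starts]   # 12 rows

# Subset by a date string: all observations in March 2010
march <- series["2010-03"]               # 31 rows

print(first_of_month)
```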

SLIDES 75-78

Analysis - merging
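
The merging slides are image-only in this transcript. As an assumption about what they illustrate, the sketch below uses `merge()` on two xts objects with partially overlapping dates; the xts package must be installed and the data is made up. By default `merge()` on xts objects performs an outer join on the time index, and `join = "inner"` keeps only the shared dates.

```r
library(xts)  # assumption: the xts package is installed

a <- xts(c(1, 2, 3),    order.by = as.Date("2010-01-01") + 0:2)  # Jan 1 to Jan 3
b <- xts(c(10, 20, 30), order.by = as.Date("2010-01-02") + 0:2)  # Jan 2 to Jan 4

outer_join <- merge(a, b)                  # all four dates, NA where a series has no value
inner_join <- merge(a, b, join = "inner")  # only Jan 2 and Jan 3

print(outer_join)
print(inner_join)
```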

SLIDES 79-81

Analysis - applying a function by calendar period
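
These slides are image-only as well, so the following is an assumption about the technique rather than the author's code. The xts package provides helpers such as `apply.monthly()` that split a series by calendar period and apply a function to each piece. The data below is made up so that the result is easy to verify by hand.

```r
library(xts)  # assumption: the xts package is installed

# Illustrative series: the value 1 on every day of the first quarter of 2010
days   <- seq(from = as.Date("2010-01-01"), to = as.Date("2010-03-31"), by = "1 day")
series <- xts(x = rep(1, length(days)), order.by = days)

# Sum within each calendar month; the result is indexed by each month's last observation
monthly_sum <- apply.monthly(series, sum)   # 31, 28, 31

print(monthly_sum)
```

Because every daily value is 1, each monthly sum equals the number of days in that month.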

SLIDE 82

Summary

definition of time series
xts object in R
wrangling time series: subsetting, merging, functions over calendar periods

SLIDE 83

Let's practice!

SLIDE 84

Principal Component Analysis

PRACTICING STATISTICS INTERVIEW QUESTIONS IN R

Zuzanna Chmielewska

Actuary

SLIDES 85-92

Principal Component Analysis


SLIDE 96

prcomp()

SLIDES 97-98

pca <- prcomp(~ v1 + v2 + v3, data = df)
predict(pca)
summary(pca)

SLIDE 99

pca <- prcomp(~ v1 + v2 + v3, data = df, rank = 2)
predict(pca)
summary(pca)

SLIDE 100

pca <- prcomp(~ v1 + v2 + v3, data = df, tol = 0.25)
predict(pca)
summary(pca)

Omitted if $\sigma_{PC_i} \le \mathrm{tol} \cdot \sigma_{PC_1}$
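
The slides call `prcomp()` on a data frame `df` that is never shown. The sketch below builds a made-up `df` in which `v2` is almost a multiple of `v1`, so one component carries very little variance, and then checks the `tol` rule stated above against the fitted object. The simulated data and the names `scores`, `pca_tol`, and `expected_kept` are assumptions for illustration.

```r
set.seed(1)  # reproducible illustrative data

v1 <- rnorm(100)
df <- data.frame(
  v1 = v1,
  v2 = 2 * v1 + rnorm(100, sd = 0.1),  # nearly collinear with v1
  v3 = rnorm(100)
)

# Full PCA: all three components are kept
pca    <- prcomp(~ v1 + v2 + v3, data = df)
scores <- predict(pca)   # the observations expressed in principal component coordinates

# With tol, components whose standard deviation is at most
# tol times that of the first component are omitted
pca_tol       <- prcomp(~ v1 + v2 + v3, data = df, tol = 0.25)
expected_kept <- sum(pca$sdev > 0.25 * pca$sdev[1])

summary(pca)
print(ncol(pca_tol$rotation))  # number of components retained under tol = 0.25
print(expected_kept)           # the same number, computed from the rule directly
```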

SLIDE 101

Summary

application of PCA
rotation of axes in PCA
PCA in R: prcomp()

SLIDE 102

Let's practice!