social data science
Statistical Learning
Sebastian Barfort August 15, 2016
University of Copenhagen Department of Economics 1/85
social data science Statistical Learning Sebastian Barfort August - - PowerPoint PPT Presentation
social data science Statistical Learning Sebastian Barfort August 15, 2016 University of Copenhagen Department of Economics 1/85 concepts Cross validation: Split data in test and training data. Train model on training data, test it on test
University of Copenhagen Department of Economics 1/85
2/85
3/85
4/85
n
i=1
n
i=1
5/85
6/85
7/85
8/85
9/85
10/85
11/85
12/85
13/85
14/85
15/85
k
i=1
16/85
17/85
18/85
19/85
20/85
21/85
22/85
∗∗∗p < .01; ∗∗p < .05; ∗p < .1 23/85
24/85
25/85
26/85
27/85
28/85
2 3 4 5 6 7 8 9 0.0 2.5 5.0 7.5 10.0 0.0 2.5 5.0 7.5 10.0 0.0 2.5 5.0 7.5 10.0 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00
29/85
30/85
31/85
32/85
33/85
34/85
35/85
36/85
37/85
38/85
39/85
40/85
n
i=1
p
j=1
j=1 β2 j
j=1 |βj|
41/85
42/85
43/85
44/85
45/85
46/85
47/85
48/85
49/85
50/85
51/85
52/85
53/85
54/85
55/85
56/85
57/85
58/85
59/85
60/85
61/85
62/85
tortilla < 0.5 parmesan >= 0.5 soy >= 0.5 masala >= 0.5 cilantro < 0.5
italian chinese indian italian southern mexican mexican yes no
63/85
64/85
65/85
66/85
67/85
68/85
69/85
70/85
71/85
72/85
73/85
74/85
75/85
76/85
77/85
78/85
79/85
80/85
81/85
82/85
83/85
84/85
85/85