Analysis of Big Dependent Data in Economics and Finance
Ruey S. Tsay Booth Shool of Business, University of Chicago September 2016
Ruey S. Tsay Big Dependent Data 1 / 72
Analysis of Big Dependent Data in Economics and Finance Ruey S. - - PowerPoint PPT Presentation
Analysis of Big Dependent Data in Economics and Finance Ruey S. Tsay Booth Shool of Business, University of Chicago September 2016 Ruey S. Tsay Big Dependent Data 1 / 72 Outline Big data? Machine learning? Data science? What is in for 1
Ruey S. Tsay Big Dependent Data 1 / 72
1
2
3
4
5
6
7
Ruey S. Tsay Big Dependent Data 2 / 72
1
2
3
4
5
6
7
Ruey S. Tsay Big Dependent Data 3 / 72
1
2
3
4
1
2
Ruey S. Tsay Big Dependent Data 4 / 72
Ruey S. Tsay Big Dependent Data 5 / 72
Ruey S. Tsay Big Dependent Data 6 / 72
2≤R
Ruey S. Tsay Big Dependent Data 7 / 72
1
2
3
Ruey S. Tsay Big Dependent Data 8 / 72
1
2
3
4
5
Ruey S. Tsay Big Dependent Data 9 / 72
1
2
3
Ruey S. Tsay Big Dependent Data 10 / 72
Time xt 500 1000 1500 2000 −40000 −35000 −30000 −25000
Ruey S. Tsay Big Dependent Data 11 / 72
* * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * 0.0 0.2 0.4 0.6 0.8 1.0 −2e+05 0e+00 2e+05 4e+05 |beta|/max|beta| Standardized Coefficients * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** * * * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * * * * * * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * * * ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * * * * * * ** * ** * * * * * ** ** * * * * * ** * * * * * * ** * * * ** * * * * * * * * *
LASSO
2 6 5 1 1 9 23 28 35 39 40 43 48 50
Ruey S. Tsay Big Dependent Data 12 / 72
Ruey S. Tsay Big Dependent Data 13 / 72
1
2
Ruey S. Tsay Big Dependent Data 14 / 72
1
2
Ruey S. Tsay Big Dependent Data 15 / 72
Ruey S. Tsay Big Dependent Data 16 / 72
5 10 15 20 0.0 0.4 0.8 1:20 β2 5 10 15 20 0.0 0.4 0.8 1:20 β3 5 10 15 20 −0.2 0.2 0.6 1.0 1:20 β4 5 10 15 20 0.0 0.4 0.8 1:22 β2 5 10 15 20 0.0 0.4 0.8 1:22 β3 5 10 15 20 0.0 0.4 0.8 1:22 β4
Ruey S. Tsay Big Dependent Data 17 / 72
1
2
3
4
Ruey S. Tsay Big Dependent Data 18 / 72
1
2
3
4
5
6
Ruey S. Tsay Big Dependent Data 19 / 72
days N(stocks) 100 200 300 400 500 6600 6700 6600 6700 size
Ruey S. Tsay Big Dependent Data 20 / 72
−0.10 0.00 0.10
lnreturn density −0.10 0.00 0.10
lnreturn density
Ruey S. Tsay Big Dependent Data 21 / 72
1000 2000 0.00 0.01 0.02 0.03 0.04 0.05 demand
Monday
1000 2000 demand
Tuesday
1000 2000 demand
Wednesday
1000 2000 demand
Thursday
1000 2000 demand
Friday
1000 2000 demand
Saturday
1000 2000 demand
Sunday
Ruey S. Tsay Big Dependent Data 22 / 72
1980 1990 2000 2010 5 10 15 year urate
State UNRATE: 1976.1 to 2015.9
Ruey S. Tsay Big Dependent Data 23 / 72
1
2
3
Ruey S. Tsay Big Dependent Data 24 / 72
Ruey S. Tsay Big Dependent Data 25 / 72
1
2
3
Ruey S. Tsay Big Dependent Data 26 / 72
Ruey S. Tsay Big Dependent Data 27 / 72
Ruey S. Tsay Big Dependent Data 28 / 72
Ruey S. Tsay Big Dependent Data 29 / 72
1998 2002 2006 7.8 8.4
New England
1998 2002 2006 8.6 9.0 9.4
Middle Atlantic
1998 2002 2006 9.2 9.6
East North Central
year 1998 2002 2006 8.2 8.8 9.4
West Noth Central
1998 2002 2006 10.2 10.8
South Atlantic
1998 2002 2006 8.6 9.0 9.4
East South Central
year 1998 2002 2006 9.2 9.6 10.2
West South Central
1998 2002 2006 9.4 9.8
moutain
1998 2002 2006 9.4 9.8
Pacific
year
Ruey S. Tsay Big Dependent Data 30 / 72
F1[,1]
Time 20 40 60 80 100 120 −1.0 −0.5 0.0 0.5 1.0
F1[,2]
Time 20 40 60 80 100 120 −1.5 −0.5 0.5 1.0 1.5
F2[,1]
Time 2 4 6 8 10 12 −0.6 −0.2 0.0 0.2 0.4
F2[,2]
Time 2 4 6 8 10 12 −0.2 0.0 0.2 0.4
F3[,1]
2 4 6 8 10 12 −0.03 −0.01 0.01 0.03
F3[,2]
2 4 6 8 10 12 −0.8 −0.4 0.0 0.4
Ruey S. Tsay Big Dependent Data 31 / 72
−0.10 −0.05 0.00 0.05
New England
−0.10 −0.05 0.00 0.05
Middle Atlantic
−0.2 −0.1 0.0 0.1
East North Central
−0.20 −0.10 0.00 0.10
West North Central
−0.08 −0.04 0.00 0.04
South Atlantic
−0.20 −0.10 0.00 0.10
East South Central
−0.15 −0.05 0.05
West South Central
−0.10 −0.05 0.00 0.05
Mountain
−0.06 −0.02 0.02 20 40 60 80 100 120
Pacific Index
ts(Gterm)
2 of a fitted DCF model of order
Ruey S. Tsay Big Dependent Data 32 / 72
−0.4 −0.2 0.0 0.2
New England
−0.4 −0.2 0.0 0.2
Middle Atlantic
−0.4 −0.2 0.0 0.2
East North Central
−0.4 −0.2 0.0 0.2
West North Central
−0.4 −0.2 0.0 0.2 0.4
South Atlantic
−0.4 −0.2 0.0 0.2 0.4
East South Central
−0.4 −0.2 0.0 0.2 0.4
West South Central
−0.4 −0.2 0.0 0.2 0.4
Mountain
−0.4 −0.2 0.0 0.2 0.4 20 40 60 80 100 120
Pacific Index
ts(Hterm)
1H′ of a fitted DCF model of order
Ruey S. Tsay Big Dependent Data 33 / 72
−0.10 −0.05 0.00 0.05
New England
−0.10 −0.05 0.00 0.05
Middle Atlantic
−0.10 −0.05 0.00 0.05
East North Central
−0.10 −0.05 0.00 0.05
West North Central
−0.3 −0.2 −0.1 0.0 0.1
South Atlantic
−0.3 −0.2 −0.1 0.0 0.1
East South Central
−0.3 −0.2 −0.1 0.0 0.1
West South Central
−0.10 −0.05 0.00 0.05
Mountain
−0.10 −0.05 0.00 0.05 20 40 60 80 100 120
Pacific Index
ts(GHterm)
3H′ of a fitted DCF model of order
Ruey S. Tsay Big Dependent Data 34 / 72
Ruey S. Tsay Big Dependent Data 35 / 72
1
2
Ruey S. Tsay Big Dependent Data 36 / 72
Ruey S. Tsay Big Dependent Data 37 / 72
1
2
Ruey S. Tsay Big Dependent Data 38 / 72
1
2
3
Ruey S. Tsay Big Dependent Data 39 / 72
Ruey S. Tsay Big Dependent Data 40 / 72
1000 1500 2000 2500 3000 0.000 0.001 0.002 0.003 0.004 megawatts density
Ruey S. Tsay Big Dependent Data 41 / 72
1000 1500 2000 2500 3000 0.000 0.001 0.002 0.003 0.004 Megawatts density
Ruey S. Tsay Big Dependent Data 42 / 72
1
1
2
3
2
3
Ruey S. Tsay Big Dependent Data 43 / 72
Ruey S. Tsay Big Dependent Data 44 / 72
Ruey S. Tsay Big Dependent Data 45 / 72
days VIX 100 200 300 400 500 15 25
days diff(VIX) 100 200 300 400 500 −4 4
Ruey S. Tsay Big Dependent Data 46 / 72
−0.2 −0.1 0.0 0.1 0.2 20 40 60 log−rtn density
dvix > 1.19
−0.2 −0.1 0.0 0.1 0.2 20 40 60 log−rtn density
1.19 >= dvix > 0.39
−0.2 −0.1 0.0 0.1 0.2 20 40 60 log−rtn density
0.39 >= dvix > −0.73
−0.2 −0.1 0.0 0.1 0.2 20 40 60 log−rtn density
dvix <= −0.73
Ruey S. Tsay Big Dependent Data 47 / 72
1
2
3
4
5
Ruey S. Tsay Big Dependent Data 48 / 72
1
2
3
Ruey S. Tsay Big Dependent Data 49 / 72
Ruey S. Tsay Big Dependent Data 50 / 72
1
2
3
4
Ruey S. Tsay Big Dependent Data 51 / 72
Comp.1 Comp.3 Comp.5 Comp.7 Comp.9
Variances 5000 10000 15000 20000
Ruey S. Tsay Big Dependent Data 52 / 72
−0.2 −0.1 0.0 0.1 0.2 400 800 lnreturn pc1 −0.2 −0.1 0.0 0.1 0.2 −150 150 lnreturn pc2 −0.2 −0.1 0.0 0.1 0.2 −100 lnreturn pc3 −0.2 −0.1 0.0 0.1 0.2 −60 40 lnreturn pc4 −0.2 −0.1 0.0 0.1 0.2 −30 0 20 lnreturn pc5 −0.2 −0.1 0.0 0.1 0.2 −20 20 lnreturn pc6
Ruey S. Tsay Big Dependent Data 53 / 72
−0.2 −0.1 0.0 0.1 0.2 −15 0 10 lnreturn pc7 −0.2 −0.1 0.0 0.1 0.2 −10 10 lnreturn pc8 −0.2 −0.1 0.0 0.1 0.2 −10 0 5 lnreturn pc9 −0.2 −0.1 0.0 0.1 0.2 −5 5 lnreturn pc10 −0.2 −0.1 0.0 0.1 0.2 −6 4 lnreturn pc11 −0.2 −0.1 0.0 0.1 0.2 −4 2 lnreturn pc12
Ruey S. Tsay Big Dependent Data 54 / 72
−0.2 −0.1 0.0 0.1 0.2 10 20 30 40 lnreturn pc1
Mean density pm first PC
Ruey S. Tsay Big Dependent Data 55 / 72
−0.2 −0.1 0.0 0.1 0.2 10 20 30 40 lnrturn pc2
Mean density pm 2nd PC
Ruey S. Tsay Big Dependent Data 56 / 72
−0.2 −0.1 0.0 0.1 0.2 10 20 30 40 lnreturn pc3
Mean density pm 3rd PC
Ruey S. Tsay Big Dependent Data 57 / 72
1
2
3
Ruey S. Tsay Big Dependent Data 58 / 72
−4 −2 2 4 −0.05 −0.04 −0.03 −0.02 dvix Loadings
Ruey S. Tsay Big Dependent Data 59 / 72
1
2
3
4
5
6
Ruey S. Tsay Big Dependent Data 60 / 72
Comp.1 Comp.3 Comp.5 Comp.7 Comp.9
Variances 0 4000 10000 Comp.1 Comp.3 Comp.5 Comp.7 Comp.9
Variances 6000
Ruey S. Tsay Big Dependent Data 61 / 72
−0.2 −0.1 0.0 0.1 0.2 300 600 lnreturn pc1 −0.2 −0.1 0.0 0.1 0.2 −100 0 100 lnreturn pc2 −0.2 −0.1 0.0 0.1 0.2 −60 40 lnreturn pc3 −0.2 −0.1 0.0 0.1 0.2 −20 20 lnreturn pc4 −0.2 −0.1 0.0 0.1 0.2 −20 20 lnreturn pc5 −0.2 −0.1 0.0 0.1 0.2 −15 10 lnreturn pc6
Ruey S. Tsay Big Dependent Data 62 / 72
1
2
3
4
5
6
Ruey S. Tsay Big Dependent Data 63 / 72
−0.2 −0.1 0.0 0.1 0.2 15 30 lnreturn density
−0.2 −0.1 0.0 0.1 0.2 −0.4 0.2 0.6 lnreturn difference
Ruey S. Tsay Big Dependent Data 64 / 72
−0.2 −0.1 0.0 0.1 0.2 10 20 30 lnreturn density
density and its fits: day 420
−0.2 −0.1 0.0 0.1 0.2 −0.6 0.0 0.4 lnreturn errors
Errors of approximation: day 420, red(Thr)
Ruey S. Tsay Big Dependent Data 65 / 72
1
2
3
Ruey S. Tsay Big Dependent Data 66 / 72
1
2
3
1
2
Ruey S. Tsay Big Dependent Data 67 / 72
1985 1990 1995 2000 2005 2010 2015 5 10 15
Ruey S. Tsay Big Dependent Data 68 / 72
Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 Comp.6 Comp.7 Comp.8 Comp.9
p1
20 40 60 80
Ruey S. Tsay Big Dependent Data 69 / 72
1985 1990 1995 2000 2005 2010 2015 −10 10 20 30
Ruey S. Tsay Big Dependent Data 70 / 72
1
2
3
Ruey S. Tsay Big Dependent Data 71 / 72
1
2
3
4
5
6
Ruey S. Tsay Big Dependent Data 72 / 72