Seeking Gold in Sand
Applying Random Matrix Theory to Separation of Signals from Noise in Stock Market Data Mike S. Wang
Department of Chemistry University of Cambridge August, 2016
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
Seeking Gold in Sand Applying Random Matrix Theory to Separation of - - PowerPoint PPT Presentation
Seeking Gold in Sand Applying Random Matrix Theory to Separation of Signals from Noise in Stock Market Data Mike S. Wang Department of Chemistry University of Cambridge August, 2016 Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
Applying Random Matrix Theory to Separation of Signals from Noise in Stock Market Data Mike S. Wang
Department of Chemistry University of Cambridge August, 2016
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
1
What is Random Matrix Theory?
2
Project investigations: Seeking Gold in Sand
3
Looking beyond Financial Applications
4
Q&A Time
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
Random Matrix Theory (RMT) is the study of matrices with random variable entries, e.g. X11 X12 · · · X21 X22 · · · . . . . . . ... . In particular, we are interested in the emergent behaviours of random matrices in the asymptotic limit. Introduced by Wishart (1928), RMT gained prominence when Wigner (1950) applied the theory in nuclear physics.
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
We will
500);
the purpose of signals/noise detection. Improving a ‘crude’ prediction mode analysis + clustering analysis ⇓ new correlation matrix model ⇓ RMT better analytic predictions for signal and noise separation ⇓ more profitable investment portfolios
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
Price indices → logarithmic returns → de-meaned and normalised data:
P stocks T days
126.8 30.5 · · · 126.3 30.7 · · · . . . . . . ...
→ −0.9 1.7 · · · 1.5 0.3 · · · . . . . . . ...
→ 1.0 0.2 · · · 0.2 1.0 · · · . . . . . . ...
If X is the standardised data matrix, the correlation matrix E is calculated as E = X tX T .
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
2 4 6 8 10
eigenvalues
0.2 0.4 0.6 0.8 1 1.2
normalised distribution density
20 40 60 80 100 0.02 0.04 0.06
market mode
(a) Eigenvalues → different modes
100 200 300 400 50 100 150 200 250 300 350 400 450
0.5 1
sector clustering
(b) Heatmap → suggests clustering
Study of the structure Mode analysis: localisation of modes. Clustering analysis: hierarchical clustering structure.
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
If the stock prices are independent and purely random, then whatever the distribution, then 1 ... 1
⇒
0.5 1 1.5 2 2.5 3
x
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
ν(x)
ν(x) =
1 2π
√
(λ+−x)(x−λ ) cx
c = P
T
λ± = σ2(1 ± √c)2 All noise!!
Marˇ cenko-Pastur (M-P) distribution
In reality, the stock prices of course cannot be completely independent and purely random, and we see that ? ⇒
1 2 3 4 5
eigenvalues
0.2 0.4 0.6 0.8 1 1.2 1.4
normalised distribution density histogram of observed eigenvalues Marchenko-Pastur law signals incorrect noise band
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
We have studied localisation of specific modes.
50 100 150 200 250 300 350 400 450 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08
1st mode; eigenvalue = 99.1247; IPR = 0.0029781
Energy Financials Healthcare Industrials IT Materials Telecom Utilities
(a) The market mode: uniform.
50 100 150 200 250 300 350 400 450
0.2 0.4 0.6
452-th mode; eigenvalue = 0.059557; IPR = 0.14855
Energy Financials Healthcare Industrials IT Materials Telecom Utilities
(b) The lowest mode: localised.
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
Having removed the market mode, the internal structure of the stock market is
0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 CVS HCBK WAG ANF GPS TJX LTD COST FDO TGT WMT KSS JCP BBY HD JWN BBBY RSH ROST SPLS TIF AZO ORLY COH
(a) A hierarchical dendrogram. (b) The minimum spanning tree.
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
A new model for the underlying correlation matrix structure: What next? a model for underlying correlation matrix ⇓ better analytic predictions for noise bands ⇓ more profitable investment portfolios
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
Many other mathematical and scientific fields:
. . . and biochemistry!
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
Covariance analysis of protein sequence alignments can be used to infer protein structure and function.
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
Thanks for listening!
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
We have made many simplifications in deriving a new analytic prediction. In particular, we could further investigate:
matrix structure model?
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory
20 40 60 70 80 100 120 20 40 60 70 80 100 120 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
(a) market mode unremoved
20 40 60 80 100 120 20 40 60 80 100 120
0.2 0.4 0.6 0.8
(b) market mode removed
Mike S. Wang Seeking Gold in Sand with Random Matrix Theory