Data & Science
A m mandate f for d data d driven c corporate i innovation
By Igor Stojković Enterprise Analytics & Data Phillip Morris International
Data & Science A m mandate f for d data d driven c - - PowerPoint PPT Presentation
Data & Science A m mandate f for d data d driven c corporate i innovation By Igor Stojkovi Enterprise Analytics & Data Phillip Morris International Contents Mathematics for data science in commercial environment To prove
A m mandate f for d data d driven c corporate i innovation
By Igor Stojković Enterprise Analytics & Data Phillip Morris International
mathematical models
prep/modeling/performance valuation
2
mathematicise them
markets and products
3
4
Data Scientist Domain Expert Data Engineer/Hunter
Development Team
Senior Stakeholders
Product owner
Scrum Master
to build new API services for internal and external usage
5
EAD (Basel) CR models
management
discussions:
a hotel), that is relevant to creditors as well as buyers and/or suppliers of entities considered?
6
7
require deep modeling itself
representations – sometimes deeply analytical
8
patterns at PC level
9
𝑍
":=𝑌" ∗ 𝛽" + 𝜁" (, 𝜁" (~𝑂(0, Σ(), 𝑌" − 𝑞𝑠𝑓𝑒𝑗𝑑𝑢𝑗𝑤𝑓 𝑔𝑓𝑏𝑢𝑣𝑠𝑓𝑡 𝑙𝑜𝑝𝑥𝑜 𝑏𝑢 𝑢
𝛽" ≔ 𝐺 ∗ 𝛽"DE+𝜁"
F, 𝜁" F~𝑂 0, ΣF ,
Σ(, ΣF - unknown covariance matrices 𝐺- unknown matrix to be estimated This is a generalization of the local level model.
performance
parameters per cluster (iteratively, passing results at end of an epoch as input to the next epoch within a cluster)
10
t1 t2 t3 t4 t8 t9 t10 t11 t12 t5 t6 t7 t31 t32 t33 t34 t35 t36 Subseries 1 Subseries 2 Subseries 3 Subseries 11
TRAIN PARTITION TEST PARTITION 11
quarters) as target values
series behavior
12 1 .……. …………….......
One-hot representation of PC’s
Weights Relu activatons Weights Target sub-series One hot PC1 One hot PC1 One hot PC3000 One hot PC30000 Series 1 PC1 Series 2 PC1 Series k PC3000 Series k PC3000
t1 to t6 t31 to t36 t1 to t6 t31 to t36
E*x+𝑥E), x – one hot representation of a PC area, 𝑋
E and 𝑥E weights of the hidden layer
M ∗h+𝑥M), 𝑋
M and 𝑥M are weights of the output layer
O, … , 𝑨Q O), for 𝑨 ∈ ℝQ
E, 𝑥E, 𝑋 M, 𝑥M):= Ε(𝑡 − 𝑢 ℎ 𝑦
)M, 𝑡 𝑗𝑡 𝑢𝑏𝑠𝑓𝑢 𝑡𝑓𝑠𝑗𝑓𝑡, 𝐹 𝑗𝑡 𝑢𝑏𝑙𝑓𝑜 𝑥. 𝑠. 𝑢. 𝑒𝑏𝑢𝑏
joint Kalman filter inference:
E*x+𝑥E (∈ ℝ[, 𝑚 = 6 𝑢𝑝 10)
13
14
LSTM layer LSTM cell
Input series x1,…,x6 LSTM layer 1 LSTM layer 2
𝑋
_
𝑉_ 𝑋
E
𝑉E
x7
𝑋
M
𝑋
a
𝑋
a
Dense layer 1 Dense layer 2 Target
Our LSTM architecture
c − 𝑢𝑠𝑣𝑓 𝑤𝑏𝑚𝑣𝑓 𝑝𝑔 𝑡𝑏𝑚𝑓𝑡 𝑔𝑝𝑠 𝑄𝐷 𝑞 𝑏𝑢 𝑢𝑗𝑛𝑓 𝑢
c
" c: = h ij
kDhj k
hj
k
𝑐𝑏𝑡𝑓_𝑓𝑠𝑠
" c: = hjno
k
Dhj
k
hj
k
" and 𝑐𝑏𝑡𝑓_𝑓𝑠𝑠 "(aggregate over PC’s)
15