Bayesian Nonparametric Models for Data Exploration
Melanie F. Pradier
Friday 15th September, 2017
Bayesian Nonparametric Models for Data Exploration Melanie F. - - PowerPoint PPT Presentation
Bayesian Nonparametric Models for Data Exploration Melanie F. Pradier Friday 15 th September, 2017 Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions Outline 1 Introduction 2 Bayesian
Melanie F. Pradier
Friday 15th September, 2017
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 1/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Data Exploitation Age
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Data Exploitation Age · · · but are we making the
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
An example: personalized medicine
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
An example: personalized medicine
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
An example: personalized medicine
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
An example: personalized medicine
Challenges
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
An example: personalized medicine
Challenges
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
An example: personalized medicine
Challenges
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
An example: personalized medicine
Challenges
→ data exploration
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
An example: personalized medicine
Challenges
→ data exploration
2018 EU General Data Protection Regulation
“right to explanation”
(Goodman et.al. 2016)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Focus: data exploration
In this thesis . . .
1 How does aging impact our athletic performance? (Ch. 3) 2 What are the underlying mechanisms of cancer? (Ch. 4 & 5) 3 Which factors make countries wealthier than others? (Ch. 6)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 4/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Focus: data exploration
In this thesis . . .
1 How does aging impact our athletic performance? (Ch. 3) 2 What are the underlying mechanisms of cancer? (Ch. 4 & 5) 3 Which factors make countries wealthier than others? (Ch. 6)
Main goal
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 4/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Focus: data exploration
In this thesis . . .
1 How does aging impact our athletic performance? (Ch. 3) 2 What are the underlying mechanisms of cancer? (Ch. 4 & 5) 3 Which factors make countries wealthier than others? (Ch. 6)
Main goal
Our Approach
Bayesian nonparametrics
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 4/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
input data (hypothesis generation)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 5/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
input data (hypothesis generation)
Data Dirichlet process (DP) Gaussian process (GP) Bernoulli Process (BeP) Beta process (BP)
...
Expert Knowledge
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 5/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
input data (hypothesis generation)
Data Dirichlet process (DP) Gaussian process (GP) Bernoulli Process (BeP) Beta process (BP)
...
Expert Knowledge
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 5/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Goal: build useful BNP models for specific data exploration tasks.
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Goal: build useful BNP models for specific data exploration tasks.
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Goal: build useful BNP models for specific data exploration tasks.
Atom-dependent DP mixture model
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Goal: build useful BNP models for specific data exploration tasks.
Atom-dependent DP mixture model
Case-control IBP feature model
heterogeneous structured data
group-specific effects
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Goal: build useful BNP models for specific data exploration tasks.
Atom-dependent DP mixture model
Case-control IBP feature model
heterogeneous structured data
group-specific effects
Poisson factor analysis (PFA) models
→ flexible feature models for count data
1 Hierarchical PFA:
2 Three-parameter Restricted PFA:
latent space 3 Dynamic PFA:
activation of latent factors
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Goal: build useful BNP models for specific data exploration tasks.
Atom-dependent DP mixture model
Case-control IBP feature model
heterogeneous structured data
group-specific effects
Poisson factor analysis (PFA) models
→ flexible feature models for count data
1 Hierarchical PFA:
2 Three-parameter Restricted PFA:
latent space 3 Dynamic PFA:
activation of latent factors
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 7/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
dataset
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 8/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
dataset
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 8/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
G ∼ DP(α, H) G =
∞
πkδφk
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 9/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
G ∼ DP(α, H) G =
∞
πkδφk
mixture models
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 9/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
G ∼ DP(α, H) G =
∞
πkδφk
mixture models
Stick-breaking representation
(Ishwaran et.al, 2001)
For k = 1, · · · , ∞
vk ∼ Beta(α, 1), πk = vk
k−1
(1−vℓ)
1 . . . k = 1 k = 2 k = 3
π1 π2 π3
π ∼ GEM(α)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 9/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
G ∼ DP(α, H) G =
∞
πkδφk
mixture models
Stick-breaking representation
(Ishwaran et.al, 2001)
For k = 1, · · · , ∞
vk ∼ Beta(α, 1), πk = vk
k−1
(1−vℓ)
1 . . . k = 1 k = 2 k = 3
π1 π2 π3
π ∼ GEM(α)
For k = 1, · · · , ∞
φk ∼ H
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 9/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 10/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1
G =
∞
πkδφk ∼ BP(c, α, H)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 10/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1
G =
∞
πkδφk ∼ BP(c, α, H)
For n = 1, · · · , ∞ ζn =
∞
znkδφk ∼ BeP(G)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 10/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1
G =
∞
πkδφk ∼ BP(c, α, H)
For n = 1, · · · , ∞ ζn =
∞
znkδφk ∼ BeP(G)
Z ∼ IBP(α)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 10/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 11/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 What is the impact of age and gender
2 Can we compare different runners in a
fair manner?
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 12/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 What is the impact of age and gender
2 Can we compare different runners in a
fair manner?
Our Approach
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 12/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
(MacEachern,2000)
J: number of groups
Gj =
∞
πjkδφjk
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 13/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
(MacEachern,2000)
J: number of groups
Gj =
∞
πjkδφjk
Gj =
∞
πjkδφk
Gj =
∞
πkδφjk hierarchical DP G0 ∼ DP (α, H) Gj ∼ DP (γ, G0) single-p DDP G0 ∼ DP (α, H) Gj = Tj [G0]
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 13/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Generative model
xi ≡ marathon finishing time for runner i
π|α ∼ GEM(α) ci|π ∼ Cat(π) µk ∼ N
x ∼ IG (a, b)
xi|other vars ∼ N
x
Bayesian Nonparametric Models for Data Exploration 2017-09-15 14/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Generative model
xji ≡ marathon finishing time for runner i in age group j
π|α ∼ GEM(α) cji|π ∼ Cat(π) µk ∼ N
x ∼ IG (a, b)
θ ∼ N (0, Σθ) xji|other vars ∼ N
x
θ exp
2ν2
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 14/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Generative model
xji ≡ marathon finishing time for runner i in age group j gji ≡ gender
π|α ∼ GEM(α) cji|π ∼ Cat(π) µk ∼ N
x ∼ IG (a, b)
θ ∼ N (0, Σθ) δ ∼N
ω
xji|other vars ∼ N
x
θ exp
2ν2
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 15/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Impact of age
2 4 6 8 0.2 0.4 0.6 0.8
Finishing time (hours)
Histogram pdf by ADDP
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 16/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Impact of age
20 30 40 50 60 70 2 3 4 5
Age Finishing time (hours)
New York City Boston London WMA
2 4 6 8 0.2 0.4 0.6 0.8
Finishing time (hours)
Histogram pdf by ADDP
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 16/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Impact of age
20 30 40 50 60 70 2 3 4 5
Age Finishing time (hours)
µ1 + θj µ2 + θj New York City Boston London WMA
2 4 6 8 0.2 0.4 0.6 0.8
Finishing time (hours)
Histogram pdf by ADDP
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 16/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Impact of gender
10 20 30 40 50 60 70 26 28 30 32 34
age (years) δ + ωj (mins)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 17/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Impact of gender
10 20 30 40 50 60 70 26 28 30 32 34
age (years) δ + ωj (mins)
Other Results
5 10 15 20 25 30 35 40 42.2 6 7 8 9 10 11 12 km Speed (km/h) 5 10 15 20 25 30 35 40 42.2 20 40 60 80 100 Elevation (m) Cluster 0 (7.2%, T=3.80h) Cluster 1 (24.4%, T=3.93h) Cluster 1− (14.9%, T=4.03h) Cluster 1− − (3.6%, T=4.16h) Cluster 2A (13.4%, T=4.17h) Cluster 2A− (11.3%, T=4.27h) Cluster 2A− − (3.2%, T=4.43h) Cluster 2B (1.1%, T=4.32h) Cluster 2B− (1.6%, T=4.47h) Cluster 3 (3.4%, T=4.56h) Cluster 3− (4.4%, T=4.59h) Cluster 3−− (1.4%, T=4.88h)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 17/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 18/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Def: ”any variable that can be used as an indicator of a particular disease state”.
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Def: ”any variable that can be used as an indicator of a particular disease state”.
We want to discover:
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Def: ”any variable that can be used as an indicator of a particular disease state”.
We want to discover:
1 Indicators of disease progression: prognostic biomarkers
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Def: ”any variable that can be used as an indicator of a particular disease state”.
We want to discover:
1 Indicators of disease progression: prognostic biomarkers 2 Indicators of (positive) drug response: predictive biomarkers
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Def: ”any variable that can be used as an indicator of a particular disease state”.
1
We want to discover:
1 Indicators of disease progression: prognostic biomarkers 2 Indicators of (positive) drug response: predictive biomarkers
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
(Valera et.al, 2017)
Latent feature model for heterogeneous datasets
Y•d X φd Z B•d
σ2
B
α
d = 1 . . . D Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 20/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
(Valera et.al, 2017)
Latent feature model for heterogeneous datasets
Y•d X φd Z B•d
σ2
B
α
d = 1 . . . D
data for each dimension d
xnd = Td(ynd; φd) ynd|Z, B ∼ N(Zn•B•d, σ2
y)
Bkd ∼ N(0, σ2
B)
Z ∼ IBP(α)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 20/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
(Valera et.al, 2017)
Latent feature model for heterogeneous datasets
Y•d X φd Z B•d
σ2
B
α
d = 1 . . . D
data for each dimension d
xnd = Td(ynd; φd) ynd|Z, B ∼ N(Zn•B•d, σ2
y)
Bkd ∼ N(0, σ2
B)
Z ∼ IBP(α)
Our contribution to GLFM project
https://github.com/ivaleraM/GLFM Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 20/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Y•d X R φd Z B•d
σ2
B
α
W C•d
σ2
C
α
d = 1 . . . D Rn: drug indicator por patient n
xnd = Td(ynd; φd) ynd|Z, W,B, C, R ∼ N(Zn•B•d+✶[Rn = 1]Wn•C•d, σ2
y)
Bkd ∼ N(0, σ2
B)
Z ∼ IBP(α) Ckd∼ N(0, σ2
C)
W∼ IBP(α)
accelerated Gibbs sampling
multiple hypothesis testing
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 21/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
GPC3 Antibody Treatment against Liver Cancer (J. Hepatology. 2016 Apr, Abou-Alfa et.al.)
Sub-population Drug Identifier F1 F2 F3 Size (number
Mean PFS (months) Median PFS (months) 1. 33.37 3.06 1.65 2. 1 4.07 2.29 2.24 3. 1 17.84 2.72 1.81 4. 1 1 4.72 7.05 7.18 5. 1 51.52 3.22 2.55 6. 1 1 16.77 4.17 3.65 7. 1 1 8.38 1.74 1.33 8. 1 1 1 2.07 2.69 2.65 9. 1 1 29.88 3.36 2.03 10. 1 1 1 4.90 4.44 4.34 11. 1 1 1 4.53 6.31 5.31 12. 1 1 1 1 1.94 10.04 10.01 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 22/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
GPC3 Antibody Treatment against Liver Cancer (J. Hepatology. 2016 Apr, Abou-Alfa et.al.)
Sub-population Drug Identifier F1 F2 F3 Size (number
Mean PFS (months) Median PFS (months) 1. 33.37 3.06 1.65 2. 1 4.07 2.29 2.24 3. 1 17.84 2.72 1.81 4. 1 1 4.72 7.05 7.18 5. 1 51.52 3.22 2.55 6. 1 1 16.77 4.17 3.65 7. 1 1 8.38 1.74 1.33 8. 1 1 1 2.07 2.69 2.65 9. 1 1 29.88 3.36 2.03 10. 1 1 1 4.90 4.44 4.34 11. 1 1 1 4.53 6.31 5.31 12. 1 1 1 1 1.94 10.04 10.01 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 22/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
GPC3 Antibody Treatment against Liver Cancer (J. Hepatology. 2016 Apr, Abou-Alfa et.al.)
Sub-population Drug Identifier F1 F2 F3 Size (number
Mean PFS (months) Median PFS (months) 1. 33.37 3.06 1.65 2. 1 4.07 2.29 2.24 3. 1 17.84 2.72 1.81 4. 1 1 4.72 7.05 7.18 5. 1 51.52 3.22 2.55 6. 1 1 16.77 4.17 3.65 7. 1 1 8.38 1.74 1.33 8. 1 1 1 2.07 2.69 2.65 9. 1 1 29.88 3.36 2.03 10. 1 1 1 4.90 4.44 4.34 11. 1 1 1 4.53 6.31 5.31 12. 1 1 1 1 1.94 10.04 10.01 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 22/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
GPC3 Antibody Treatment against Liver Cancer (J. Hepatology. 2016 Apr, Abou-Alfa et.al.)
Sub-population Drug Identifier F1 F2 F3 Size (number
Mean PFS (months) Median PFS (months) 1. 33.37 3.06 1.65 2. 1 4.07 2.29 2.24 3. 1 17.84 2.72 1.81 4. 1 1 4.72 7.05 7.18 5. 1 51.52 3.22 2.55 6. 1 1 16.77 4.17 3.65 7. 1 1 8.38 1.74 1.33 8. 1 1 1 2.07 2.69 2.65 9. 1 1 29.88 3.36 2.03 10. 1 1 1 4.90 4.44 4.34 11. 1 1 1 4.53 6.31 5.31 12. 1 1 1 1 1.94 10.04 10.01
5 10 15
1 2 3 4 5 6 7 8 9 1 1 1 1 2
PFS (months)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 22/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Treatment-specific feature F3
−4 −2 2
A g e W e i g h t H e i g h t B M I D C t r
g h A F P A A T C R P C P s c
e C D 1 6 C D 3 P C D 1 6 P C D 3 P S t r
a C D 3 / C D 1 6 n e c r
i c C D 3 / C D 1 6 t u m
C D 3 / C D 1 6 v i a b l e P N e c r
i c P T u m
P V i a b l e H s c
e C y t H s c
e M e m A D C C C D 1 7 A D C C C D 1 6 C D 4 5 B C D 3 C D 4 C D 8 C D 4 / C D 8 C D 8 N K C D 1 6 C D 5 6
D 1 6 + C D 5 6 b r i g h t C D 5 6 d i m C D 1 6
D 5 6 d i m C D 1 6 b r i g h t N K C D 5 6 N K P 4 6 D N D P C D 1 6 M E S F N K P 4 6 M E S F s G P C 3 1 1 4 / 1 6 5 s G P C 3 3 / 5 7 s G P C 3 3 / 6 7 s G P C 3 1 1 / 9 6 S D T L
∆d
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 23/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 24/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
What makes some countries wealthier than others?
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
What makes some countries wealthier than others?
Classical view
1776; Ricardo, 1817)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
What makes some countries wealthier than others?
Classical view
1776; Ricardo, 1817)
economic efficiency
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
What makes some countries wealthier than others?
Classical view
1776; Ricardo, 1817)
economic efficiency
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
What makes some countries wealthier than others?
Classical view
1776; Ricardo, 1817)
economic efficiency
→ block-structure
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
The reality:
200 400 600 40 80 120
Products Countries
RCAnd = End/
p End
n,d End
xnd =
if RCAnd ≥ 1 0,
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
The reality:
200 400 600 40 80 120
Products Countries
RCAnd = End/
p End
n,d End
xnd =
if RCAnd ≥ 1 0,
Properties:
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
The reality:
200 400 600 40 80 120
Products Countries
RCAnd = End/
p End
n,d End
xnd =
if RCAnd ≥ 1 0,
Properties:
1 Triangularity
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
The reality:
200 400 600 40 80 120
Products Countries
RCAnd = End/
p End
n,d End
xnd =
if RCAnd ≥ 1 0,
Properties:
1 Triangularity 2 D ≫ N
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
The reality:
200 400 600 40 80 120
Products Countries
RCAnd = End/
p End
n,d End
xnd =
if RCAnd ≥ 1 0,
Properties:
1 Triangularity 2 D ≫ N
Our Approach
1 Develop an infinite Poisson factor analysis model . . .
2 Design a time-varying extension
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 27/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Generative Model
xnd ∼ Poisson
∼ Gamma
αB
∼ IBP(α)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 27/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
j=1 1 j )
5 5 10 15 20 25 30 35 40 45 50 nz = 97 α = 1 10 20 30 40 5 10 15 20 25 30 35 40 45 50 nz = 339 α = 10
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 28/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Three-parameter IBP (Teh et.al, 2007)
feature weights
Zn• ∼ BeP(µ) (5.1) µ ∼ SBP(1, α, H, c, σ) (5.2) p (Jnew) ∼ Poisson
Γ(n + c)Γ(c + σ)
Bayesian Nonparametric Models for Data Exploration 2017-09-15 29/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Three-parameter IBP (Teh et.al, 2007)
feature weights
Zn• ∼ BeP(µ) (5.1) µ ∼ SBP(1, α, H, c, σ) (5.2) p (Jnew) ∼ Poisson
Γ(n + c)Γ(c + σ)
Bayesian Nonparametric Models for Data Exploration 2017-09-15 29/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Three-parameter IBP (Teh et.al, 2007)
feature weights
Zn• ∼ BeP(µ) (5.1) µ ∼ SBP(1, α, H, c, σ) (5.2) p (Jnew) ∼ Poisson
Γ(n + c)Γ(c + σ)
(Doshi-Velez et.al, 2015)
Zn• ∼ R-BeP(µ, f) (5.3) µ ∼ BP(1, α, H) (5.4)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 29/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Three-parameter IBP (Teh et.al, 2007)
feature weights
Zn• ∼ BeP(µ) (5.1) µ ∼ SBP(1, α, H, c, σ) (5.2) p (Jnew) ∼ Poisson
Γ(n + c)Γ(c + σ)
(Doshi-Velez et.al, 2015)
Zn• ∼ R-BeP(µ, f) (5.3) µ ∼ BP(1, α, H) (5.4)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 29/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
5 5 10 15 20 25 30 35 40 45 50 nz = 97 α = 1 10 20 30 40 5 10 15 20 25 30 35 40 45 50 nz = 339 α = 10
3RBeP-PFA for static scenario
xnd ∼ Poisson
∼ Gamma
αB
∼ 3R-IBP(α, c, σ, f)
programming (Doshi-Velez et.al, 2015)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 30/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
5 5 10 15 20 25 30 35 40 45 50 nz = 97 α = 1 10 20 30 40 5 10 15 20 25 30 35 40 45 50 nz = 339 α = 10
3RBeP-PFA for static scenario
xnd ∼ Poisson
∼ Gamma
αB
∼ 3R-IBP(α, c, σ, f)
programming (Doshi-Velez et.al, 2015)
dBeP-PFA for dynamic scenario
x(t)
nd
∼ Poisson
n•B•d
∼ Gamma
αB
n•
∼ mIBP(α, γ, δ)
backward-sampling (Gael et.al, 2009)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 30/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Quantitative analysis: accuracy Vs interpretability
Metric PMF NNMF BeP-PFA sBeP-PFA 3RBeP-PFA Log Perplexity 1.68 ± 0.01 1.61 ± 0.01 1.59 ± 0.04 3.26 ± 0.17 1.62 ± 0.01 Coherence −264.60 ± 4.74 −263.27 ± 7.45 −149.36 ± 7.56 −178.44 ± 4.50 −140.51 ± 2.73
(a) 2010 SITC database (N = 126, D = 744)
Metric PMF NNMF BeP-PFA sBeP-PFA 3RBeP-PFA Log Perplexity 1.48 ± 0.01 1.47 ± 0.01 1.58 ± 0.01 2.56 ± 0.12 1.57 ± 0.02 Coherence −264.73 ± 3.11 −264.67 ± 6.22 −148.91 ± 10.57 −168.39 ± 13.16 −134.51 ± 4.43
(b) 2010 HS database (N = 123, D = 4890)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 31/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Capturing input sparsity structure 400 400 Empirical Inferred
(a) Baseline
400 400 Empirical Inferred
(b) BeP-PFA
400 400 Empirical Inferred
(c) sBeP-PFA
400 400 Empirical Inferred
(d) 3RBeP-PFA
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 32/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Interpretability
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 33/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Capabilities F0 Bias F1 Agriculture F2 Clothing I F3 Farming F4 Clothing II F5 Electronics I F6 Processed Materials F7 Electronics II F8 Materials I F9 Machinery I F10 Materials II F11 Automobile F12 Chemicals I F13 Chemicals II F14 Machinery II F15 Miscellaneous 1965 1975 1985 1995 2005 0.5 1 Indonesia
F4 F5 F9 F2 F15
1965 1975 1985 1995 2005 0.5 1 Egypt
F4 F11 F1 F2 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 34/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Capabilities F0 Bias F1 Agriculture F2 Clothing I F3 Farming F4 Clothing II F5 Electronics I F6 Processed Materials F7 Electronics II F8 Materials I F9 Machinery I F10 Materials II F11 Automobile F12 Chemicals I F13 Chemicals II F14 Machinery II F15 Miscellaneous 1965 1975 1985 1995 2005 0.5 1 Indonesia
F4 F5 F9 F2 F15
1965 1975 1985 1995 2005 0.5 1 Egypt
F4 F11 F1 F2 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 34/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Capabilities F0 Bias F1 Agriculture F2 Clothing I F3 Farming F4 Clothing II F5 Electronics I F6 Processed Materials F7 Electronics II F8 Materials I F9 Machinery I F10 Materials II F11 Automobile F12 Chemicals I F13 Chemicals II F14 Machinery II F15 Miscellaneous 1965 1975 1985 1995 2005 0.5 1 Indonesia
F4 F5 F9 F2 F15
1965 1975 1985 1995 2005 0.5 1 Egypt
F4 F11 F1 F2 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 34/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Model extension
x(t)
nd
∼ Poisson
n•B•d
∼ Gamma
αB
n•
∼ mIBP(α, γ, δ)
mIBP: markov Indian buffet process (Gael et.al, 2009) 1965 1975 1985 1995 2005 0.5 1 Indonesia
F1 F4 F7 F10 F13
1965 1975 1985 1995 2005 0.5 1 Egypt
F3 F5 F7 F10 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 35/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 36/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
BNPs
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 37/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
BNPs
Sports science
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 37/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
BNPs
Sports science
Cancer research
learning
associations
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 37/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
BNPs
Sports science
Cancer research
learning
associations
Economics
countries over time
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 37/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 Modeling
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 38/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 Modeling
2 Inference
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 38/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 Modeling
2 Inference
3 Validation
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 38/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 39/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
processes: An application to marathon modeling,” PLoS ONE, vol. 11, no. 1, pp. e0147402, Jan. 2016, doi:10.1371/journal.pone.0147402.
Perez-Cruz, and Oscar Puig, “Indian Buffet process identifies NK cell biomarkers as predictors of response to Codrituzumab in patients with advanced hepatocellular carcinoma.,” Submitted to BMC Cancer, September 2017.
heterogeneous datasets,” Submitted to Journal of Machine Learning Research, June 2017, arXiv:1706.03779.
with a lossy-compressed bit,” Entropy, vol. 18, no. 12, pp. 449, 2016, doi:10.3390/e18120449. Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 40/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
exploration tasks,” Workshop on Human Iinterpretability in Machine Learning at Neural Information Processing Systems, 2017, arXiv:1707.08352.
Perez-Cruz, “Bayesian Poisson factorization for genetic associations with clinical features in cancer,” in Machine Learning for Healthcare Workshop in Neural Information Processing Systems, 2015.
Non-parametric: the Next Generation Workshop in Neural Information Processing Systems, 2015.
sentence clustering from electronic health records for genetic associations in cancer,” in Machine Learning for Computational Biology Workshop in Neural Information Processing Systems, 2015.
Fernando Perez-Cruz, “Map/reduce uncollapsed Gibbs sampling for Bayesian nonparametric models,” in Software Engineering for Machine Learning Workshop in Neural Information Processing Systems, 2014. Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 41/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Global feature F1
−2 2 4
A g e W e i g h t H e i g h t B M I D C t r
g h A F P A A T C R P C P s c
e C D 1 6 C D 3 P C D 1 6 P C D 3 P S t r
a C D 3 / C D 1 6 n e c r
i c C D 3 / C D 1 6 t u m
C D 3 / C D 1 6 v i a b l e P N e c r
i c P T u m
P V i a b l e H s c
e C y t H s c
e M e m A D C C C D 1 7 A D C C C D 1 6 C D 4 5 B C D 3 C D 4 C D 8 C D 4 / C D 8 C D 8 N K C D 1 6 C D 5 6
D 1 6 + C D 5 6 b r i g h t C D 5 6 d i m C D 1 6
D 5 6 d i m C D 1 6 b r i g h t N K C D 5 6 N K P 4 6 D N D P C D 1 6 M E S F N K P 4 6 M E S F s G P C 3 1 1 4 / 1 6 5 s G P C 3 3 / 5 7 s G P C 3 3 / 6 7 s G P C 3 1 1 / 9 6 S D T L
∆d
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 42/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Global feature F2
−2 2 4
A g e W e i g h t H e i g h t B M I D C t r
g h A F P A A T C R P C P s c
e C D 1 6 C D 3 P C D 1 6 P C D 3 P S t r
a C D 3 / C D 1 6 n e c r
i c C D 3 / C D 1 6 t u m
C D 3 / C D 1 6 v i a b l e P N e c r
i c P T u m
P V i a b l e H s c
e C y t H s c
e M e m A D C C C D 1 7 A D C C C D 1 6 C D 4 5 B C D 3 C D 4 C D 8 C D 4 / C D 8 C D 8 N K C D 1 6 C D 5 6
D 1 6 + C D 5 6 b r i g h t C D 5 6 d i m C D 1 6
D 5 6 d i m C D 1 6 b r i g h t N K C D 5 6 N K P 4 6 D N D P C D 1 6 M E S F N K P 4 6 M E S F s G P C 3 1 1 4 / 1 6 5 s G P C 3 3 / 5 7 s G P C 3 3 / 6 7 s G P C 3 1 1 / 9 6 S D T L
∆d
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 43/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 1 1 1 1
Feature inactive Feature active
A) T wo-sample hypothesis testing for each dimension d
global features treatment- speci c features
B) E ect size Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 44/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 1 1 1 1
Feature inactive Feature active
A) T wo-sample hypothesis testing for each dimension d
global features treatment- speci c features
M posterior samples L bootstrap instances
B) E ect size Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 44/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
xnd =
K
′
nd,k
where x
′
nd,k ∼ Poisson(Zn•B•d)
(Doshi-Velez et.al, 2015)
(Gael et.al, 2009)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 45/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Interpretability
Top Products (decay 30%) Bkd Bovine 0.49 Miscellaneous Refrigeration Equipment 0.43 Radioactive Chemicals 0.41 Blocks of Iron and Steel 0.41 Rape Seeds 0.40 Animal meat, misc 0.39 Refined Sugars 0.38 Miscellaneous Tire Parts 0.38 Leather Accessories 0.38 Liquor 0.38 Bovine meat 0.38 Embroidery 0.37 Unmilled Barley 0.37 Dried Vegetables 0.36 Textile Fabrics Clothing Accessories 0.36 Horse Meat 0.35 Iron Bars and Rods 0.35 Analog Navigation Devices 0.35
(c) SVD
Top Products (decay 30%) Bkd Miscellaneous Animal Oils 0.78 Bovine and Equine Entrails 0.72 Bovine meat 0.68 Preserved Milk 0.63 Equine 0.62 Butter 0.58
0.57 Glues 0.56
(d) S3R-IBP
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 46/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 “Simple” and “advanced” capabilities 2 Countries divided in two big groups: “quiescence” trap.
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 47/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
1 “Simple” and “advanced” capabilities 2 Countries divided in two big groups: “quiescence” trap.
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 47/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Dynamic PFA
x(t)
nd
∼ Poisson
n• B•d
∼ Gamma
αB ) ak ∼ Beta( α K , 1), bk ∼ Beta(γ, δ), z(t)
nk|ak, bk
∼ Bernoulli
1−z(t−1)
nk
k
b
z(t−1)
nk
k
Qk = 1 − ak ak 1 − bk bk
Bayesian Nonparametric Models for Data Exploration 2017-09-15 48/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Inference
nd = K k=1 r(t) nd,k
p(X(1:t)
n• , z(t) nk|−) = p(X(t) n•|z(t) nk, −)
nk
p(X(1:t−1)
n•
, z(t−1)
nk
|−)p(z(t)
nk|z(t−1) nk
)
nk|X(1:t) n• , Z(t) n,¬k, B)
nk|z(t+1) nk
, X(1:t)
n• , Z(t) n,¬k, B)
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 49/50
Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions
Id Top-3 products with highest weights
F0 (bias) crude petroleum, crustaceans, cereals F1 light fixtures, locksmith hardw., misc. ceramic ornaments F2 inorganic esters, chemical products, nitrogen compound F3 iron sheets, iron wire, thin iron sheets F4
F5 soaps, confectionary sugar, baked goods F6 bovine – equine entrails, bovine meat, misc. prepared meats F7 knit clothing accessories, linens, leather accessor. F8 glazes, textiles fabrics for machinery, mineral wool F9
F10 inorganic bases, nitrogenous fertilizers, lubricating petrol. oils F11 imitation jewellery, embroidery, synth. precious stones F12 coffee, non-coniferous worked wood, cane sugar F13 copper ores, chemical wood pulp, misc. non-ferrous ores F14 pepper, vegetable planting materials, natural rubber F15 raw cotton, cotton linters, green groundnuts
1965 1975 1985 1995 2005 China Switzerland Colombia Denmark Egypt Spain Finland France Hungary Indonesia India Ireland Iran Israel Italy Jordan Japan Uruguay United States Venezuela
Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 50/50