Bayesian Nonparametric Models for Data Exploration Melanie F. - - PowerPoint PPT Presentation

bayesian nonparametric models for data exploration
SMART_READER_LITE
LIVE PREVIEW

Bayesian Nonparametric Models for Data Exploration Melanie F. - - PowerPoint PPT Presentation

Bayesian Nonparametric Models for Data Exploration Melanie F. Pradier Friday 15 th September, 2017 Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions Outline 1 Introduction 2 Bayesian


slide-1
SLIDE 1

Bayesian Nonparametric Models for Data Exploration

Melanie F. Pradier

Friday 15th September, 2017

slide-2
SLIDE 2

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Outline

1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 1/50

slide-3
SLIDE 3

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50

slide-4
SLIDE 4

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50

slide-5
SLIDE 5

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50

slide-6
SLIDE 6

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50

slide-7
SLIDE 7

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50

slide-8
SLIDE 8

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50

slide-9
SLIDE 9

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Data Exploitation Age

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50

slide-10
SLIDE 10

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Data Exploitation Age · · · but are we making the

  • utmost out of data?

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 2/50

slide-11
SLIDE 11

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

An example: personalized medicine

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50

slide-12
SLIDE 12

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

An example: personalized medicine

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50

slide-13
SLIDE 13

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

An example: personalized medicine

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50

slide-14
SLIDE 14

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

An example: personalized medicine

Challenges

  • Complexity

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50

slide-15
SLIDE 15

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

An example: personalized medicine

Challenges

  • Complexity
  • Missing data

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50

slide-16
SLIDE 16

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

An example: personalized medicine

Challenges

  • Complexity
  • Missing data
  • Small data within big data
  • · · ·

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50

slide-17
SLIDE 17

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

An example: personalized medicine

Challenges

  • Complexity
  • Missing data
  • Small data within big data
  • · · ·
  • Research focus

→ data exploration

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50

slide-18
SLIDE 18

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

An example: personalized medicine

Challenges

  • Complexity
  • Missing data
  • Small data within big data
  • · · ·
  • Research focus

→ data exploration

2018 EU General Data Protection Regulation

“right to explanation”

(Goodman et.al. 2016)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 3/50

slide-19
SLIDE 19

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Focus: data exploration

In this thesis . . .

1 How does aging impact our athletic performance? (Ch. 3) 2 What are the underlying mechanisms of cancer? (Ch. 4 & 5) 3 Which factors make countries wealthier than others? (Ch. 6)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 4/50

slide-20
SLIDE 20

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Focus: data exploration

In this thesis . . .

1 How does aging impact our athletic performance? (Ch. 3) 2 What are the underlying mechanisms of cancer? (Ch. 4 & 5) 3 Which factors make countries wealthier than others? (Ch. 6)

Main goal

  • Knowledge discovery
  • Hypothesis generation

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 4/50

slide-21
SLIDE 21

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

Focus: data exploration

In this thesis . . .

1 How does aging impact our athletic performance? (Ch. 3) 2 What are the underlying mechanisms of cancer? (Ch. 4 & 5) 3 Which factors make countries wealthier than others? (Ch. 6)

Main goal

  • Knowledge discovery
  • Hypothesis generation

Our Approach

Bayesian nonparametrics

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 4/50

slide-22
SLIDE 22

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Why Bayesian nonparametrics?

  • Bayesian: to handle uncertainty
  • Nonparametric: to adapt model complexity depending on

input data (hypothesis generation)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 5/50

slide-23
SLIDE 23

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Why Bayesian nonparametrics?

  • Bayesian: to handle uncertainty
  • Nonparametric: to adapt model complexity depending on

input data (hypothesis generation)

Data Dirichlet process (DP) Gaussian process (GP) Bernoulli Process (BeP) Beta process (BP)

...

Expert Knowledge

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 5/50

slide-24
SLIDE 24

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Why Bayesian nonparametrics?

  • Bayesian: to handle uncertainty
  • Nonparametric: to adapt model complexity depending on

input data (hypothesis generation)

Data Dirichlet process (DP) Gaussian process (GP) Bernoulli Process (BeP) Beta process (BP)

...

Expert Knowledge

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 5/50

slide-25
SLIDE 25

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Contributions

Goal: build useful BNP models for specific data exploration tasks.

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50

slide-26
SLIDE 26

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Contributions

Goal: build useful BNP models for specific data exploration tasks.

  • Novel applications
  • Make things work with real data (modeling, inference, validation)
  • Interpretability, sharing across observations, replicability
  • Open-source software and databases

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50

slide-27
SLIDE 27

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Contributions

Goal: build useful BNP models for specific data exploration tasks.

Atom-dependent DP mixture model

  • estimates density in stratified data
  • suitable for fairness requirements
  • link to mixture of experts

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50

slide-28
SLIDE 28

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Contributions

Goal: build useful BNP models for specific data exploration tasks.

Atom-dependent DP mixture model

  • estimates density in stratified data
  • suitable for fairness requirements
  • link to mixture of experts

Case-control IBP feature model

  • infers latent features in

heterogeneous structured data

  • suitable to separate global and

group-specific effects

  • combined with statistical testing

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50

slide-29
SLIDE 29

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Contributions

Goal: build useful BNP models for specific data exploration tasks.

Atom-dependent DP mixture model

  • estimates density in stratified data
  • suitable for fairness requirements
  • link to mixture of experts

Case-control IBP feature model

  • infers latent features in

heterogeneous structured data

  • suitable to separate global and

group-specific effects

  • combined with statistical testing

Poisson factor analysis (PFA) models

→ flexible feature models for count data

1 Hierarchical PFA:

  • deals with stratified data

2 Three-parameter Restricted PFA:

  • imposes structured sparsity in

latent space 3 Dynamic PFA:

  • allows for time-varying

activation of latent factors

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50

slide-30
SLIDE 30

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Contributions

Goal: build useful BNP models for specific data exploration tasks.

Atom-dependent DP mixture model

  • estimates density in stratified data
  • suitable for fairness requirements
  • link to mixture of experts
  • Application: marathon

Case-control IBP feature model

  • infers latent features in

heterogeneous structured data

  • suitable to separate global and

group-specific effects

  • combined with statistical testing
  • Application: clinical trial

Poisson factor analysis (PFA) models

→ flexible feature models for count data

1 Hierarchical PFA:

  • deals with stratified data

2 Three-parameter Restricted PFA:

  • imposes structured sparsity in

latent space 3 Dynamic PFA:

  • allows for time-varying

activation of latent factors

  • Application: international trade

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 6/50

slide-31
SLIDE 31

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Outline

1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 7/50

slide-32
SLIDE 32

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Bayesian nonparametrics (BNPs)

  • Bayesian framework for model selection
  • Nonparametric: number of parameters grows with the amount
  • f data:
  • Prior over infinite-dimensional parameter space
  • Only a finite subset of parameters is used for any finite

dataset

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 8/50

slide-33
SLIDE 33

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Bayesian nonparametrics (BNPs)

  • Bayesian framework for model selection
  • Nonparametric: number of parameters grows with the amount
  • f data:
  • Prior over infinite-dimensional parameter space
  • Only a finite subset of parameters is used for any finite

dataset

  • Rely on stochastic processes:
  • Dirichlet process
  • Beta process
  • Gaussian process
  • · · ·

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 8/50

slide-34
SLIDE 34

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Dirichlet process (DP)

G ∼ DP(α, H) G =

  • k=1

πkδφk

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 9/50

slide-35
SLIDE 35

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Dirichlet process (DP)

G ∼ DP(α, H) G =

  • k=1

πkδφk

  • central block for infinite

mixture models

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 9/50

slide-36
SLIDE 36

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Dirichlet process (DP)

G ∼ DP(α, H) G =

  • k=1

πkδφk

  • central block for infinite

mixture models

Stick-breaking representation

(Ishwaran et.al, 2001)

For k = 1, · · · , ∞

vk ∼ Beta(α, 1), πk = vk

k−1

  • ℓ=1

(1−vℓ)

1 . . . k = 1 k = 2 k = 3

π1 π2 π3

π ∼ GEM(α)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 9/50

slide-37
SLIDE 37

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Dirichlet process (DP)

G ∼ DP(α, H) G =

  • k=1

πkδφk

  • central block for infinite

mixture models

Stick-breaking representation

(Ishwaran et.al, 2001)

For k = 1, · · · , ∞

vk ∼ Beta(α, 1), πk = vk

k−1

  • ℓ=1

(1−vℓ)

1 . . . k = 1 k = 2 k = 3

π1 π2 π3

π ∼ GEM(α)

For k = 1, · · · , ∞

φk ∼ H

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 9/50

slide-38
SLIDE 38

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Indian buffet process (IBP)

  • central block for infinite latent feature models

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 10/50

slide-39
SLIDE 39

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Indian buffet process (IBP)

  • central block for infinite latent feature models
  • hierarchy of a Beta process (BP) with multiple Bernoulli processes (BeP)

1

G =

  • k=1

πkδφk ∼ BP(c, α, H)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 10/50

slide-40
SLIDE 40

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Indian buffet process (IBP)

  • central block for infinite latent feature models
  • hierarchy of a Beta process (BP) with multiple Bernoulli processes (BeP)

1

G =

  • k=1

πkδφk ∼ BP(c, α, H)

For n = 1, · · · , ∞ ζn =

  • k=1

znkδφk ∼ BeP(G)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 10/50

slide-41
SLIDE 41

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Indian buffet process (IBP)

  • central block for infinite latent feature models
  • hierarchy of a Beta process (BP) with multiple Bernoulli processes (BeP)

1

G =

  • k=1

πkδφk ∼ BP(c, α, H)

For n = 1, · · · , ∞ ζn =

  • k=1

znkδφk ∼ BeP(G)

Z ∼ IBP(α)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 10/50

slide-42
SLIDE 42

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Outline

1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 11/50

slide-43
SLIDE 43

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

1 What is the impact of age and gender

  • n runners performance?

2 Can we compare different runners in a

fair manner?

  • entry requirements
  • rewards

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 12/50

slide-44
SLIDE 44

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation

1 What is the impact of age and gender

  • n runners performance?

2 Can we compare different runners in a

fair manner?

  • entry requirements
  • rewards

Our Approach

  • dependent density estimation model
  • delivers scientific knowledge in sport sciences
  • constitutes a fair age-gender grading system
  • relies on dependent Dirichlet process

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 12/50

slide-45
SLIDE 45

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Dependent Dirichlet process (DDP)

(MacEachern,2000)

J: number of groups

Gj =

  • k=1

πjkδφjk

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 13/50

slide-46
SLIDE 46

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Dependent Dirichlet process (DDP)

(MacEachern,2000)

J: number of groups

Gj =

  • k=1

πjkδφjk

  • hierarchical DP (Teh et.al, 2005)

Gj =

  • k=1

πjkδφk

  • single-p DDP (MacEachern, 2000)

Gj =

  • k=1

πkδφjk hierarchical DP G0 ∼ DP (α, H) Gj ∼ DP (γ, G0) single-p DDP G0 ∼ DP (α, H) Gj = Tj [G0]

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 13/50

slide-47
SLIDE 47

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Atom-dependent DP mixture model

Generative model

xi ≡ marathon finishing time for runner i

π|α ∼ GEM(α) ci|π ∼ Cat(π) µk ∼ N

  • µ0, σ2
  • σ2

x ∼ IG (a, b)

xi|other vars ∼ N

  • xi|µci, σ2

x

  • Melanie F. Pradier (UC3M)

Bayesian Nonparametric Models for Data Exploration 2017-09-15 14/50

slide-48
SLIDE 48

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Atom-dependent DP mixture model

Generative model

xji ≡ marathon finishing time for runner i in age group j

π|α ∼ GEM(α) cji|π ∼ Cat(π) µk ∼ N

  • µ0, σ2
  • σ2

x ∼ IG (a, b)

θ ∼ N (0, Σθ) xji|other vars ∼ N

  • xji|µcji+θj, σ2

x

  • (Σθ)ℓq = σ2

θ exp

  • −(ℓ − q)2

2ν2

  • + κδ(ℓ − q)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 14/50

slide-49
SLIDE 49

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Atom-dependent DP mixture model

Generative model

xji ≡ marathon finishing time for runner i in age group j gji ≡ gender

π|α ∼ GEM(α) cji|π ∼ Cat(π) µk ∼ N

  • µ0, σ2
  • σ2

x ∼ IG (a, b)

θ ∼ N (0, Σθ) δ ∼N

  • 0, σ2

ω

  • ω ∼N (0, Σω)

xji|other vars ∼ N

  • xji|µcji+θj+✶[gji = 1](δ + ωj), σ2

x

  • (Σθ)ℓq = σ2

θ exp

  • −(ℓ − q)2

2ν2

  • + κδ(ℓ − q)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 15/50

slide-50
SLIDE 50

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results

Impact of age

  • MCMC approach
  • conditional conjugacy
  • block Gibbs sampler
  • 1/4 M runners

2 4 6 8 0.2 0.4 0.6 0.8

Finishing time (hours)

Histogram pdf by ADDP

  • Indiv. clusters

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 16/50

slide-51
SLIDE 51

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results

Impact of age

20 30 40 50 60 70 2 3 4 5

Age Finishing time (hours)

New York City Boston London WMA

  • MCMC approach
  • conditional conjugacy
  • block Gibbs sampler
  • 1/4 M runners

2 4 6 8 0.2 0.4 0.6 0.8

Finishing time (hours)

Histogram pdf by ADDP

  • Indiv. clusters

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 16/50

slide-52
SLIDE 52

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results

Impact of age

20 30 40 50 60 70 2 3 4 5

Age Finishing time (hours)

µ1 + θj µ2 + θj New York City Boston London WMA

  • MCMC approach
  • conditional conjugacy
  • block Gibbs sampler
  • 1/4 M runners

2 4 6 8 0.2 0.4 0.6 0.8

Finishing time (hours)

Histogram pdf by ADDP

  • Indiv. clusters

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 16/50

slide-53
SLIDE 53

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results

Impact of gender

10 20 30 40 50 60 70 26 28 30 32 34

age (years) δ + ωj (mins)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 17/50

slide-54
SLIDE 54

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results

Impact of gender

10 20 30 40 50 60 70 26 28 30 32 34

age (years) δ + ωj (mins)

Other Results

  • Speed-dependent cluster means
  • Link to mixture of experts
  • Analysis of running patterns
  • Prediction of finishing time

5 10 15 20 25 30 35 40 42.2 6 7 8 9 10 11 12 km Speed (km/h) 5 10 15 20 25 30 35 40 42.2 20 40 60 80 100 Elevation (m) Cluster 0 (7.2%, T=3.80h) Cluster 1 (24.4%, T=3.93h) Cluster 1− (14.9%, T=4.03h) Cluster 1− − (3.6%, T=4.16h) Cluster 2A (13.4%, T=4.17h) Cluster 2A− (11.3%, T=4.27h) Cluster 2A− − (3.2%, T=4.43h) Cluster 2B (1.1%, T=4.32h) Cluster 2B− (1.6%, T=4.47h) Cluster 3 (3.4%, T=4.56h) Cluster 3− (4.4%, T=4.59h) Cluster 3−− (1.4%, T=4.88h)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 17/50

slide-55
SLIDE 55

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Outline

1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 18/50

slide-56
SLIDE 56

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: biomarker discovery in clinical trials

Def: ”any variable that can be used as an indicator of a particular disease state”.

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50

slide-57
SLIDE 57

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: biomarker discovery in clinical trials

Def: ”any variable that can be used as an indicator of a particular disease state”.

We want to discover:

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50

slide-58
SLIDE 58

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: biomarker discovery in clinical trials

Def: ”any variable that can be used as an indicator of a particular disease state”.

We want to discover:

1 Indicators of disease progression: prognostic biomarkers

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50

slide-59
SLIDE 59

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: biomarker discovery in clinical trials

Def: ”any variable that can be used as an indicator of a particular disease state”.

We want to discover:

1 Indicators of disease progression: prognostic biomarkers 2 Indicators of (positive) drug response: predictive biomarkers

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50

slide-60
SLIDE 60

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: biomarker discovery in clinical trials

Def: ”any variable that can be used as an indicator of a particular disease state”.

1

We want to discover:

1 Indicators of disease progression: prognostic biomarkers 2 Indicators of (positive) drug response: predictive biomarkers

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 19/50

slide-61
SLIDE 61

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

General latent feature model (GLFM)

(Valera et.al, 2017)

Latent feature model for heterogeneous datasets

Y•d X φd Z B•d

σ2

B

α

d = 1 . . . D Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 20/50

slide-62
SLIDE 62

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

General latent feature model (GLFM)

(Valera et.al, 2017)

Latent feature model for heterogeneous datasets

Y•d X φd Z B•d

σ2

B

α

d = 1 . . . D

  • Link functions Td depend on type of

data for each dimension d

xnd = Td(ynd; φd) ynd|Z, B ∼ N(Zn•B•d, σ2

y)

Bkd ∼ N(0, σ2

B)

Z ∼ IBP(α)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 20/50

slide-63
SLIDE 63

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

General latent feature model (GLFM)

(Valera et.al, 2017)

Latent feature model for heterogeneous datasets

Y•d X φd Z B•d

σ2

B

α

d = 1 . . . D

  • Link functions Td depend on type of

data for each dimension d

xnd = Td(ynd; φd) ynd|Z, B ∼ N(Zn•B•d, σ2

y)

Bkd ∼ N(0, σ2

B)

Z ∼ IBP(α)

Our contribution to GLFM project

  • Open-source python code
  • Simulations for data exploration

https://github.com/ivaleraM/GLFM Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 20/50

slide-64
SLIDE 64

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Our contribution: Case-control IBP (C-IBP)

Y•d X R φd Z B•d

σ2

B

α

W C•d

σ2

C

α

d = 1 . . . D Rn: drug indicator por patient n

xnd = Td(ynd; φd) ynd|Z, W,B, C, R ∼ N(Zn•B•d+✶[Rn = 1]Wn•C•d, σ2

y)

Bkd ∼ N(0, σ2

B)

Z ∼ IBP(α) Ckd∼ N(0, σ2

C)

W∼ IBP(α)

  • Inference: MCMC approach with

accelerated Gibbs sampling

  • Biomarker discovery: statistical

multiple hypothesis testing

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 21/50

slide-65
SLIDE 65

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results: subpopulations

GPC3 Antibody Treatment against Liver Cancer (J. Hepatology. 2016 Apr, Abou-Alfa et.al.)

  • 180 patients: 60 took a placebo, 120 took the drug
  • PFS: Progression Free Survival

Sub-population Drug Identifier F1 F2 F3 Size (number

  • f patients)

Mean PFS (months) Median PFS (months) 1. 33.37 3.06 1.65 2. 1 4.07 2.29 2.24 3. 1 17.84 2.72 1.81 4. 1 1 4.72 7.05 7.18 5. 1 51.52 3.22 2.55 6. 1 1 16.77 4.17 3.65 7. 1 1 8.38 1.74 1.33 8. 1 1 1 2.07 2.69 2.65 9. 1 1 29.88 3.36 2.03 10. 1 1 1 4.90 4.44 4.34 11. 1 1 1 4.53 6.31 5.31 12. 1 1 1 1 1.94 10.04 10.01 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 22/50

slide-66
SLIDE 66

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results: subpopulations

GPC3 Antibody Treatment against Liver Cancer (J. Hepatology. 2016 Apr, Abou-Alfa et.al.)

  • 180 patients: 60 took a placebo, 120 took the drug
  • PFS: Progression Free Survival

Sub-population Drug Identifier F1 F2 F3 Size (number

  • f patients)

Mean PFS (months) Median PFS (months) 1. 33.37 3.06 1.65 2. 1 4.07 2.29 2.24 3. 1 17.84 2.72 1.81 4. 1 1 4.72 7.05 7.18 5. 1 51.52 3.22 2.55 6. 1 1 16.77 4.17 3.65 7. 1 1 8.38 1.74 1.33 8. 1 1 1 2.07 2.69 2.65 9. 1 1 29.88 3.36 2.03 10. 1 1 1 4.90 4.44 4.34 11. 1 1 1 4.53 6.31 5.31 12. 1 1 1 1 1.94 10.04 10.01 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 22/50

slide-67
SLIDE 67

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results: subpopulations

GPC3 Antibody Treatment against Liver Cancer (J. Hepatology. 2016 Apr, Abou-Alfa et.al.)

  • 180 patients: 60 took a placebo, 120 took the drug
  • PFS: Progression Free Survival

Sub-population Drug Identifier F1 F2 F3 Size (number

  • f patients)

Mean PFS (months) Median PFS (months) 1. 33.37 3.06 1.65 2. 1 4.07 2.29 2.24 3. 1 17.84 2.72 1.81 4. 1 1 4.72 7.05 7.18 5. 1 51.52 3.22 2.55 6. 1 1 16.77 4.17 3.65 7. 1 1 8.38 1.74 1.33 8. 1 1 1 2.07 2.69 2.65 9. 1 1 29.88 3.36 2.03 10. 1 1 1 4.90 4.44 4.34 11. 1 1 1 4.53 6.31 5.31 12. 1 1 1 1 1.94 10.04 10.01 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 22/50

slide-68
SLIDE 68

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results: subpopulations

GPC3 Antibody Treatment against Liver Cancer (J. Hepatology. 2016 Apr, Abou-Alfa et.al.)

  • 180 patients: 60 took a placebo, 120 took the drug
  • PFS: Progression Free Survival

Sub-population Drug Identifier F1 F2 F3 Size (number

  • f patients)

Mean PFS (months) Median PFS (months) 1. 33.37 3.06 1.65 2. 1 4.07 2.29 2.24 3. 1 17.84 2.72 1.81 4. 1 1 4.72 7.05 7.18 5. 1 51.52 3.22 2.55 6. 1 1 16.77 4.17 3.65 7. 1 1 8.38 1.74 1.33 8. 1 1 1 2.07 2.69 2.65 9. 1 1 29.88 3.36 2.03 10. 1 1 1 4.90 4.44 4.34 11. 1 1 1 4.53 6.31 5.31 12. 1 1 1 1 1.94 10.04 10.01

5 10 15

1 2 3 4 5 6 7 8 9 1 1 1 1 2

PFS (months)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 22/50

slide-69
SLIDE 69

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results: biomarker discovery

Treatment-specific feature F3

−4 −2 2

A g e W e i g h t H e i g h t B M I D C t r

  • u

g h A F P A A T C R P C P s c

  • r

e C D 1 6 C D 3 P C D 1 6 P C D 3 P S t r

  • m

a C D 3 / C D 1 6 n e c r

  • t

i c C D 3 / C D 1 6 t u m

  • r

C D 3 / C D 1 6 v i a b l e P N e c r

  • t

i c P T u m

  • r

P V i a b l e H s c

  • r

e C y t H s c

  • r

e M e m A D C C C D 1 7 A D C C C D 1 6 C D 4 5 B C D 3 C D 4 C D 8 C D 4 / C D 8 C D 8 N K C D 1 6 C D 5 6

  • C

D 1 6 + C D 5 6 b r i g h t C D 5 6 d i m C D 1 6

  • C

D 5 6 d i m C D 1 6 b r i g h t N K C D 5 6 N K P 4 6 D N D P C D 1 6 M E S F N K P 4 6 M E S F s G P C 3 1 1 4 / 1 6 5 s G P C 3 3 / 5 7 s G P C 3 3 / 6 7 s G P C 3 1 1 / 9 6 S D T L

∆d

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 23/50

slide-70
SLIDE 70

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Outline

1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 24/50

slide-71
SLIDE 71

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

What makes some countries wealthier than others?

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50

slide-72
SLIDE 72

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

What makes some countries wealthier than others?

Classical view

  • Division of labor (A. Smith,

1776; Ricardo, 1817)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50

slide-73
SLIDE 73

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

What makes some countries wealthier than others?

Classical view

  • Division of labor (A. Smith,

1776; Ricardo, 1817)

  • Specialization leads to

economic efficiency

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50

slide-74
SLIDE 74

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

What makes some countries wealthier than others?

Classical view

  • Division of labor (A. Smith,

1776; Ricardo, 1817)

  • Specialization leads to

economic efficiency

  • Export portfolios

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50

slide-75
SLIDE 75

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

What makes some countries wealthier than others?

Classical view

  • Division of labor (A. Smith,

1776; Ricardo, 1817)

  • Specialization leads to

economic efficiency

  • Export portfolios

→ block-structure

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 25/50

slide-76
SLIDE 76

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

The reality:

200 400 600 40 80 120

Products Countries

RCAnd = End/

p End

  • n End/

n,d End

xnd =

  • 1,

if RCAnd ≥ 1 0,

  • therwise

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50

slide-77
SLIDE 77

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

The reality:

200 400 600 40 80 120

Products Countries

RCAnd = End/

p End

  • n End/

n,d End

xnd =

  • 1,

if RCAnd ≥ 1 0,

  • therwise

Properties:

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50

slide-78
SLIDE 78

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

The reality:

200 400 600 40 80 120

Products Countries

RCAnd = End/

p End

  • n End/

n,d End

xnd =

  • 1,

if RCAnd ≥ 1 0,

  • therwise

Properties:

1 Triangularity

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50

slide-79
SLIDE 79

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

The reality:

200 400 600 40 80 120

Products Countries

RCAnd = End/

p End

  • n End/

n,d End

xnd =

  • 1,

if RCAnd ≥ 1 0,

  • therwise

Properties:

1 Triangularity 2 D ≫ N

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50

slide-80
SLIDE 80

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Motivation: wealth of nations

The reality:

200 400 600 40 80 120

Products Countries

RCAnd = End/

p End

  • n End/

n,d End

xnd =

  • 1,

if RCAnd ≥ 1 0,

  • therwise

Properties:

1 Triangularity 2 D ≫ N

Our Approach

1 Develop an infinite Poisson factor analysis model . . .

  • flexible prior
  • feature sparsity

2 Design a time-varying extension

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 26/50

slide-81
SLIDE 81

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Bernoulli process Poisson factor analysis (BeP-PFA)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 27/50

slide-82
SLIDE 82

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Bernoulli process Poisson factor analysis (BeP-PFA)

Generative Model

xnd ∼ Poisson

  • Zn•B•d
  • Bkd

∼ Gamma

  • αB, µB

αB

  • Z

∼ IBP(α)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 27/50

slide-83
SLIDE 83

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Limitation of the IBP

  • Number of ones per row Jn ∝ Poisson(α)
  • Number of non-empty features K ∝ Poisson(α N

j=1 1 j )

  • Mass parameter α couples both Jn and K

5 5 10 15 20 25 30 35 40 45 50 nz = 97 α = 1 10 20 30 40 5 10 15 20 25 30 35 40 45 50 nz = 339 α = 10

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 28/50

slide-84
SLIDE 84

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Beyond the standard IBP

Three-parameter IBP (Teh et.al, 2007)

  • More flexible distribution for

feature weights

Zn• ∼ BeP(µ) (5.1) µ ∼ SBP(1, α, H, c, σ) (5.2) p (Jnew) ∼ Poisson

  • αΓ(1 + c)Γ(n + c + σ − 1)

Γ(n + c)Γ(c + σ)

  • Melanie F. Pradier (UC3M)

Bayesian Nonparametric Models for Data Exploration 2017-09-15 29/50

slide-85
SLIDE 85

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Beyond the standard IBP

Three-parameter IBP (Teh et.al, 2007)

  • More flexible distribution for

feature weights

Zn• ∼ BeP(µ) (5.1) µ ∼ SBP(1, α, H, c, σ) (5.2) p (Jnew) ∼ Poisson

  • αΓ(1 + c)Γ(n + c + σ − 1)

Γ(n + c)Γ(c + σ)

  • Melanie F. Pradier (UC3M)

Bayesian Nonparametric Models for Data Exploration 2017-09-15 29/50

slide-86
SLIDE 86

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Beyond the standard IBP

Three-parameter IBP (Teh et.al, 2007)

  • More flexible distribution for

feature weights

Zn• ∼ BeP(µ) (5.1) µ ∼ SBP(1, α, H, c, σ) (5.2) p (Jnew) ∼ Poisson

  • αΓ(1 + c)Γ(n + c + σ − 1)

Γ(n + c)Γ(c + σ)

  • Restricted IBP

(Doshi-Velez et.al, 2015)

  • Arbitrary prior f over Jn

Zn• ∼ R-BeP(µ, f) (5.3) µ ∼ BP(1, α, H) (5.4)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 29/50

slide-87
SLIDE 87

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Beyond the standard IBP

Three-parameter IBP (Teh et.al, 2007)

  • More flexible distribution for

feature weights

Zn• ∼ BeP(µ) (5.1) µ ∼ SBP(1, α, H, c, σ) (5.2) p (Jnew) ∼ Poisson

  • αΓ(1 + c)Γ(n + c + σ − 1)

Γ(n + c)Γ(c + σ)

  • Restricted IBP

(Doshi-Velez et.al, 2015)

  • Arbitrary prior f over Jn

Zn• ∼ R-BeP(µ, f) (5.3) µ ∼ BP(1, α, H) (5.4)

  • Combination of both
  • Flexible prior

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 29/50

slide-88
SLIDE 88

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Our contributions

5 5 10 15 20 25 30 35 40 45 50 nz = 97 α = 1 10 20 30 40 5 10 15 20 25 30 35 40 45 50 nz = 339 α = 10

3RBeP-PFA for static scenario

xnd ∼ Poisson

  • Zn•B•d
  • Bkd

∼ Gamma

  • αB, µB

αB

  • Z

∼ 3R-IBP(α, c, σ, f)

  • Inference: aux. vars + dynamic

programming (Doshi-Velez et.al, 2015)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 30/50

slide-89
SLIDE 89

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Our contributions

5 5 10 15 20 25 30 35 40 45 50 nz = 97 α = 1 10 20 30 40 5 10 15 20 25 30 35 40 45 50 nz = 339 α = 10

3RBeP-PFA for static scenario

xnd ∼ Poisson

  • Zn•B•d
  • Bkd

∼ Gamma

  • αB, µB

αB

  • Z

∼ 3R-IBP(α, c, σ, f)

  • Inference: aux. vars + dynamic

programming (Doshi-Velez et.al, 2015)

dBeP-PFA for dynamic scenario

x(t)

nd

∼ Poisson

  • Z(t)

n•B•d

  • Bkd

∼ Gamma

  • αB, µB

αB

  • Z(•)

n•

∼ mIBP(α, γ, δ)

  • Inference: forward-filtering

backward-sampling (Gael et.al, 2009)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 30/50

slide-90
SLIDE 90

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results in static scenario

Quantitative analysis: accuracy Vs interpretability

Metric PMF NNMF BeP-PFA sBeP-PFA 3RBeP-PFA Log Perplexity 1.68 ± 0.01 1.61 ± 0.01 1.59 ± 0.04 3.26 ± 0.17 1.62 ± 0.01 Coherence −264.60 ± 4.74 −263.27 ± 7.45 −149.36 ± 7.56 −178.44 ± 4.50 −140.51 ± 2.73

(a) 2010 SITC database (N = 126, D = 744)

Metric PMF NNMF BeP-PFA sBeP-PFA 3RBeP-PFA Log Perplexity 1.48 ± 0.01 1.47 ± 0.01 1.58 ± 0.01 2.56 ± 0.12 1.57 ± 0.02 Coherence −264.73 ± 3.11 −264.67 ± 6.22 −148.91 ± 10.57 −168.39 ± 13.16 −134.51 ± 4.43

(b) 2010 HS database (N = 123, D = 4890)

  • PMF: Probabilistic matrix factorization (Mnih et.al, 2008)
  • NNMF: Non-negative matrix factorization (Schmidt et.al, 2009)
  • BeP-PFA: Bernoulli process Poisson factor analysis
  • sBeP-PFA: sparse Bernoulli process Poisson factor analysis
  • 3RBeP-PFA: Three-parameter Restricted Bernoulli process Poisson factor analysis

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 31/50

slide-91
SLIDE 91

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results in static scenario

Capturing input sparsity structure 400 400 Empirical Inferred

(a) Baseline

400 400 Empirical Inferred

(b) BeP-PFA

400 400 Empirical Inferred

(c) sBeP-PFA

400 400 Empirical Inferred

(d) 3RBeP-PFA

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 32/50

slide-92
SLIDE 92

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results in static scenario

Interpretability

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 33/50

slide-93
SLIDE 93

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Temporal Dynamics

Capabilities F0 Bias F1 Agriculture F2 Clothing I F3 Farming F4 Clothing II F5 Electronics I F6 Processed Materials F7 Electronics II F8 Materials I F9 Machinery I F10 Materials II F11 Automobile F12 Chemicals I F13 Chemicals II F14 Machinery II F15 Miscellaneous 1965 1975 1985 1995 2005 0.5 1 Indonesia

F4 F5 F9 F2 F15

1965 1975 1985 1995 2005 0.5 1 Egypt

F4 F11 F1 F2 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 34/50

slide-94
SLIDE 94

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Temporal Dynamics

Capabilities F0 Bias F1 Agriculture F2 Clothing I F3 Farming F4 Clothing II F5 Electronics I F6 Processed Materials F7 Electronics II F8 Materials I F9 Machinery I F10 Materials II F11 Automobile F12 Chemicals I F13 Chemicals II F14 Machinery II F15 Miscellaneous 1965 1975 1985 1995 2005 0.5 1 Indonesia

F4 F5 F9 F2 F15

1965 1975 1985 1995 2005 0.5 1 Egypt

F4 F11 F1 F2 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 34/50

slide-95
SLIDE 95

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Temporal Dynamics

Capabilities F0 Bias F1 Agriculture F2 Clothing I F3 Farming F4 Clothing II F5 Electronics I F6 Processed Materials F7 Electronics II F8 Materials I F9 Machinery I F10 Materials II F11 Automobile F12 Chemicals I F13 Chemicals II F14 Machinery II F15 Miscellaneous 1965 1975 1985 1995 2005 0.5 1 Indonesia

F4 F5 F9 F2 F15

1965 1975 1985 1995 2005 0.5 1 Egypt

F4 F11 F1 F2 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 34/50

slide-96
SLIDE 96

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Model extension: Dynamic PFA

Model extension

x(t)

nd

∼ Poisson

  • Z(t)

n•B•d

  • Bkd

∼ Gamma

  • αB, µB

αB

  • Z(•)

n•

∼ mIBP(α, γ, δ)

mIBP: markov Indian buffet process (Gael et.al, 2009) 1965 1975 1985 1995 2005 0.5 1 Indonesia

F1 F4 F7 F10 F13

1965 1975 1985 1995 2005 0.5 1 Egypt

F3 F5 F7 F10 Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 35/50

slide-97
SLIDE 97

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Outline

1 Introduction 2 Bayesian nonparametrics 3 ADDP mixture model for marathon model 4 C-IBP feature model for clinical trials 5 PFA models for international trade 6 Conclusions

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 36/50

slide-98
SLIDE 98

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Conclusions

BNPs

  • useful BNP models for specific data exploration tasks
  • Fair density estimation model
  • Structured general latent feature model (global and group-specific factors)
  • Flexible Poisson factor analysis models in static/dynamic scenarios

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 37/50

slide-99
SLIDE 99

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Conclusions

BNPs

  • useful BNP models for specific data exploration tasks
  • Fair density estimation model
  • Structured general latent feature model (global and group-specific factors)
  • Flexible Poisson factor analysis models in static/dynamic scenarios

Sports science

  • age-gender curves
  • fair grading system
  • running patterns
  • ver time

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 37/50

slide-100
SLIDE 100

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Conclusions

BNPs

  • useful BNP models for specific data exploration tasks
  • Fair density estimation model
  • Structured general latent feature model (global and group-specific factors)
  • Flexible Poisson factor analysis models in static/dynamic scenarios

Sports science

  • age-gender curves
  • fair grading system
  • running patterns
  • ver time

Cancer research

  • subpopulation

learning

  • biomarker discovery
  • clinico-genetic

associations

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 37/50

slide-101
SLIDE 101

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Conclusions

BNPs

  • useful BNP models for specific data exploration tasks
  • Fair density estimation model
  • Structured general latent feature model (global and group-specific factors)
  • Flexible Poisson factor analysis models in static/dynamic scenarios

Sports science

  • age-gender curves
  • fair grading system
  • running patterns
  • ver time

Cancer research

  • subpopulation

learning

  • biomarker discovery
  • clinico-genetic

associations

Economics

  • meaningful features
  • evolution of

countries over time

  • transition model

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 37/50

slide-102
SLIDE 102

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Future Work

1 Modeling

  • encode complex prior knowledge
  • generalized ADDP: multiple-input/output, other applications
  • atom-dependent latent feature model
  • . . .

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 38/50

slide-103
SLIDE 103

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Future Work

1 Modeling

  • encode complex prior knowledge
  • generalized ADDP: multiple-input/output, other applications
  • atom-dependent latent feature model
  • . . .

2 Inference

  • scale algorithms (e.g., black-box variational inference)
  • better exploration (e.g., split-merge moves)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 38/50

slide-104
SLIDE 104

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Future Work

1 Modeling

  • encode complex prior knowledge
  • generalized ADDP: multiple-input/output, other applications
  • atom-dependent latent feature model
  • . . .

2 Inference

  • scale algorithms (e.g., black-box variational inference)
  • better exploration (e.g., split-merge moves)

3 Validation

  • new “data exploration” metrics
  • how to quantify model utility?

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 38/50

slide-105
SLIDE 105

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Thank you for listening!

Any questions?

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 39/50

slide-106
SLIDE 106

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Journal Publications

  • Melanie F. Pradier, Francisco J. R. Ruiz, and Fernando Perez-Cruz, “Prior design for dependent Dirichlet

processes: An application to marathon modeling,” PLoS ONE, vol. 11, no. 1, pp. e0147402, Jan. 2016, doi:10.1371/journal.pone.0147402.

  • Melanie F. Pradier, Bernhard Reis, Lori Jukofsky, Francesca Milletti, Toshihiko Ohtomo, Fernando

Perez-Cruz, and Oscar Puig, “Indian Buffet process identifies NK cell biomarkers as predictors of response to Codrituzumab in patients with advanced hepatocellular carcinoma.,” Submitted to BMC Cancer, September 2017.

  • Isabel Valera, Melanie F. Pradier, and Zoubin Ghaharamani, “General latent feature model for

heterogeneous datasets,” Submitted to Journal of Machine Learning Research, June 2017, arXiv:1706.03779.

  • Melanie F. Pradier, Pablo M. Olmos, and Fernando Perez-Cruz, “Entropy-constrained scalar quantization

with a lossy-compressed bit,” Entropy, vol. 18, no. 12, pp. 449, 2016, doi:10.3390/e18120449. Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 40/50

slide-107
SLIDE 107

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Workshop Publications

  • Isabel Valera, Melanie F. Pradier, and Zoubin Ghahramani, “General latent feature modeling for data

exploration tasks,” Workshop on Human Iinterpretability in Machine Learning at Neural Information Processing Systems, 2017, arXiv:1707.08352.

  • Melanie F. Pradier, Theofanis Karaletsos, Stefan Stark, Julia E. Vogt, Gunnar Ratsch, and Fernando

Perez-Cruz, “Bayesian Poisson factorization for genetic associations with clinical features in cancer,” in Machine Learning for Healthcare Workshop in Neural Information Processing Systems, 2015.

  • Melanie F. Pradier and Fernando Perez-Cruz, “Infinite mixture of global Gaussian processes,” in Bayesian

Non-parametric: the Next Generation Workshop in Neural Information Processing Systems, 2015.

  • Melanie F. Pradier, Stefan Stark, Stephanie Hyland, Julia E. Vogt, and Gunnar Ratsch, “Large-scale

sentence clustering from electronic health records for genetic associations in cancer,” in Machine Learning for Computational Biology Workshop in Neural Information Processing Systems, 2015.

  • Melanie F. Pradier, Pablo G. Moreno, Francisco J. R. Ruiz, Isabel Valera, Harold Molina-Bulla, and

Fernando Perez-Cruz, “Map/reduce uncollapsed Gibbs sampling for Bayesian nonparametric models,” in Software Engineering for Machine Learning Workshop in Neural Information Processing Systems, 2014. Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 41/50

slide-108
SLIDE 108

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results: biomarker discovery

Global feature F1

−2 2 4

A g e W e i g h t H e i g h t B M I D C t r

  • u

g h A F P A A T C R P C P s c

  • r

e C D 1 6 C D 3 P C D 1 6 P C D 3 P S t r

  • m

a C D 3 / C D 1 6 n e c r

  • t

i c C D 3 / C D 1 6 t u m

  • r

C D 3 / C D 1 6 v i a b l e P N e c r

  • t

i c P T u m

  • r

P V i a b l e H s c

  • r

e C y t H s c

  • r

e M e m A D C C C D 1 7 A D C C C D 1 6 C D 4 5 B C D 3 C D 4 C D 8 C D 4 / C D 8 C D 8 N K C D 1 6 C D 5 6

  • C

D 1 6 + C D 5 6 b r i g h t C D 5 6 d i m C D 1 6

  • C

D 5 6 d i m C D 1 6 b r i g h t N K C D 5 6 N K P 4 6 D N D P C D 1 6 M E S F N K P 4 6 M E S F s G P C 3 1 1 4 / 1 6 5 s G P C 3 3 / 5 7 s G P C 3 3 / 6 7 s G P C 3 1 1 / 9 6 S D T L

∆d

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 42/50

slide-109
SLIDE 109

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Results: biomarker discovery

Global feature F2

−2 2 4

A g e W e i g h t H e i g h t B M I D C t r

  • u

g h A F P A A T C R P C P s c

  • r

e C D 1 6 C D 3 P C D 1 6 P C D 3 P S t r

  • m

a C D 3 / C D 1 6 n e c r

  • t

i c C D 3 / C D 1 6 t u m

  • r

C D 3 / C D 1 6 v i a b l e P N e c r

  • t

i c P T u m

  • r

P V i a b l e H s c

  • r

e C y t H s c

  • r

e M e m A D C C C D 1 7 A D C C C D 1 6 C D 4 5 B C D 3 C D 4 C D 8 C D 4 / C D 8 C D 8 N K C D 1 6 C D 5 6

  • C

D 1 6 + C D 5 6 b r i g h t C D 5 6 d i m C D 1 6

  • C

D 5 6 d i m C D 1 6 b r i g h t N K C D 5 6 N K P 4 6 D N D P C D 1 6 M E S F N K P 4 6 M E S F s G P C 3 1 1 4 / 1 6 5 s G P C 3 3 / 5 7 s G P C 3 3 / 6 7 s G P C 3 1 1 / 9 6 S D T L

∆d

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 43/50

slide-110
SLIDE 110

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Statistical procedure for biomarker discovery

1 1 1 1 1

{

{

Feature inactive Feature active

A) T wo-sample hypothesis testing for each dimension d

global features treatment- speci c features

B) E ect size Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 44/50

slide-111
SLIDE 111

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Statistical procedure for biomarker discovery

1 1 1 1 1

{

{

Feature inactive Feature active

A) T wo-sample hypothesis testing for each dimension d

global features treatment- speci c features

M posterior samples L bootstrap instances

B) E ect size Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 44/50

slide-112
SLIDE 112

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Appendix: Inference in PFA models

  • Markov Chain Monte Carlo approach.
  • Conditional conjugacy using auxiliary variables.

xnd =

K

  • x

nd,k

where x

nd,k ∼ Poisson(Zn•B•d)

  • Truncated approximation of feature weights
  • In 3RBeP-PFA, dynamic programming to compute likelihood

(Doshi-Velez et.al, 2015)

  • In dBeP-PFA, forward-filtering backward-sampling procedure

(Gael et.al, 2009)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 45/50

slide-113
SLIDE 113

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Appendix: Results for 3RBeP-PFA

Interpretability

Top Products (decay 30%) Bkd Bovine 0.49 Miscellaneous Refrigeration Equipment 0.43 Radioactive Chemicals 0.41 Blocks of Iron and Steel 0.41 Rape Seeds 0.40 Animal meat, misc 0.39 Refined Sugars 0.38 Miscellaneous Tire Parts 0.38 Leather Accessories 0.38 Liquor 0.38 Bovine meat 0.38 Embroidery 0.37 Unmilled Barley 0.37 Dried Vegetables 0.36 Textile Fabrics Clothing Accessories 0.36 Horse Meat 0.35 Iron Bars and Rods 0.35 Analog Navigation Devices 0.35

(c) SVD

Top Products (decay 30%) Bkd Miscellaneous Animal Oils 0.78 Bovine and Equine Entrails 0.72 Bovine meat 0.68 Preserved Milk 0.63 Equine 0.62 Butter 0.58

  • Misc. Animal Origin Materials

0.57 Glues 0.56

(d) S3R-IBP

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 46/50

slide-114
SLIDE 114

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Deep 3RBeP-PFA: using a 2nd layer

1 “Simple” and “advanced” capabilities 2 Countries divided in two big groups: “quiescence” trap.

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 47/50

slide-115
SLIDE 115

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Deep 3RBeP-PFA: using a 2nd layer

1 “Simple” and “advanced” capabilities 2 Countries divided in two big groups: “quiescence” trap.

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 47/50

slide-116
SLIDE 116

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Appendix: Modeling dBeP-PFA

Dynamic PFA

  • T timestamps (years)
  • markov IBP to account for temporal dynamics (Gael et.al, 2009)
  • Generative model:

x(t)

nd

∼ Poisson

  • Z(t)

n• B•d

  • Bkd

∼ Gamma

  • αB, µB

αB ) ak ∼ Beta( α K , 1), bk ∼ Beta(γ, δ), z(t)

nk|ak, bk

∼ Bernoulli

  • a

1−z(t−1)

nk

k

b

z(t−1)

nk

k

  • The transition matrix Qk for feature k is given by:

Qk = 1 − ak ak 1 − bk bk

  • Melanie F. Pradier (UC3M)

Bayesian Nonparametric Models for Data Exploration 2017-09-15 48/50

slide-117
SLIDE 117

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Appendix: Inference dBeP-PFA

Inference

  • MCMC approach, e.g., Gibbs sampler + slice sampler for the IBP
  • K Poisson-distributed auxiliary random variables, i.e., x(t)

nd = K k=1 r(t) nd,k

  • Forward Filtering Backward Sampling (FFBS) to approximate p(Z|X, B)

p(X(1:t)

n• , z(t) nk|−) = p(X(t) n•|z(t) nk, −)

  • z(t−1)

nk

p(X(1:t−1)

n•

, z(t−1)

nk

|−)p(z(t)

nk|z(t−1) nk

)

  • Forward step: compute p(z(t)

nk|X(1:t) n• , Z(t) n,¬k, B)

  • Backward step: sample from p(z(t)

nk|z(t+1) nk

, X(1:t)

n• , Z(t) n,¬k, B)

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 49/50

slide-118
SLIDE 118

Intro BNPs ADDP for Marathon Modeling C-IBP for Clinical Trial PFAs for International Trade Conclusions

Appendix: Results for dBeP-PFA

Id Top-3 products with highest weights

F0 (bias) crude petroleum, crustaceans, cereals F1 light fixtures, locksmith hardw., misc. ceramic ornaments F2 inorganic esters, chemical products, nitrogen compound F3 iron sheets, iron wire, thin iron sheets F4

  • misc. elect. machinery, typewriters, misc. office equipment

F5 soaps, confectionary sugar, baked goods F6 bovine – equine entrails, bovine meat, misc. prepared meats F7 knit clothing accessories, linens, leather accessor. F8 glazes, textiles fabrics for machinery, mineral wool F9

  • misc. vegetables, grapes – raisins, misc. fruit

F10 inorganic bases, nitrogenous fertilizers, lubricating petrol. oils F11 imitation jewellery, embroidery, synth. precious stones F12 coffee, non-coniferous worked wood, cane sugar F13 copper ores, chemical wood pulp, misc. non-ferrous ores F14 pepper, vegetable planting materials, natural rubber F15 raw cotton, cotton linters, green groundnuts

1965 1975 1985 1995 2005 China Switzerland Colombia Denmark Egypt Spain Finland France Hungary Indonesia India Ireland Iran Israel Italy Jordan Japan Uruguay United States Venezuela

Melanie F. Pradier (UC3M) Bayesian Nonparametric Models for Data Exploration 2017-09-15 50/50