Pseudo-Bayes Factors Stefano Cabras 1 , Walter Racugno 1 and Laura - - PowerPoint PPT Presentation

pseudo bayes factors
SMART_READER_LITE
LIVE PREVIEW

Pseudo-Bayes Factors Stefano Cabras 1 , Walter Racugno 1 and Laura - - PowerPoint PPT Presentation

Pseudo-Bayes Factors Stefano Cabras 1 , Walter Racugno 1 and Laura Ventura 2 1 Department of Mathematics, University of Cagliari 2 Department of Statistics, University of Padova Compstat, 2010 Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris,


slide-1
SLIDE 1

Pseudo-Bayes Factors

Stefano Cabras1, Walter Racugno1 and Laura Ventura2

1 Department of Mathematics, University of Cagliari 2 Department of Statistics, University of Padova

Compstat, 2010

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 1 / 15

slide-2
SLIDE 2

Aims Consider a sampling model p(y; θ), with θ = (ψ, λ) ∈ Θ ⊆ Rp, where

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 2 / 15

slide-3
SLIDE 3

Aims Consider a sampling model p(y; θ), with θ = (ψ, λ) ∈ Θ ⊆ Rp, where

ψ is a scalar parameter of interest

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 2 / 15

slide-4
SLIDE 4

Aims Consider a sampling model p(y; θ), with θ = (ψ, λ) ∈ Θ ⊆ Rp, where

ψ is a scalar parameter of interest λ is a p − 1 dimensional nuisance parameter.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 2 / 15

slide-5
SLIDE 5

Aims Consider a sampling model p(y; θ), with θ = (ψ, λ) ∈ Θ ⊆ Rp, where

ψ is a scalar parameter of interest λ is a p − 1 dimensional nuisance parameter.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 2 / 15

slide-6
SLIDE 6

Aims Consider a sampling model p(y; θ), with θ = (ψ, λ) ∈ Θ ⊆ Rp, where

ψ is a scalar parameter of interest λ is a p − 1 dimensional nuisance parameter.

We are interested in testing H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 2 / 15

slide-7
SLIDE 7

Aims Consider a sampling model p(y; θ), with θ = (ψ, λ) ∈ Θ ⊆ Rp, where

ψ is a scalar parameter of interest λ is a p − 1 dimensional nuisance parameter.

We are interested in testing H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1 Classical approach: use Bayes Factor (BF) based on the ratio of integrated likelihoods. It requires

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 2 / 15

slide-8
SLIDE 8

Aims Consider a sampling model p(y; θ), with θ = (ψ, λ) ∈ Θ ⊆ Rp, where

ψ is a scalar parameter of interest λ is a p − 1 dimensional nuisance parameter.

We are interested in testing H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1 Classical approach: use Bayes Factor (BF) based on the ratio of integrated likelihoods. It requires

prior elicitation on λ;

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 2 / 15

slide-9
SLIDE 9

Aims Consider a sampling model p(y; θ), with θ = (ψ, λ) ∈ Θ ⊆ Rp, where

ψ is a scalar parameter of interest λ is a p − 1 dimensional nuisance parameter.

We are interested in testing H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1 Classical approach: use Bayes Factor (BF) based on the ratio of integrated likelihoods. It requires

prior elicitation on λ; calculation on a p-dimensional integral

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 2 / 15

slide-10
SLIDE 10

Aims Consider a sampling model p(y; θ), with θ = (ψ, λ) ∈ Θ ⊆ Rp, where

ψ is a scalar parameter of interest λ is a p − 1 dimensional nuisance parameter.

We are interested in testing H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1 Classical approach: use Bayes Factor (BF) based on the ratio of integrated likelihoods. It requires

prior elicitation on λ; calculation on a p-dimensional integral

Proposed approach: use a pseudo-BF based on a pseudo-likelihood L∗(ψ), which is a function of ψ only.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 2 / 15

slide-11
SLIDE 11

Classical Bayesian Hypothesis Testing (Kass, Raftery, 1995) Test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1, with the Bayes Factor BF =

  • Ψ0
  • Λ L(ψ, λ)π0(λ|ψ)π0(ψ) dλ dψ
  • Ψ1
  • Λ L(ψ, λ)π1(λ|ψ)π1(ψ) dλ dψ,

where

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 3 / 15

slide-12
SLIDE 12

Classical Bayesian Hypothesis Testing (Kass, Raftery, 1995) Test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1, with the Bayes Factor BF =

  • Ψ0
  • Λ L(ψ, λ)π0(λ|ψ)π0(ψ) dλ dψ
  • Ψ1
  • Λ L(ψ, λ)π1(λ|ψ)π1(ψ) dλ dψ,

where

L(ψ, λ) = L(ψ, λ; y) is the full likelihood based on data y;

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 3 / 15

slide-13
SLIDE 13

Classical Bayesian Hypothesis Testing (Kass, Raftery, 1995) Test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1, with the Bayes Factor BF =

  • Ψ0
  • Λ L(ψ, λ)π0(λ|ψ)π0(ψ) dλ dψ
  • Ψ1
  • Λ L(ψ, λ)π1(λ|ψ)π1(ψ) dλ dψ,

where

L(ψ, λ) = L(ψ, λ; y) is the full likelihood based on data y; πk(ψ) for ψ ∈ Ψk, k = 0, 1 are priors under H0 and H1 respectively;

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 3 / 15

slide-14
SLIDE 14

Classical Bayesian Hypothesis Testing (Kass, Raftery, 1995) Test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1, with the Bayes Factor BF =

  • Ψ0
  • Λ L(ψ, λ)π0(λ|ψ)π0(ψ) dλ dψ
  • Ψ1
  • Λ L(ψ, λ)π1(λ|ψ)π1(ψ) dλ dψ,

where

L(ψ, λ) = L(ψ, λ; y) is the full likelihood based on data y; πk(ψ) for ψ ∈ Ψk, k = 0, 1 are priors under H0 and H1 respectively; πk(λ|ψ) for ψ ∈ Ψk are priors on the nuisance parameter given ψ;

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 3 / 15

slide-15
SLIDE 15

Classical Bayesian Hypothesis Testing (Kass, Raftery, 1995) Test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1, with the Bayes Factor BF =

  • Ψ0
  • Λ L(ψ, λ)π0(λ|ψ)π0(ψ) dλ dψ
  • Ψ1
  • Λ L(ψ, λ)π1(λ|ψ)π1(ψ) dλ dψ,

where

L(ψ, λ) = L(ψ, λ; y) is the full likelihood based on data y; πk(ψ) for ψ ∈ Ψk, k = 0, 1 are priors under H0 and H1 respectively; πk(λ|ψ) for ψ ∈ Ψk are priors on the nuisance parameter given ψ;

This approach

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 3 / 15

slide-16
SLIDE 16

Classical Bayesian Hypothesis Testing (Kass, Raftery, 1995) Test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1, with the Bayes Factor BF =

  • Ψ0
  • Λ L(ψ, λ)π0(λ|ψ)π0(ψ) dλ dψ
  • Ψ1
  • Λ L(ψ, λ)π1(λ|ψ)π1(ψ) dλ dψ,

where

L(ψ, λ) = L(ψ, λ; y) is the full likelihood based on data y; πk(ψ) for ψ ∈ Ψk, k = 0, 1 are priors under H0 and H1 respectively; πk(λ|ψ) for ψ ∈ Ψk are priors on the nuisance parameter given ψ;

This approach

requires elicitation of priors πk(λ|ψ)

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 3 / 15

slide-17
SLIDE 17

Classical Bayesian Hypothesis Testing (Kass, Raftery, 1995) Test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1, with the Bayes Factor BF =

  • Ψ0
  • Λ L(ψ, λ)π0(λ|ψ)π0(ψ) dλ dψ
  • Ψ1
  • Λ L(ψ, λ)π1(λ|ψ)π1(ψ) dλ dψ,

where

L(ψ, λ) = L(ψ, λ; y) is the full likelihood based on data y; πk(ψ) for ψ ∈ Ψk, k = 0, 1 are priors under H0 and H1 respectively; πk(λ|ψ) for ψ ∈ Ψk are priors on the nuisance parameter given ψ;

This approach

requires elicitation of priors πk(λ|ψ)

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 3 / 15

slide-18
SLIDE 18

Classical Bayesian Hypothesis Testing (Kass, Raftery, 1995) Test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1, with the Bayes Factor BF =

  • Ψ0
  • Λ L(ψ, λ)π0(λ|ψ)π0(ψ) dλ dψ
  • Ψ1
  • Λ L(ψ, λ)π1(λ|ψ)π1(ψ) dλ dψ,

where

L(ψ, λ) = L(ψ, λ; y) is the full likelihood based on data y; πk(ψ) for ψ ∈ Ψk, k = 0, 1 are priors under H0 and H1 respectively; πk(λ|ψ) for ψ ∈ Ψk are priors on the nuisance parameter given ψ;

This approach

requires elicitation of priors πk(λ|ψ) ... critical when p is large and/or λ has not physical meaning;

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 3 / 15

slide-19
SLIDE 19

Classical Bayesian Hypothesis Testing (Kass, Raftery, 1995) Test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1, with the Bayes Factor BF =

  • Ψ0
  • Λ L(ψ, λ)π0(λ|ψ)π0(ψ) dλ dψ
  • Ψ1
  • Λ L(ψ, λ)π1(λ|ψ)π1(ψ) dλ dψ,

where

L(ψ, λ) = L(ψ, λ; y) is the full likelihood based on data y; πk(ψ) for ψ ∈ Ψk, k = 0, 1 are priors under H0 and H1 respectively; πk(λ|ψ) for ψ ∈ Ψk are priors on the nuisance parameter given ψ;

This approach

requires elicitation of priors πk(λ|ψ) ... critical when p is large and/or λ has not physical meaning; needs p dimensional integrations on (Ψk, Λ).

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 3 / 15

slide-20
SLIDE 20

Pseudo likelihoods (Pace, Salvan, 1997; Severini, 2000) in Bayesian inference (Severini, 1999; Ventura et al., 2009; 2010) Nuisance parameters λ are eliminated using a pseudo-likelihood L∗(ψ).

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 4 / 15

slide-21
SLIDE 21

Pseudo likelihoods (Pace, Salvan, 1997; Severini, 2000) in Bayesian inference (Severini, 1999; Ventura et al., 2009; 2010) Nuisance parameters λ are eliminated using a pseudo-likelihood L∗(ψ). Properties of L∗(ψ) are similar to those of a genuine likelihood function.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 4 / 15

slide-22
SLIDE 22

Pseudo likelihoods (Pace, Salvan, 1997; Severini, 2000) in Bayesian inference (Severini, 1999; Ventura et al., 2009; 2010) Nuisance parameters λ are eliminated using a pseudo-likelihood L∗(ψ). Properties of L∗(ψ) are similar to those of a genuine likelihood function. Examples of L∗(ψ) are the conditional, marginal, profile and the Modified Profile likelihoods (MPL)

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 4 / 15

slide-23
SLIDE 23

Pseudo likelihoods (Pace, Salvan, 1997; Severini, 2000) in Bayesian inference (Severini, 1999; Ventura et al., 2009; 2010) Nuisance parameters λ are eliminated using a pseudo-likelihood L∗(ψ). Properties of L∗(ψ) are similar to those of a genuine likelihood function. Examples of L∗(ψ) are the conditional, marginal, profile and the Modified Profile likelihoods (MPL) Using L∗(ψ) as a true likelihood, a posterior distribution of ψ can be considered π∗(ψ|y) ∝ L∗(ψ)π(ψ), where π(ψ) is a suitable prior on ψ only.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 4 / 15

slide-24
SLIDE 24

Pseudo likelihoods (Pace, Salvan, 1997; Severini, 2000) in Bayesian inference (Severini, 1999; Ventura et al., 2009; 2010) Nuisance parameters λ are eliminated using a pseudo-likelihood L∗(ψ). Properties of L∗(ψ) are similar to those of a genuine likelihood function. Examples of L∗(ψ) are the conditional, marginal, profile and the Modified Profile likelihoods (MPL) Using L∗(ψ) as a true likelihood, a posterior distribution of ψ can be considered π∗(ψ|y) ∝ L∗(ψ)π(ψ), where π(ψ) is a suitable prior on ψ only. Advantages in using π∗(ψ|y):

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 4 / 15

slide-25
SLIDE 25

Pseudo likelihoods (Pace, Salvan, 1997; Severini, 2000) in Bayesian inference (Severini, 1999; Ventura et al., 2009; 2010) Nuisance parameters λ are eliminated using a pseudo-likelihood L∗(ψ). Properties of L∗(ψ) are similar to those of a genuine likelihood function. Examples of L∗(ψ) are the conditional, marginal, profile and the Modified Profile likelihoods (MPL) Using L∗(ψ) as a true likelihood, a posterior distribution of ψ can be considered π∗(ψ|y) ∝ L∗(ψ)π(ψ), where π(ψ) is a suitable prior on ψ only. Advantages in using π∗(ψ|y):

no elicitation on λ;

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 4 / 15

slide-26
SLIDE 26

Pseudo likelihoods (Pace, Salvan, 1997; Severini, 2000) in Bayesian inference (Severini, 1999; Ventura et al., 2009; 2010) Nuisance parameters λ are eliminated using a pseudo-likelihood L∗(ψ). Properties of L∗(ψ) are similar to those of a genuine likelihood function. Examples of L∗(ψ) are the conditional, marginal, profile and the Modified Profile likelihoods (MPL) Using L∗(ψ) as a true likelihood, a posterior distribution of ψ can be considered π∗(ψ|y) ∝ L∗(ψ)π(ψ), where π(ψ) is a suitable prior on ψ only. Advantages in using π∗(ψ|y):

no elicitation on λ;

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 4 / 15

slide-27
SLIDE 27

Pseudo likelihoods (Pace, Salvan, 1997; Severini, 2000) in Bayesian inference (Severini, 1999; Ventura et al., 2009; 2010) Nuisance parameters λ are eliminated using a pseudo-likelihood L∗(ψ). Properties of L∗(ψ) are similar to those of a genuine likelihood function. Examples of L∗(ψ) are the conditional, marginal, profile and the Modified Profile likelihoods (MPL) Using L∗(ψ) as a true likelihood, a posterior distribution of ψ can be considered π∗(ψ|y) ∝ L∗(ψ)π(ψ), where π(ψ) is a suitable prior on ψ only. Advantages in using π∗(ψ|y):

no elicitation on λ; no multi-dimensional integration over Λ.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 4 / 15

slide-28
SLIDE 28

Pseudo-Bayes Factors BF ∗ We use π∗(ψ|y) in order to test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 5 / 15

slide-29
SLIDE 29

Pseudo-Bayes Factors BF ∗ We use π∗(ψ|y) in order to test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1. In our approach we only need:

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 5 / 15

slide-30
SLIDE 30

Pseudo-Bayes Factors BF ∗ We use π∗(ψ|y) in order to test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1. In our approach we only need:

a pseudo-likelihood L∗(ψ)

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 5 / 15

slide-31
SLIDE 31

Pseudo-Bayes Factors BF ∗ We use π∗(ψ|y) in order to test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1. In our approach we only need:

a pseudo-likelihood L∗(ψ) priors πk(ψ) for ψ ∈ Ψk, k = 0, 1

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 5 / 15

slide-32
SLIDE 32

Pseudo-Bayes Factors BF ∗ We use π∗(ψ|y) in order to test H0 : ψ ∈ Ψ0 against H1 : ψ ∈ Ψ1. In our approach we only need:

a pseudo-likelihood L∗(ψ) priors πk(ψ) for ψ ∈ Ψk, k = 0, 1

We then define the Pseudo-Bayes Factor: BF ∗ =

  • Ψ0 L∗(ψ)π0(ψ) dψ
  • Ψ1 L∗(ψ)π1(ψ) dψ =
  • Ψ0 π∗

0(ψ|y) dψ

  • Ψ1 π∗

1(ψ|y) dψ,

which needs only 1 dimensional integrations over Ψk.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 5 / 15

slide-33
SLIDE 33

Comparison of BF ∗ versus BF The null model H0 is favored when BF ∗ > 1 o (BF > 1).

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 6 / 15

slide-34
SLIDE 34

Comparison of BF ∗ versus BF The null model H0 is favored when BF ∗ > 1 o (BF > 1). To evaluate the behavior of BF ∗ we compare it versus BF in terms

  • f the corresponding the Frequentist Risks, R∗(ψ, λ) and R(ψ, λ),

where R∗(ψ, λ) = Pr(BF ∗(y) < 1|H0) + Pr(BF ∗(y) > 1|H1) R(ψ, λ) = Pr(BF(y) < 1|H0) + Pr(BF(y) > 1|H1)

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 6 / 15

slide-35
SLIDE 35

Comparison of BF ∗ versus BF The null model H0 is favored when BF ∗ > 1 o (BF > 1). To evaluate the behavior of BF ∗ we compare it versus BF in terms

  • f the corresponding the Frequentist Risks, R∗(ψ, λ) and R(ψ, λ),

where R∗(ψ, λ) = Pr(BF ∗(y) < 1|H0) + Pr(BF ∗(y) > 1|H1) R(ψ, λ) = Pr(BF(y) < 1|H0) + Pr(BF(y) > 1|H1) If R∗(ψ, λ) < R(ψ, λ) then, at point (ψ, λ), BF ∗ outperforms BF.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 6 / 15

slide-36
SLIDE 36

Comparison of BF ∗ versus BF The null model H0 is favored when BF ∗ > 1 o (BF > 1). To evaluate the behavior of BF ∗ we compare it versus BF in terms

  • f the corresponding the Frequentist Risks, R∗(ψ, λ) and R(ψ, λ),

where R∗(ψ, λ) = Pr(BF ∗(y) < 1|H0) + Pr(BF ∗(y) > 1|H1) R(ψ, λ) = Pr(BF(y) < 1|H0) + Pr(BF(y) > 1|H1) If R∗(ψ, λ) < R(ψ, λ) then, at point (ψ, λ), BF ∗ outperforms BF. In the next examples we assume a favorable scenario for BF: πk(λ|ψ) = π(λ) for k = 0, 1, with π(λ) concentrated on the true values.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 6 / 15

slide-37
SLIDE 37

Comparison of BF ∗ versus BF The null model H0 is favored when BF ∗ > 1 o (BF > 1). To evaluate the behavior of BF ∗ we compare it versus BF in terms

  • f the corresponding the Frequentist Risks, R∗(ψ, λ) and R(ψ, λ),

where R∗(ψ, λ) = Pr(BF ∗(y) < 1|H0) + Pr(BF ∗(y) > 1|H1) R(ψ, λ) = Pr(BF(y) < 1|H0) + Pr(BF(y) > 1|H1) If R∗(ψ, λ) < R(ψ, λ) then, at point (ψ, λ), BF ∗ outperforms BF. In the next examples we assume a favorable scenario for BF: πk(λ|ψ) = π(λ) for k = 0, 1, with π(λ) concentrated on the true values. Both R∗(ψ, λ) and R(ψ, λ) are approximated by Monte Carlo simulations.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 6 / 15

slide-38
SLIDE 38

Examples We discuss two examples:

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 7 / 15

slide-39
SLIDE 39

Examples We discuss two examples:

1

stress-strength model;

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 7 / 15

slide-40
SLIDE 40

Examples We discuss two examples:

1

stress-strength model;

2

logistic regression.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 7 / 15

slide-41
SLIDE 41

Examples We discuss two examples:

1

stress-strength model;

2

logistic regression.

We mainly focus on the Modified Profile Likelihood (Severini, 2000)

Lmp(ψ) = Lp(ψ) |jλλ(ψ, ˆ λψ)|1/2 |I(ψ, ˆ λψ; ˆ θ)| , where Lp(ψ) = L(ψ, ˆ λψ), ˆ θ is the MLE for θ, ˆ λψ is the conditional MLE of λ and I(ψ, λ; θ0) = Eθ0(ℓλ(ψ, λ)ℓλ(ψ0, λ0)T) , with θ0 = (ψ0, λ0) and ℓλ(ψ, λ) = ∂ℓ(ψ, λ)/∂λ.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 7 / 15

slide-42
SLIDE 42

Example 1. Stress-strength model (Kotz et al., 2003) Let X ∼ Exp(α) and Y ∼ Exp(β) be random variables. Interest is

  • n

ψ = Pr(X < Y) = α/(α + β) with λ = α.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 8 / 15

slide-43
SLIDE 43

Example 1. Stress-strength model (Kotz et al., 2003) Let X ∼ Exp(α) and Y ∼ Exp(β) be random variables. Interest is

  • n

ψ = Pr(X < Y) = α/(α + β) with λ = α. Given random samples of size n and m from from X and Y respectively, we test H0 : ψ < 1/2 against H1 : ψ > 1/2, ∀λ > 0 assuming the following priors

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 8 / 15

slide-44
SLIDE 44

Example 1. Stress-strength model (Kotz et al., 2003) Let X ∼ Exp(α) and Y ∼ Exp(β) be random variables. Interest is

  • n

ψ = Pr(X < Y) = α/(α + β) with λ = α. Given random samples of size n and m from from X and Y respectively, we test H0 : ψ < 1/2 against H1 : ψ > 1/2, ∀λ > 0 assuming the following priors

π0(ψ) = U(0, 1/2)

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 8 / 15

slide-45
SLIDE 45

Example 1. Stress-strength model (Kotz et al., 2003) Let X ∼ Exp(α) and Y ∼ Exp(β) be random variables. Interest is

  • n

ψ = Pr(X < Y) = α/(α + β) with λ = α. Given random samples of size n and m from from X and Y respectively, we test H0 : ψ < 1/2 against H1 : ψ > 1/2, ∀λ > 0 assuming the following priors

π0(ψ) = U(0, 1/2) π1(ψ) = U(1/2, 1)

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 8 / 15

slide-46
SLIDE 46

Example 1. Stress-strength model (Kotz et al., 2003) Let X ∼ Exp(α) and Y ∼ Exp(β) be random variables. Interest is

  • n

ψ = Pr(X < Y) = α/(α + β) with λ = α. Given random samples of size n and m from from X and Y respectively, we test H0 : ψ < 1/2 against H1 : ψ > 1/2, ∀λ > 0 assuming the following priors

π0(ψ) = U(0, 1/2) π1(ψ) = U(1/2, 1) π(λ) = Gamma(1, 1)

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 8 / 15

slide-47
SLIDE 47

Example 1. Stress-strength model (Kotz et al., 2003) Let X ∼ Exp(α) and Y ∼ Exp(β) be random variables. Interest is

  • n

ψ = Pr(X < Y) = α/(α + β) with λ = α. Given random samples of size n and m from from X and Y respectively, we test H0 : ψ < 1/2 against H1 : ψ > 1/2, ∀λ > 0 assuming the following priors

π0(ψ) = U(0, 1/2) π1(ψ) = U(1/2, 1) π(λ) = Gamma(1, 1)

The MPL is Lmp(ψ) = −(n + m − 2) log

  • sx + sy

1−ψ ψ

  • + m log 1−ψ

ψ ,

where sx = xi and sy = yi.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 8 / 15

slide-48
SLIDE 48

Example 1. Stress-strength model (Kotz et al., 2003) Let X ∼ Exp(α) and Y ∼ Exp(β) be random variables. Interest is

  • n

ψ = Pr(X < Y) = α/(α + β) with λ = α. Given random samples of size n and m from from X and Y respectively, we test H0 : ψ < 1/2 against H1 : ψ > 1/2, ∀λ > 0 assuming the following priors

π0(ψ) = U(0, 1/2) π1(ψ) = U(1/2, 1) π(λ) = Gamma(1, 1)

The MPL is Lmp(ψ) = −(n + m − 2) log

  • sx + sy

1−ψ ψ

  • + m log 1−ψ

ψ ,

where sx = xi and sy = yi. Both BF ∗ and BF are evaluated numerically.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 8 / 15

slide-49
SLIDE 49

Example 1. Stress-strength model (Kotz et al., 2003) R R∗ λ = 0.5 λ = 2.5 λ = 0.5 λ = 2.5 ψ = 0.4 10% 33 % 21% 23 % ψ = 0.6 29% 22 % 21% 26 %

Table: Values of risks (with n = m = 5).

For some (ψ, λ), R < R∗ just because prior π(λ) = Gamma(1, 1) gives high probability to the “true” values of λ.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 9 / 15

slide-50
SLIDE 50

Example 1. Stress-strength model (Kotz et al., 2003) R R∗ λ = 0.5 λ = 2.5 λ = 0.5 λ = 2.5 ψ = 0.4 10% 33 % 21% 23 % ψ = 0.6 29% 22 % 21% 26 %

Table: Values of risks (with n = m = 5).

For some (ψ, λ), R < R∗ just because prior π(λ) = Gamma(1, 1) gives high probability to the “true” values of λ. However, R is much more sensitive to λ than R∗.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 9 / 15

slide-51
SLIDE 51

Example 1. Stress-strength model (Kotz et al., 2003)

0.2 0.4 0.6 0.8 0.5 1.0 1.5 2.0 2.5 3.0

BF Risk − BFP Risk, n=2

ψ λ −0.4 −0.2 0.0 0.2 0.2 0.4 0.6 0.8 0.5 1.0 1.5 2.0 2.5 3.0

BF Risk − BFP Risk, n=5

ψ λ −0.4 −0.3 −0.2 −0.1 0.0 0.1 0.2 0.2 0.4 0.6 0.8 0.5 1.0 1.5 2.0 2.5 3.0

BF Risk − BFP Risk, n=10

ψ λ −0.3 −0.2 −0.1 0.0 0.1 0.2 0.2 0.4 0.6 0.8 0.5 1.0 1.5 2.0 2.5 3.0

BF Risk − BFP Risk, n=20

ψ λ −0.2 −0.1 0.0 0.1 0.2

Figure: Values of R(ψ, λ) − R∗(ψ, λ) for n = m = 2, 5, 10, 20.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 10 / 15

slide-52
SLIDE 52

Example 2. Logistic Regression The logistic regression model has likelihood function

L(β) = exp   

n

  • i=1

yi

p

  • j=1

βjxij −

n

  • i=1

log

  • 1 + e

p

j=1 βjxij

   with β = (β1, . . . , βp) unknown regression coefficient and xij fixed constants, i = 1, . . . , n and j = 1, . . . , p.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 11 / 15

slide-53
SLIDE 53

Example 2. Logistic Regression The logistic regression model has likelihood function

L(β) = exp   

n

  • i=1

yi

p

  • j=1

βjxij −

n

  • i=1

log

  • 1 + e

p

j=1 βjxij

   with β = (β1, . . . , βp) unknown regression coefficient and xij fixed constants, i = 1, . . . , n and j = 1, . . . , p.

Assume ψ = βp and let λ = (β1, . . . , βp−1) be the nuisance parameter.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 11 / 15

slide-54
SLIDE 54

Example 2. Logistic Regression The logistic regression model has likelihood function

L(β) = exp   

n

  • i=1

yi

p

  • j=1

βjxij −

n

  • i=1

log

  • 1 + e

p

j=1 βjxij

   with β = (β1, . . . , βp) unknown regression coefficient and xij fixed constants, i = 1, . . . , n and j = 1, . . . , p.

Assume ψ = βp and let λ = (β1, . . . , βp−1) be the nuisance parameter. Interest on testing H0 : ψ > 0 against H1 : ψ < 0, ∀λ assuming the following priors:

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 11 / 15

slide-55
SLIDE 55

Example 2. Logistic Regression The logistic regression model has likelihood function

L(β) = exp   

n

  • i=1

yi

p

  • j=1

βjxij −

n

  • i=1

log

  • 1 + e

p

j=1 βjxij

   with β = (β1, . . . , βp) unknown regression coefficient and xij fixed constants, i = 1, . . . , n and j = 1, . . . , p.

Assume ψ = βp and let λ = (β1, . . . , βp−1) be the nuisance parameter. Interest on testing H0 : ψ > 0 against H1 : ψ < 0, ∀λ assuming the following priors:

π0(ψ) = TruncNormal(0, 1, ψ = 0, ψ = ∞);

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 11 / 15

slide-56
SLIDE 56

Example 2. Logistic Regression The logistic regression model has likelihood function

L(β) = exp   

n

  • i=1

yi

p

  • j=1

βjxij −

n

  • i=1

log

  • 1 + e

p

j=1 βjxij

   with β = (β1, . . . , βp) unknown regression coefficient and xij fixed constants, i = 1, . . . , n and j = 1, . . . , p.

Assume ψ = βp and let λ = (β1, . . . , βp−1) be the nuisance parameter. Interest on testing H0 : ψ > 0 against H1 : ψ < 0, ∀λ assuming the following priors:

π0(ψ) = TruncNormal(0, 1, ψ = 0, ψ = ∞); π1(ψ) = TruncNormal(0, 1, ψ = −∞, ψ = 0);

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 11 / 15

slide-57
SLIDE 57

Example 2. Logistic Regression The logistic regression model has likelihood function

L(β) = exp   

n

  • i=1

yi

p

  • j=1

βjxij −

n

  • i=1

log

  • 1 + e

p

j=1 βjxij

   with β = (β1, . . . , βp) unknown regression coefficient and xij fixed constants, i = 1, . . . , n and j = 1, . . . , p.

Assume ψ = βp and let λ = (β1, . . . , βp−1) be the nuisance parameter. Interest on testing H0 : ψ > 0 against H1 : ψ < 0, ∀λ assuming the following priors:

π0(ψ) = TruncNormal(0, 1, ψ = 0, ψ = ∞); π1(ψ) = TruncNormal(0, 1, ψ = −∞, ψ = 0); π(λ) = π(β1) · . . . · π(βp−1), π(βj) = N(0, 1012), j = 1, 2, . . . , p − 1.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 11 / 15

slide-58
SLIDE 58

Example 2. Logistic Regression - Urine data (Davison, Hinkley, 1997). For illustration, we analyze the presence/absence of calcium

  • xalate crystals in urine samples Y together with the values of

p = 6 quantitative covariates on n = 77 individuals.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 12 / 15

slide-59
SLIDE 59

Example 2. Logistic Regression - Urine data (Davison, Hinkley, 1997). For illustration, we analyze the presence/absence of calcium

  • xalate crystals in urine samples Y together with the values of

p = 6 quantitative covariates on n = 77 individuals. Assume that ψ is the coefficient of the effect of the variable urea concentration.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 12 / 15

slide-60
SLIDE 60

Example 2. Logistic Regression - Urine data (Davison, Hinkley, 1997). For illustration, we analyze the presence/absence of calcium

  • xalate crystals in urine samples Y together with the values of

p = 6 quantitative covariates on n = 77 individuals. Assume that ψ is the coefficient of the effect of the variable urea concentration. The weights of evidence (Good, 1985), W = log BF and

W ∗ = log BF ∗, are W and W ∗ BF based on L(β) 4.2 BF ∗ based on Lmp(ψ) 4.2 According to the Jeffreys’ scale, the evidence is substantial in favor of positive effect of urea concentration.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 12 / 15

slide-61
SLIDE 61

Example 2. Logistic Regression - p large. In order to assess the behaviour of BF ∗, with respect to large p, we evaluate the corresponding W ∗ in 1000 data sets with p = 20 coefficients

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 13 / 15

slide-62
SLIDE 62

Example 2. Logistic Regression - p large. In order to assess the behaviour of BF ∗, with respect to large p, we evaluate the corresponding W ∗ in 1000 data sets with p = 20 coefficients, with 5 positive, 5 negative and 10 zero coefficients.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Weight of Evidence based on Modified Profile Likelihood

Logistic Regression Coefficients Weight of Evidence −20 −10 10 20

Figure: Empirical mean W ∗ for the sign of 20 coefficients.

(Horizontal dashed lines are the levels of strong evidence).

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 13 / 15

slide-63
SLIDE 63

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-64
SLIDE 64

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning. For the analyzed examples we obtained that:

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-65
SLIDE 65

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning. For the analyzed examples we obtained that:

in general, R∗(ψ, λ) < R(ψ, λ)

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-66
SLIDE 66

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning. For the analyzed examples we obtained that:

in general, R∗(ψ, λ) < R(ψ, λ) R∗ is almost constant with respect to λ

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-67
SLIDE 67

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning. For the analyzed examples we obtained that:

in general, R∗(ψ, λ) < R(ψ, λ) R∗ is almost constant with respect to λ

Main advantages of BF ∗:

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-68
SLIDE 68

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning. For the analyzed examples we obtained that:

in general, R∗(ψ, λ) < R(ψ, λ) R∗ is almost constant with respect to λ

Main advantages of BF ∗:

avoid elicitation of priors on nuisance parameters

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-69
SLIDE 69

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning. For the analyzed examples we obtained that:

in general, R∗(ψ, λ) < R(ψ, λ) R∗ is almost constant with respect to λ

Main advantages of BF ∗:

avoid elicitation of priors on nuisance parameters no multidimensional integrations.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-70
SLIDE 70

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning. For the analyzed examples we obtained that:

in general, R∗(ψ, λ) < R(ψ, λ) R∗ is almost constant with respect to λ

Main advantages of BF ∗:

avoid elicitation of priors on nuisance parameters no multidimensional integrations.

BF ∗ may be also obtained for semi-parametric models or complex models, when the L(ψ, λ) is difficult or even impossible to compute,

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-71
SLIDE 71

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning. For the analyzed examples we obtained that:

in general, R∗(ψ, λ) < R(ψ, λ) R∗ is almost constant with respect to λ

Main advantages of BF ∗:

avoid elicitation of priors on nuisance parameters no multidimensional integrations.

BF ∗ may be also obtained for semi-parametric models or complex models, when the L(ψ, λ) is difficult or even impossible to compute,

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-72
SLIDE 72

Conclusions The use of pseudo-Bayes factor BF ∗ may be of potential interest in situations with many nuisance parameters having a weak physical meaning. For the analyzed examples we obtained that:

in general, R∗(ψ, λ) < R(ψ, λ) R∗ is almost constant with respect to λ

Main advantages of BF ∗:

avoid elicitation of priors on nuisance parameters no multidimensional integrations.

BF ∗ may be also obtained for semi-parametric models or complex models, when the L(ψ, λ) is difficult or even impossible to compute, in fact, it is possible to resort to quasi-profile likelihoods

(Ventura et al., 2010), empirical likelihoods and composite

likelihoods (Pauli et al., 2010).

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 14 / 15

slide-73
SLIDE 73

Some references

BRAZZALE, A.R., DAVISON, A.C. and REID, N. (2007): Applied Asymptotics. Cambridge University Press, Cambridge. DAVISON, A.C. and HINKLEY, D.V. (1997): Bootstrap Methods and Their Application. Cambridge University Press, Cambridge. GOOD, I.J. (1985): Weight of evidence: A brief survey. Bayesian Statistics 2, 249–269. KASS, R.E. and RAFTERY, A.E. (1995): Bayes factors. JASA 90 (430), 773–795. KOTZ, S., LUMELSKII, Y. and PENSKY, M. (2003): The Stress-Strength Model and its

  • Generalizations. World Scientific, Singapore.

PACE, L. and SALVAN, A. (1997): Principles of Statistical Inference from a Neo-Fisherian

  • Perspective. World Scientific, Singapore.

PAULI, F., VENTURA, L., and RACUGNO, W. (2010): Bayesian composite marginal likelihoods, Stat. Sinica, to appear. SEVERINI, T.A. (2000): Likelihood Methods in Statistics. Oxford University Press. VENTURA, L., CABRAS, S. and RACUGNO, W. (2009): Prior distributions from pseudo-likelihoods in the presence of nuisance parameters, JASA 104 (486), 768–777. VENTURA, L., CABRAS, S. and RACUGNO, W. (2010): Default prior distributions from quasi- and quasi-profile likelihoods. JSPI to appear.

Cabras et al. (CAG-PAD) Pseudo-Bayes Factors Paris, 2010 15 / 15